LOCUS AB000360 2582 bp DNA PRI 17-OCT-1997 DEFINITION Homo sapiens PIGC gene, complete cds. ACCESSION AB000360 NID g2547041 KEYWORDS PIGC; glycosylphosphatidylinositol-synthesis gene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hong,Y., Ohishi,K., Inoue,N., Endo,Y., Fujita,T., Takeda,J. and Kinoshita,T. TITLE Structures and chromosomal localizations of the glycosylphosphatidylinositol synthesis gene PIGC and its pseudogene PIGCP1 JOURNAL Genomics 44 (3), 347-349 (1997) MEDLINE 97468149 REFERENCE 2 (bases 1 to 2582) AUTHORS Hong,Y. TITLE Direct Submission JOURNAL Submitted (08-JAN-1997) to the DDBJ/EMBL/GenBank databases. Yeongjin Hong, Research Institute for Microbial Diseases, Immunoregulation; 3-1 Yamada-oka, Suita, Osaka 565, Japan (E-mail:kohishi@biken.osaka-u.ac.jp, Tel:81-6-879-8329, Fax:81-6-875-5233) FEATURES Location/Qualifiers source 1..2582 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q23-q25" exon 808..2266 gene 1101..1994 /gene="PIGC" CDS 1101..1994 /gene="PIGC" /standard_name="glycosylphosphatidylinositol-synthesis gene" /codon_start=1 /db_xref="PID:d1023736" /db_xref="PID:g2547042" /translation="MYAQPVTNTKEVKWQKVLYERQPFPDNYVDRRFLEELRKNIHAR KYQYWAVVFESSVVIQQLCSVCVFVVIWWYMDEGLLAPHWLLGTGLASSLIGYVLFDL IDGGEGRKKSGQTRWADLKSALVFITFTYGFSPVLKTLTESVSTDTIYAMSVFMLLGH LIFFDYGANAAIVSSTLSLNMAIFASVCLASRLPRSLHAFIMVTFAIQIFALWPMLQK KLKACTPRSYVGVTLLFAFSAVGGLLSISAVGAVLFALLLMSISCLCSFYLIRLQLFK ENIHGPWDEAEIKEDLSRFLS" mutation 1896 /gene="PIGC" /replace="c" polyA_signal 2246..2251 mutation 2259 /replace="t" repeat_region 2331..2356 /rpt_unit=gt BASE COUNT 694 a 494 c 581 g 813 t ORIGIN 1 ggatccctgc tgcagagggg gtaacggtgt ctggcttgcc aagcaatatt tgttgtggtc 61 tatcatggaa gaaataaagt cgggcaatat gaattttttt tttctcaaat ttgccggatg 121 gctgtggtgt ttctgactct tagttttctc attgtgaaaa aggaatgatt atcttcttcg 181 atcctctcaa gagtttcctt gttttgagta gattgatagc tctttaaagg atgctaagct 241 cagctaatgg aagaagagtc tagtttcttt gaggctttga ttttggttaa actatagagc 301 tcataccttt ctgtatggtg cagcttacta ttgtctttgg attggtaact taaaaaatac 361 aaataacatg cctttgagaa ccaataaaaa ctatggatat tatccctata aatttacaca 421 aatccagata taagcatgca atgtgatata cctaagggat atgtgaacca ctgagttaag 481 aactgcttta gagggagata caatgtgaga cacaggcttt gggataagac tttggtttga 541 atcctggctc tgctctgtta ccttagggca aagttactta agcatcttga atctcagctt 601 ttttaccaaa gcaggactaa tactaactta caaggtggtg aggattaagt gaaagaagat 661 acataaggca cttagcacat agtaggtact caataagcga tagctaacag atgtctatta 721 ttattcaagg aattataatt ttcaaatctg aaatgcagtt ttaatgtccc ataaggtgac 781 taccacatac atttttctca gacttttagt aaactgagtt gatttgactt tatctcagta 841 ctactcttga cctttcacaa ctttcgtagg ttcacagtct ctctttttct aggaacttgg 901 ctgtgttgtc ctgcctcaga gacaaattca tctattgtag gcctagcccc tgcctttgaa 961 aacaaggaaa ggttggtaga acatcaacac agcatggaat ttccagggag gtctcatttc 1021 aaaacttcat aaagaacaag aaccacctgg acttctgtga gggcgatgat taaactggcc 1081 tgagtttgaa tgaaaggata atgtatgctc aacctgtgac taacaccaag gaggtcaagt 1141 ggcagaaggt cttgtatgag cgacagccct ttcctgataa ctatgtggac cggcgattcc 1201 tggaagagct ccggaaaaac atccatgctc ggaaatacca atattgggct gtggtatttg 1261 agtccagtgt ggtgatccag cagctgtgca gtgtttgtgt ttttgtggtt atctggtggt 1321 atatggatga gggtcttctg gccccccatt ggcttttagg gactggcctg gcttcttcac 1381 tgattgggta tgttttgttt gatctcattg atggaggtga agggcggaag aagagtgggc 1441 agacccggtg ggctgacctg aagagtgccc tagtcttcat tactttcact tatgggtttt 1501 caccagtgct gaagaccctt acagagtctg tcagcactga caccatctat gccatgtcag 1561 tcttcatgct gttaggccat ctcatctttt ttgactatgg tgccaatgct gccattgtat 1621 ccagcacact atccttgaac atggccatct ttgcttctgt atgcttggca tcacgtcttc 1681 cccggtccct gcatgccttc atcatggtga catttgccat tcagattttt gccctgtggc 1741 ccatgttgca gaagaaacta aaggcatgta ctccccggag ctatgtgggg gtcacactgc 1801 tttttgcatt ttcagccgtg ggaggcctac tgtccattag tgctgtggga gccgtactct 1861 ttgcccttct gctgatgtct atctcatgtc tgtgttcatt ctacctcatt cgcttgcagc 1921 tttttaaaga aaacattcat gggccttggg atgaagctga aatcaaggaa gacttgtcca 1981 ggttcctcag ttaaattagg acatccatta cattattaaa gcaagctgat agattagcct 2041 cctaactagt atagaactta aagacagagt tccattctgg aagcagcatg tcattgtggt 2101 aagagaatag agatcaaaac caaaaaaaat gaaccaaagg cttgggtggt gagggtgctt 2161 atcctttctg ttattttgta gatgaaaaaa ctttctgggg acctcttgaa ttacatgctg 2221 taacatatga agtgatgtgg tttctattaa aaaaataaca catccatcaa gttgtctcat 2281 gatttttcca taaacaggag gcagacagag gggcatgaag agtgaagtaa gtgtgtgtgt 2341 gtgtgtgtgt gtgtgtaaag tcacttcttt ctaccctttt caatgtgcta atgctctttt 2401 atttatctag ggctcaaatc ttagaacaca gggtgctatg ctcagttttg ttgcccaaga 2461 tcacagaatt ggttacttaa ccttgactca gagtttctac cttgttctta gggaagcata 2521 tcacaactaa ttgcaaagca gagtgtgatg tgtcacaata agcagaatgc tagggggaat 2581 tc // LOCUS AB000732 4474 bp DNA PRI 28-JAN-1998 DEFINITION Homo sapiens gene for insulin receptor substrate-2, complete cds. ACCESSION AB000732 NID g2809058 KEYWORDS insulin receptor substrate-2; IRS-2. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ogihara,T., Isobe,T., Ichimura,T., Taoka,M., Funaki,M., Sakoda,H., Onishi,Y., Inukai,K., Anai,M., Fukushima,Y., Kikuchi,M., Yazaki,Y., Oka,Y. and Asano,T. TITLE 14-3-3 protein binds to insulin receptor substrate-1, one of the binding sites of which is in the phosphotyrosine binding domain JOURNAL J. Biol. Chem. 272 (40), 25267-25274 (1997) MEDLINE 97460123 REFERENCE 2 (bases 1 to 4474) AUTHORS Asano,T. TITLE Direct Submission JOURNAL Submitted (27-JAN-1997) to the DDBJ/EMBL/GenBank databases. Tomoichiro Asano, University of Tokyo, 3rd Department of Internal Medicine; 7-3-1 Hongo, Bunkyo-ku, Tokyo 113, Japan (E-mail:asano-tky@umin.u-tokyo.ac.jp, Tel:+81-3-3815-5411, Fax:+81-3-5803-1874) COMMENT Sequence updated (22-Jan-1998). FEATURES Location/Qualifiers source 1..4474 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3975 /gene="IRS-2" CDS 1..3975 /gene="IRS-2" /codon_start=1 /product="insulin receptor substrate-2" /db_xref="PID:d1025417" /db_xref="PID:g2809059" /translation="MASPPRHGPPGPASGDGPNLNNNNNNNNHSVRKCGYLRKQKHGH KRFFVLRGPGAGGDKATAGGGSAPQPPRLEYYESEKNWRSKAGAPKRVIALDCCLNIN KRADPKHKYLIALYTKDEYFAVAAENEQEQEGWYRALTDLVSEGRAAAGDAPPAAAPA ASCSASLPGAVGGSAGAAGAEDSYGLVAPATAAYREVWQVNLKPKGLGQSKNLTGVYR LCLSARTIGFVKLNCEQPSVTLQLMNIRRCGHSDSFFFIEVGRSAVTGPGELWMQADD SVVAQNIHETILEAMKALKELFEFRPRSKSQSSGSSATHPISVPGARRHHHLVNLPPS QTGLVRRSRTDSLAATPPAAKCSSCRVRTASEGDGGAAAGAAAAGARPVSVAGSPLSP GPVRAPLSRSHTLIGGCRAAGTKWHCFPAGGGLQHSRSMSMPVEHLPPAATSPGSLSS SSDHGWGSYPPPPGPHPLLPHPLHHGPGQRPSSGSASASGSPSDPGFMSLDEYGSSPG DLRAFCSHRSNTPESIAETPPARDGGGGGEFYGYMTMDRPLSHCGRSYRRVSGDAAQD LDRGLRKRTYSLTTPARQRPVPQPSSASLDEYTLMRATFSGSAGRLCPSCPASSPKVA YHPYPEDYGDIEIGSHRSSSSNLGADDGYMPMTPGAALAGSGSGSCRSDDYMPMSPAS VSAPKQILQPRAAAAAAAAVPFAGPAGPAPTFAAGRTFPASGGGYKASSPAESSPEDS GYMRMWCGSKLSMEHADGKLLPNGDYLNVSPSDAVTTGTPPDFFSAALHPGGEPLRGV PGCCYSSLPRSYKAPYTCGGDSDQYVLMSSPVGRILEEERLEPQATPGPTQAASAFGA GPTQPPHPVVPSPVRPSGGRPEGFLGQRGRAVRPTRLSLEGLPSLPSMHEYPLPPEPK SPGEYINIDFGEPGARLSPPAPPLLASAASSSSLLSASSPALSLGSGTPGTSSDSRQR SPLSDYMNLDFSSPKSPKPGAPSGHPVGSLDGLLSPEASSPYPPLPPRPSASPSSSLQ PPPPPPAPGELYRLPPASAVATAQGPGAASSLSSDTGDNGDYTEMAFGVAATPPQPIA APPKPEAARVASPTSGVKRLSLMEQVSGVEAFLQASQPPDPHRGAKVIRADPQGGRRR HSSETFSSTTTVTPVSPSFAHNPKRHNSASVENVSLRKSSEGGVGVGPGGGDEPPTSP RQLQPAPPLAPQGRPWTPGQPGGLVGCPGSGGSPMRRETSAGFQNGLKYIAIDVREEP GLPPQPQPPPPPLPQPGDKSSWGRTRSLGGLISAVGVGSTRGGCGGPGPGAPAPCPTT YAQH" BASE COUNT 720 a 1659 c 1408 g 687 t ORIGIN 1 atggcgagcc cgccgcggca cgggccgccc gggccggcga gcggagacgg ccccaacctc 61 aacaacaaca acaacaacaa caaccacagc gtgcgcaagt gcggctacct gcgcaagcag 121 aagcatggcc acaagcgctt cttcgtgctg cgcggacccg gcgcgggcgg cgacaaggcc 181 acggcgggcg gggggtcggc gccgcaaccg ccgcggctcg agtactacga aagcgaaaaa 241 aattggcgga gcaaggcagg cgcgccgaaa cgggtgatcg ctctcgactg ctgcctgaac 301 atcaacaagc gcgccgaccc caagcacaag tacctgatcg ccctctacac caaggacgag 361 tacttcgccg tggccgccga gaacgagcag gagcaggagg gctggtaccg cgcgctcacc 421 gacctggtca gcgagggccg cgcggccgcc ggagacgcgc cccccgccgc cgcgcccgcc 481 gcgtcctgca gcgcctccct gcccggcgcc gtgggcggtt ctgccggcgc cgccggggcc 541 gaggacagct acgggctggt ggctcccgcc acggccgcct accgtgaggt gtggcaggtg 601 aacctgaagc ccaagggtct gggccagagc aagaacctga cgggggtgta ccgtctgtgc 661 ctgtctgcgc gcaccatcgg cttcgtgaag ctcaactgcg agcagccgtc ggtgacgctg 721 cagctcatga acatccgccg ctgcggccac tcggacagct tcttcttcat cgaggtgggc 781 cgctcggccg tcacaggccc cggcgagctg tggatgcagg cggacgactc ggtggtggcg 841 cagaacatcc acgagaccat cctggaggcc atgaaggcgc tcaaggagct cttcgagttc 901 cggccgcgca gtaagagcca atcgtcgggg tcgtcggcca cgcaccccat cagcgtcccc 961 ggcgcgcgcc gccaccacca cctggtcaac ctgcccccca gccagacggg cctggtgcgc 1021 cgctcgcgca ccgacagcct ggccgccacc ccgccggcgg ccaagtgcag ctcgtgccgg 1081 gtgcgcaccg ccagcgaggg cgacggcggc gcggcggcgg gagcggcggc cgcgggcgcc 1141 aggccggtgt cggtggctgg gagccccctg agccccgggc cggtgcgcgc gcccctgagc 1201 cgctcgcaca ccctgatcgg cggctgccgg gccgcgggaa caaagtggca ttgcttcccg 1261 gcagggggcg gattgcaaca cagccgttcg atgtccatgc ccgtggagca tttgccgcca 1321 gccgccacca gcccgggttc cttgtcttcc agcagcgacc acggttgggg ttcttacccg 1381 ccgccgcccg gcccgcaccc gcttttgccg catccgttgc accacggccc cggccagcgg 1441 ccttccagcg gcagcgcttc cgcttcgggc tcccccagcg accccggttt catgtccctg 1501 gacgagtacg gctccagccc aggcgacctg cgcgccttct gcagccaccg aagcaacacg 1561 cccgagtcca tcgcggagac gcccccggcc cgagacggcg gcggcggcgg tgagttttac 1621 gggtacatga ccatggacag gcccctgagc cactgtggcc gctcctaccg ccgggtctcg 1681 ggggacgcgg cccaggacct ggaccgaggg ctgcgcaaga ggacctactc cctgaccacg 1741 ccagcccggc agcggccggt gccccagccc tcctctgcct cgctggatga atacaccctg 1801 atgcgggcca ccttctcggg cagcgcgggc cgcctctgcc cgtcctgccc cgcgtcctct 1861 cccaaggtgg cctaccaccc ctacccagag gactacggag acatcgagat cggctcccac 1921 aggagctcca gcagcaacct gggggcagac gacggctaca tgcccatgac gcccggcgcg 1981 gcccttgcgg gcagtgggag cggcagctgc aggagcgacg actacatgcc catgagcccc 2041 gccagcgtgt ccgcccccaa gcagattttg cagcccaggg ccgccgccgc cgccgccgcc 2101 gccgtgcctt ttgcggggcc tgcggggcca gcacccacct ttgcggcggg caggacattc 2161 ccggcgagtg ggggcggcta caaggccagc tcgcccgccg agagctcccc cgaggacagt 2221 gggtacatgc gcatgtggtg cggttccaag ctgtccatgg agcatgcaga tggcaagctg 2281 ctgcccaacg gggactacct caacgtgtcc cccagcgacg cggtcaccac gggcaccccg 2341 cccgacttct tctccgcagc cctgcacccc ggcggggagc cgctcagggg cgttcccggc 2401 tgctgctaca gctccttgcc ccgctcctac aaggccccct acacctgtgg cggggacagc 2461 gaccagtacg tgctcatgag ctcccccgtg gggcgcatcc tggaggagga gcgtctggag 2521 cctcaggcca ccccagggcc cacccaggcg gccagcgcct tcggggccgg ccccacgcag 2581 ccccctcacc ctgtagtgcc ttcgcccgtg cggcctagcg gcggccgccc ggagggcttc 2641 ttgggccagc gcggccgggc ggtgaggccc acgcgcctgt ccctggaggg gctgcccagc 2701 ctgcccagca tgcacgagta cccactgcca ccggagccca agagccccgg cgagtacatc 2761 aacatcgact ttggcgagcc cggggcccgc ctgtcgccgc ccgcgcctcc cctgctggcg 2821 tcggcggcct cgtcctcatc gctattgtcc gccagcagcc cggccttgtc gttgggctca 2881 ggcaccccgg gcaccagcag cgacagccgg cagcggtctc cgctctccga ctacatgaac 2941 ctcgacttca gctcccccaa gtctcctaag ccgggcgccc cgagcggcca ccccgtgggc 3001 tccttggacg gcctcctgtc ccccgaggcc tcctccccgt atccgccgtt gcccccgcgt 3061 ccgtccgcgt ccccgtcgtc gtctctgcag ccgccgccac cgccgccggc cccgggggag 3121 ctgtaccgcc tgccccccgc ctcggccgtt gccaccgccc agggcccggg cgccgcctca 3181 tcgttgtcct cggacaccgg ggacaatggt gactacaccg agatggcttt tggtgtggcc 3241 gccaccccgc cgcaacctat cgcggccccc ccgaagccag aagctgcccg cgtggccagc 3301 ccgacgtcgg gcgtgaagag gctgagcctc atggagcagg tgtcgggagt cgaggccttc 3361 ctgcaggcca gccagccccc ggacccccac cgcggcgcca aggtcatccg cgcagacccg 3421 caggggggcc gccgccgcca cagttccgag accttctcct ccaccacgac ggtcaccccc 3481 gtgtccccgt ccttcgccca caaccccaag cgccacaact cggcctccgt ggaaaatgtc 3541 tctctcagga aaagcagcga gggcggcgtg ggtgtcggcc ctggaggggg cgacgagccg 3601 cccacctccc cacgacagtt gcagccggcg ccccctttgg caccgcaggg ccggccgtgg 3661 accccgggtc agcccggggg cttggtcggt tgtcctggga gcggtggatc gcccatgcgc 3721 agagagacct ctgccggttt ccagaatggt ctcaagtaca tcgccatcga cgtgagggag 3781 gagcccgggc tgccacccca gccgcagccg ccgccgccgc cgcttcctca gccgggagac 3841 aagagctcct ggggccggac ccgaagcctc gggggtctca tcagcgctgt gggcgtcggc 3901 agcacccgcg gcgggtgcgg ggggccgggt cccggtgccc ctgccccctg cccaacaacc 3961 tacgcccagc attgacttct tgtcccacca cttgaaggag gccaccattg tgaaagagtg 4021 aagatctgtc tggctttatc accaggatgt cacatgtcag agaatatcat taaaagaaga 4081 cgctcagcac tgtttcagcc cgaagctgct tgcagttttc ttttggatct gagcaatgac 4141 tgtgtttgga aacatctgtg gactctgtta gatgaggcac caacaaggca aggtcacctg 4201 cctctttccc ttgttcccgg atggggcatt catcattgtg ctgtttgcgt tttgttttgt 4261 tttgttttaa caaaattagc tgaagaagtt attctcaaga aaattggatg ttttcattgg 4321 ccttcttaaa ttgtggccag tgtcttttaa tttcttcttc ttttcctttt ggcaaagcag 4381 atataaccct cagcatgcta ggagagtgca cccgtacact atggaagtgg taaaatctgg 4441 tatttactgg cttacactca aaacgaccac agtc // LOCUS AB000813 2294 bp DNA PRI 13-MAY-1997 DEFINITION Human DNA for BMAL1c, complete cds. ACCESSION AB000813 NID g2094736 KEYWORDS BMAL1c. SOURCE Homo sapiens male brain DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2294) AUTHORS Ikeda,M. TITLE Direct Submission JOURNAL Submitted (30-JAN-1997) to the DDBJ/EMBL/GenBank databases. Masaaki Ikeda, Saitama Medical School, Department of Physiology; 38 Morohongo, Iruma-gun, Saitama 350-04, Japan (E-mail:mikeda@saitama-med.ac.jp, Tel:+81-492-76-1150, Fax:+81-492-95-5573) REFERENCE 2 (bases 1 to 2294) AUTHORS Ikeda,M. and Nomura,M. TITLE cDNA Cloning and Tissue-specific Expression of a Novel Basic Helix-Loop -Helix/PAS protein (BMAL1) and Identification of Its Alternatively Spliced Varia nts with Alternative Translation Initiation Site Usage JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Ikeda,M. and Nomura,M. TITLE cDNA cloning and tissue-specific expression of a novel basic helix-loop-helix/PAS protein (BMAL1) and identification of alternatively spliced variants with alternative translation initiation site usage JOURNAL Biochem. Biophys. Res. Commun. 233 (1), 258-264 (1997) MEDLINE 97289529 FEATURES Location/Qualifiers source 1..2294 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="brain" CDS 146..691 /codon_start=1 /product="BMAL1c" /db_xref="PID:d1020726" /db_xref="PID:g2094737" /translation="MINIESMDTDKDDPHGRLEYTEHQGRIKNAREAHSQIEKRRRDK MNSFIDELASLVPTCNAMSRKLDKLTVLRMAVQHMKTLRGATNPYTEANYKPTFLSDD ELKHLILRAADGFLFVVGCDRGKILFVSESVFKILNYSQNDLIGQSLFDYLHPKDIAK VKEQLSSSDTAPRERLIDAKR" BASE COUNT 687 a 500 c 520 g 587 t ORIGIN 1 gccccaccga cctgctttcc agctctcttg gtaccagtgg tgtggattgc aaccgcaaac 61 ggaaaggcag ctccactgac taccattcac aggtcgaatt tggggagcac aatggctgga 121 ggtcagatgc ccactaggag atgctatgat taatatagaa agcatggaca cagacaaaga 181 tgaccctcat ggaaggttag aatatacgga acaccaagga aggataaaaa atgcaaggga 241 agctcacagt cagattgaaa agcggcgtcg ggataaaatg aacagtttta tagatgaatt 301 ggcttctttg gtaccaacat gcaacgcaat gtccaggaaa ttagataaac ttactgtgct 361 aaggatggct gttcagcaca tgaaaacatt aagaggtgcc accaatccat acacagaagc 421 aaactacaaa ccaacttttc tatcagacga tgaattgaaa cacctcattc tcagggcagc 481 agatggattt ttgtttgtcg taggatgtga ccgagggaag atactctttg tctcagagtc 541 tgtcttcaag atcctcaact acagccagaa tgatctgatt ggtcagagtt tgtttgacta 601 cctgcatcct aaagatattg ccaaagtcaa ggagcagctc tcctcctctg acaccgcacc 661 ccgggagcgg ctcatagatg caaaaagatg aagtgtaaca ggccttcagt aaaggttgaa 721 gacaaggact tcccctctac ctgctcaaag aaaaaagcag atcgaaaaag cttctgcaca 781 atccacagca caggctattt gaaaagctgg ccacccacaa agatggggct ggatgaagac 841 aacgaaccag acaatgaggg gtgtaacctc agctgcctcg tcgcaattgg acgactgcat 901 tctcatgtag ttccacaacc agtgaacggg gaaatcaggg tgaaatctat ggaatatgtt 961 tctcggcacg cgatagatgg aaagtttgtt tttgtagacc agagggcaac agctattttg 1021 gcatatttac cacaagaact tctaggcaca tcgtgttatg aatattttca ccaagatgac 1081 ataggacatc ttgcagaatg tcataggcaa gttttacaga cgagagaaaa aattacaact 1141 aattgctata aatttaaaat caaagatggt tcttttatca cactacggag tcgatggttc 1201 ggtttcatga acccttggac caaggaagta gaatatattg tctcaactaa cactgttgtt 1261 ttagccaacg tcctggaagg cggggaccca accttcccac agctcacagc atccccccac 1321 agcatggaca gcatgctccc ctctggagaa ggtggcccaa agaggaccca ccccactgtt 1381 ccagggattc cagggggaac ccgggctggg gcaggaaaaa taggccgaat gattgctgag 1441 gaaatcatgg aaatccacag gataagaggg tcatcgcctt ctagctgtgg ctccagccca 1501 ttgaacatca cgagtacgcc tccccctgat gcctcttctc caggaggcaa gaagatttta 1561 aatggaggga ctccagacat tccttccagt ggcctactat caggccaggc tcaggagaac 1621 ccaggttatc catattctga tagttcttct attcttggtg agaaccccca cataggtata 1681 gacatgattg acaaccacca aggatcaagt agtcccagta atgatgaggc agcaatggct 1741 gtcatcatga gcctcttgga agcagatgct ggactgggtg gccctgttga ctttagtgac 1801 ttgccatggc cgctgtaaac actacatgtt gctttggcaa cagctatagt atcaaagtgc 1861 attagtggtg gagttttaca gtctgtgaag cttactggat aaggagagaa tagcttttat 1921 gtactgactt cataaaagcc atctcagagc cattgataca agtcaatctt actatatgta 1981 acttcagaca aagtggaact aagcctgctc cagtgtttcc tcaccattga ttattgggct 2041 agctgtggat cgcttgcatt aattgtatat tttggattct gtttgtgttg aattttttaa 2101 tcattgtgca cagaagcatc attggtagct tttatatgca aatggtcatc tcagatgtat 2161 ggtgttttta cactacaaag aagtccccca tgtggatatc tcttatacta attgtatcat 2221 aaagccgttt attcttcctt gtaagaatcc tttactataa atatgggtta aagtataatg 2281 tactagacag ttaa // LOCUS AB001835 1679 bp DNA PRI 17-MAR-1997 DEFINITION Human DNA for Brain-1, complete cds. ACCESSION AB001835 NID g1902885 KEYWORDS Brain-1. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Sumiyama,K., Washio-Watanabe,K., Saitou,N., Hayakawa,T. and Ueda,S. TITLE Class III POU genes: generation of homopolymeric amino acid repeats under GC pressure in mammals JOURNAL J. Mol. Evol. 43 (3), 170-178 (1996) MEDLINE 96359175 REFERENCE 2 (bases 1 to 1679) AUTHORS Ueda,S. TITLE Direct Submission JOURNAL Submitted (12-MAR-1997) to the DDBJ/EMBL/GenBank databases. Shintaroh Ueda, Graduate School of Science,University of Tokyo, Department of Biological Sciences; 7-3-1 Hongo,Bunkyo-Ku,Tokyo, Hongo, Tokyo 113, Japan (Tel:03-3812-2111(ex.4486), Fax:03-3818-7547) FEATURES Location/Qualifiers source 1..1679 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 60..1562 /codon_start=1 /product="Brain-1" /db_xref="PID:d1020216" /db_xref="PID:g1902886" /translation="MATAASNPYLPGNSLLAAGSIVHSDAAGAGGGGGGGGGGGGGGA GGGGGGMQPGSAAVTSGAYRGDPSSVKMVQSDFMQGAMAASNGGHMLSHAHQWVTALP HAAAAAAAAAAAAVEASSPWSGSAVGMAGSPQQPPQPPPPPPQGPDVKGGAGRDDLHA GTALHHRGPPHLGPPPPPPHQGHPGGWGAAAAAAAAAAAAAAAAHLPSMAGGQQPPPQ SLLYSQPGGFTVNGMLSAPPGPGGGGGGAGGGAQSLVHPGLVRGDTPELAEHHHHHHH HAHPHPPHPHHAQGPPHHGGGGGGAGPGLNSHDPHSDEDTPTSDDLEQFAKQFKQRRI KLGFTQADVGLALGTLYGNVFSQTTICRFEALQLSFKNMCKLKPLLNKWLEEADSSTG SPTSIDKIAAQGRKRKKRTSIEVSVKGALESHFLKCPKPSAQEITNLADSLQLEKEVV RVWFCNRRQKEKRMTPPGIQQQTPDDVYSQVGTVSADTPPPHHGLQTSVQ" BASE COUNT 255 a 648 c 615 g 161 t ORIGIN 1 ctgctgctgc ggcggcggcg gcggtggtgg cggcggtggg gtggcgggag cggagcggca 61 tggccacggc ggcttctaac ccctacctgc cggggaacag cctgctcgcg gccggctcta 121 ttgtgcactc ggacgcggca ggggctggcg gcggcggggg tggcggcggc ggcggcggcg 181 ggggcggcgc agggggcggg ggcggcggca tgcagccggg cagcgccgcc gtgacctcgg 241 gcgcctaccg gggggacccg tcctctgtca agatggtcca gagcgacttc atgcaggggg 301 ccatggccgc cagcaacggc ggccatatgc tgagccacgc gcaccagtgg gtcacagccc 361 tgccccacgc cgccgccgcc gccgccgctg ccgccgccgc cgccgtggag gcgagctcgc 421 cgtggtcggg cagcgccgtg ggcatggctg gcagccccca gcagccaccg cagccgccgc 481 cgccaccgcc gcagggcccc gacgtgaagg gcggcgccgg gcgcgacgac ctgcacgcgg 541 gcacagcgct gcaccaccgc gggccgccgc acctcggacc cccgccgccg cccccacacc 601 agggccaccc tgggggctgg ggggcggccg ccgctgccgc agccgcagcc gccgccgccg 661 ccgccgccgc gcacctcccg tccatggccg ggggccagca gccgccgccg cagagtctgc 721 tctactcgca gcccggaggc ttcacggtga acggcatgct gagcgcgcca ccggggcccg 781 gcggcggcgg cggcggcgcg ggcggtggag cccagagctt ggtgcacccg gggctggtgc 841 gcggggacac gccagagctg gccgagcacc accaccacca ccaccaccac gcgcatcctc 901 acccgccgca cccgcaccac gcgcagggac ccccgcacca cggcggcggc ggcggcggcg 961 cggggcctgg actcaacagc cacgacccgc actcggacga ggacacgccg acgtcggacg 1021 acctggagca gttcgccaag cagttcaagc agcggcgcat caagctgggc ttcacgcagg 1081 ccgacgtggg gttggcgctg ggcacactct acggcaacgt gttctcgcag accaccatct 1141 gccgcttcga ggccctgcag ctgagcttca agaacatgtg caagctcaag ccgctgctga 1201 acaagtggct ggaggaggcg gactcaagca ccggcagccc cacaagcatc gacaagatcg 1261 cggcgcaggg ccgcaagcgc aagaagcgga cctctatcga ggtgagcgtc aagggcgcgc 1321 tggagagcca cttcctcaag tgccccaagc cctccgcgca ggagatcacc aacctggccg 1381 acagcctgca gctcgagaag gaggtggtgc gggtctggtt ctgcaatcgg cgccaaaagg 1441 agaagcgcat gacgccgccc gggatccaac agcagacgcc cgacgacgtc tactcgcagg 1501 tgggcaccgt gagcgccgac acgccgccgc ctcaccacgg gctgcagacg agcgttcagt 1561 gaagccaggg cgcagagcga agagtgccgc cgccgccgcc gcctccgcag ccgccgtcag 1621 caccgccgcc gcccctgccg ccgccgccgc cgccgccgcc gccgctgccg ccgccgcgc // LOCUS AB004859 1098 bp DNA PRI 24-JUN-1997 DEFINITION Human gene for alpha(1,2)fucosyltransferase, complete cds. ACCESSION AB004859 NID g2217920 KEYWORDS alpha(1,2)fucosyltransferase. SOURCE Homo sapiens (isolate:Japanese) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1098) AUTHORS Kaneko,M. TITLE Direct Submission JOURNAL Submitted (16-JUN-1997) to the DDBJ/EMBL/GenBank databases. Mika Kaneko, Institute of Life Science, Soka University, Division of Cell Biology; Tangi-cho 1-236, Hachioji, Tokyo 192, Japan (E-mail:mika@scc1.t.soka.ac.jp, Tel:+81-426-91-2495, Fax:+81-426-91-9315) REFERENCE 2 (sites) AUTHORS Kaneko,M., Nishihara,S., Shinya,N., Kudo,T., Iwasaki,H., Seno,T., Okubo,Y. and Narimatsu,H. TITLE Wide variety of point mutations in the H gene of Bombay and para-Bombay individuals that inactivate H enzyme JOURNAL Blood 90 (2), 839-849 (1997) MEDLINE 97369744 FEATURES Location/Qualifiers source 1..1098 /organism="Homo sapiens" /isolate="Japanese" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" gene 1..696 /gene="h1" CDS 1..696 /gene="h1" /codon_start=1 /product="alpha(1,2)fucosyltransferase" /db_xref="PID:d1021392" /db_xref="PID:g2217921" /translation="MWLRSHRQLCLAFLLVCVLSVIFFLHIHQDSFPHGLGLSILCPD RRLVTPPVAIFCLPGTAMGPNASSSCPQHPASLSGTWTVYPNGRFGNQMGQYATLLAL AQLNGRRAFILPAMHAALAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYADLRDP FLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFVGVH VRRGDYLQVMPQR" mutation 695 /gene="h1" /note="nonsense mutation" BASE COUNT 185 a 368 c 305 g 240 t ORIGIN 1 atgtggctcc ggagccatcg tcagctctgc ctggccttcc tgctagtctg tgtcctctct 61 gtaatcttct tcctccatat ccatcaagac agctttccac atggcctagg cctgtcgatc 121 ctgtgtccag accgccgcct ggtgacaccc ccagtggcca tcttctgcct gccgggtact 181 gcgatgggcc ccaacgcctc ctcttcctgt ccccagcacc ctgcttccct ctccggcacc 241 tggactgtct accccaatgg ccggtttggt aatcagatgg gacagtatgc cacgctgctg 301 gctctggccc agctcaacgg ccgccgggcc tttatcctgc ctgccatgca tgccgccctg 361 gccccggtat tccgcatcac cctgcccgtg ctggccccag aagtggacag ccgcacgccg 421 tggcgggagc tgcagcttca cgactggatg tcggaggagt acgcggactt gagagatcct 481 ttcctgaagc tctctggctt cccctgctct tggactttct tccaccatct ccgggaacag 541 atccgcagag agttcaccct gcacgaccac cttcgggaag aggcgcagag tgtgctgggt 601 cagctccgcc tgggccgcac aggggaccgc ccgcgcacct ttgtcggcgt ccacgtgcgc 661 cgtggggact atctgcaggt tatgcctcag cgctagaagg gtgtggtggg cgacagcgcc 721 tacctccggc aggccatgga ctggttccgg gcacggcacg aagcccccgt tttcgtggtc 781 accagcaacg gcatggagtg gtgtaaagaa aacatcgaca cctcccaggg cgatgtgacg 841 tttgctggcg atggacagga ggctacaccg tggaaagact ttgccctgct cacacagtgc 901 aaccacacca ttatgaccat tggcaccttc ggcttctggg ctgcctacct ggctggcgga 961 gacactgtct acctggccaa cttcaccctg ccagactctg agttcctgaa gatctttaag 1021 ccggaggcgg ccttcctgcc cgagtgggtg ggcattaatg cagacttgtc tccactctgg 1081 acattggcta agccttga // LOCUS AB007828 3300 bp DNA PRI 09-OCT-1997 DEFINITION Homo sapiens gene for necdin, complete cds. ACCESSION AB007828 NID g2516265 KEYWORDS NDN; necdin. SOURCE Homo sapiens female leukocytes DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nakada,Y., Taniura,H., Uetsuki,T., Inazawa,J. and Yoshikawa,K. TITLE Structure, expression, and chromosomal localization of the human necdin gene JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 3300) AUTHORS Yoshikawa,K. TITLE Direct Submission JOURNAL Submitted (02-OCT-1997) to the DDBJ/EMBL/GenBank databases. Kazuaki Yoshikawa, Institute for Protein Research, Osaka University, Div. Regulation of Macromolecular Functions; Yamadaoka 3-2, Suita, Osaka 565, Japan (E-mail:yoshikaw@protein.osaka-u.ac.jp, Tel:06-879-8621, Fax:06-879-8623) FEATURES Location/Qualifiers source 1..3300 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocytes" /sex="female" promoter 1191..1335 /function="postmitotic neuron-restrictive core promoter" prim_transcript 1368..3263 gene 1454..2419 /gene="NDN" CDS 1454..2419 /gene="NDN" /function="postmitotic neuron-specific growth suppressor" /codon_start=1 /product="necdin" /db_xref="PID:d1023528" /db_xref="PID:g2516266" /translation="MSEQSKDLSDPNFAAEAPNSEVHSSPGVSEGVPPSATLAEPQSP PLGPTAAPQAAPPPQAPNDEGDPKALQQAAEEGRAHQAPSAAQPGPAPPAPAQLVQKA HELMWYVLVKDQKKMIIWFPDMVKDVIGSYKKWCRSILRRTSLILARVFGLHLRLTSL HTMEFALVKALEPEELDRVALSNRMPMTGLLLMILSLIYVKGRGARESAVWNVLRILG LRPWKKHSTFGDVRKLITEEFVQMNYLKYQRVPYVEPPEYEFFWGSRASREITKMQIM EFLARVFKKDPQAWPSRYREALEEARALREANPTAHYPRSSVSED" polyA_signal 3254..3259 BASE COUNT 856 a 842 c 807 g 795 t ORIGIN 1 aagcttaaga gtcctgttgg agggactggt gtggtaatgg ctctgcaaaa gtgttatgtg 61 cgtgcaaacc caaagagaga aagcacagaa aacctttcaa catcaacctg cttgaggaaa 121 aataaagtgg gaaaagatac atactcacag tgaggactct agacatgtca agacaatttt 181 taaatatgct tttggcttcg agtggcaata actagattca agacagcata tttaagaagc 241 tgctgatgag aagaaacccg ggaagagctg aaggaccaca tcagcccaga ccaaggatgc 301 tgaagcagca ttaaggtccc tggtttcaga tgctcaggca atgacccttt ttttcatgga 361 gagcctgtag gagtgacagt tttgtctttg cccactggga atctgttttc catacctgga 421 aaacagggtt acctatgttt cccctgctac cctttggtca tctcagagac actaccagat 481 attacccatg ggacctattt tttttttaaa tctcaggaaa gacttgggtg tggcttccaa 541 cgtggaggac tcagtagctt cagagagggt cctgagagaa ggtgaattga agaatgaggg 601 tgctgggcag agggaaaaga cattatcata caagtttgtg ctaaaagata tagcaatcct 661 tctgctatgg actaagtatg gaaaaaaata aaatggaatc aaagttaccc aaaggaagtg 721 taaaacccaa atttatgccc gttaaagcat taatgatgct ctaagtccac tgcctactta 781 aaaagttcat agttcacatg ggttgatagg aaattacgtt aacgacacac tgcatttccc 841 cttttcttat agcctatctg atttggtagg gagtcgatca ttttttattg gaatttctca 901 ggattccaac ctcagacatc cactttacag tttacacatt ttcttggaca agcccgactg 961 ttcctctcac tggttcgcat aaagctcatg tttacaaagc cgcccagacc tttctctggg 1021 actctcatat ttaaattaat tctggatata cccaggtaag cgtttcccaa gaaacttgac 1081 cccaacatcc caaaaactta aggtatcttt cccttaaact ggccccttct ccagtacgca 1141 tccatctcac ttctctcctg ccctacatct tctcagccca aacaggaaac cccgggatcg 1201 ctctcccagc aggtgaagcc tcgccatgga ccctccccgt cgggccccgc gctgccccgc 1261 ccgcccccag ccgctggcca aggccgcggt cgcgcaggcg cagtgccgcg tcccgccgcc 1321 gccccgccct gcccgtcgct gcggaaggcg ctgcgccagc aacgcgcact tcctctccag 1381 gaatccgcgg agggagcgca ggctcgaaga gctcctggac gcagaggccc tgcccttgcc 1441 agacggcgca gacatgtcag aacaaagtaa ggatctgagc gaccctaact ttgcagccga 1501 ggcccccaac tccgaggtgc acagcagccc tggggtttcg gagggggttc ctccgtccgc 1561 gaccctggca gagccgcaga gccctcctct aggcccgacg gccgctccgc aggccgcgcc 1621 gcctccccag gccccgaacg acgagggcga cccgaaggcc ctgcagcagg ctgcggagga 1681 gggccgcgcc caccaggccc cgagcgcggc ccagccgggc ccggcaccgc cagccccggc 1741 gcagctggtg cagaaggcgc acgagctcat gtggtacgtg ctggtcaagg accagaagaa 1801 gatgatcatc tggtttccag acatggtgaa agatgtcatc ggcagctaca agaagtggtg 1861 caggagcatc ctccggcgca ccagcctcat cctcgcccgg gtgttcgggc tgcacctgag 1921 gctaaccagc ctgcacacca tggagtttgc gctggtcaaa gcgctggagc ccgaggagct 1981 ggacagggtg gcgctgagca accgcatgcc catgacaggc ctcctgctca tgatcctgag 2041 cctcatctac gtgaagggcc gcggcgccag agagagcgcc gtctggaacg tgctgcgcat 2101 cctggggctg cggccctgga agaagcactc caccttcggg gacgtgcgga agctcatcac 2161 tgaggagttc gtccaaatga attacctgaa gtaccagcgc gtcccatacg tggagccgcc 2221 cgaatacgag ttcttttggg gctcccgggc cagccgcgaa atcaccaaga tgcaaatcat 2281 ggagttcctg gccagggtct ttaagaaaga cccccaggcc tggccctccc gatacagaga 2341 agctctggag gaggccagag ctctgcggga ggctaatccc actgcccact accctcgcag 2401 cagtgtctct gaggactagc aaagtctgga ggcagatgaa tggtttctga ccctcaccag 2461 ggctgtggaa gggtgggggt gggtcattat agtattcagg atttacagtg cagtattcac 2521 gtgtaacttt taagttttca gtacagtgct tttatacctt taatgcaatg ttgtattcat 2581 ttgggtacta ttgtgtagta tttaggatgt atgcatgttt gtttatatgt aagcttggtt 2641 ggtgctttcg cttttgtgct acctttcttg gatttttgta ccagagatgt gctaaactga 2701 tgaaatacat tgagaaagtt tccatcttat tcttttatat gggactgatg atgtgtgttg 2761 gggtagactg ctcctgcaga gtttggaaga agtcaccagc aaagccggcc taaccaagaa 2821 aagtcaaggc ccttcatgac cttgctgggc acagaaaaca ccctcgtgga gtacactaat 2881 ttgaactgga ctggtctcag tgtgagcact tggcacactt tactaaacac atatacaacc 2941 ccaccgtgag tcaactttaa agtaaacatt aaagattctt gtgatacaat catttttgga 3001 aaagtgtact ttatcatttt aacaaagcag tatggttggg aatgagacaa ttctctattt 3061 tacagtgtat acagatacaa ctatttcccc taatagggtg ggaaaaatcg ctactcatga 3121 ttactcctaa atttgtgaag tttatagttc tattgtcttt aaatgtaact catgtttatt 3181 tcaaaaacat tcacaaatat agaaaagtat acaaaacaaa acagtaagat tgtctgtaat 3241 cacatcatat gggaataaaa aacaaaaata atttccttcc cttaagtttc tacattttat // LOCUS AC003002 81786 bp DNA PRI 08-OCT-1997 DEFINITION Human DNA from overlapping chromosome 19-specific cosmids R29515 and R28253, genomic sequence, complete sequence. ACCESSION AC003002 NID g2494139 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 81786) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 500 kb ZNF gene family- containing human contig in 19q13.4 JOURNAL Unpublished REFERENCE 2 (bases 1 to 81786) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (08-OCT-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT Map and sequence oriented from centromere to telomere. Clones overlap cosmid R30217 to the right. FEATURES Location/Qualifiers source 1..81786 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R29515 from bases 1- 42,876 and R28253 from bases 41,773- 81786" /chromosome="19" /map="19q13.4 from D19S303 to ZNF134" /cell_line="5HL2-B" /cell_type="fibroblast" /clone_lib="LL19NC03 R chromosome 19-specific cosmid library" /note="cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid UV5HL9-5B, which carries chromosome 19 as its only human chromosome." misc_feature 754..1071 /note="BLASTN similarity to Z62624 CpG clone HS70G1R (1..318); match: 0.99, score: 8.9e-125; 99% identity; database searched: nr" misc_feature complement(1608..1798) /note="BLASTN similarity to Z62625 CpG clone HS70G2F; P = 1.2e-70; Identities = 191/199 (95%)." misc_feature 2392..2472 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 88.000" misc_feature 3462..3588 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 90.000~~BLASTX similarity to (5..48); match: 0.61, score: 6.5e-08; database searched: nr; KRAB-domain- containing zinc finger protein ZNF45 - human (fragment) >gi|186633 (M67509) ORF [Homo sapiens]." repeat_region complement(3735..4029) /rpt_family="Alu" misc_feature 5882..7687 /note="DPS similarity to P52740|Z132_HUMAN ZINC FINGER PROTEIN 132; gi|488551 (U09411) zinc finger protein ZNF132 [Homo sapiens] (1..589); Score: 1751 Identity: 323/589 (54%).~" misc_feature complement(9636..9873) /note="BLASTX similarity to (152..235); match: 0.53, score: 4.2e-07; database searched: nr; probable pol polyprotein-related protein 4 - rat gi|56590 (X53581) ORF4 gene product [Rattus norvegicus]" repeat_region complement(9898..10089) /rpt_family="Alu" misc_feature complement(10194..10261) /note="BLASTX similarity to (1101..1152); match: 0.38, score: 2.8e-07; database searched: nr; line-1 protein ORF2 - human" repeat_region complement(10317..10572) /rpt_family="Alu" repeat_region complement(10893..11193) /rpt_family="Alu" repeat_region 11670..11966 /rpt_family="Alu" misc_feature 13244..13439 /note="DDS similarity to N24366 yx14c04.r1 Homo sapiens cDNA clone 261702 5' similar~ to SP:YB9B_YEAST P38334 HYPOTHETICAL 19.7 KD PROTEIN IN SRB6-RIB5 INTERGENIC ; (1..195). Identity: (97%).~~Other overlapping matches:~T89537 ye04c07.r1 Homo sapiens cDNA clone 116748 5' (1..167); 100% identity." repeat_region 14197..14483 /rpt_family="Alu" misc_feature 14551..14946 /note="DDS similarity to N24366 yx14c04.r1 Homo sapiens cDNA clone 261702 5' similar to SP:YB9B_YEAST P38334 HYPOTHETICAL 19.7 KD PROTEIN IN SRB6-RIB5 INTERGENIC; (196..485); 99.7% identity.~~Other overlapping matches:~T89537 ye04c07.r1 Homo sapiens cDNA clone 116748 5' (167..554).~~N40075 yx98e03.r1 Homo sapiens cDNA clone 269788 5' (1..365) ;.Score: 709 Identity: 361/365 (98%)~~H99119 yx14c04.s1 Homo sapiens cDNA clone 261702 3' (1..429).Score: 836 Identity: 426/429 (99%)~~AA195608 zr38a02.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 665642 3' (1..395); Score: 713 Identity: 384/395 (97%).~~T89449 ye04c07.s1 Homo sapiens cDNA clone 116748 3' (1..426); Score: 681 Identity: 399/426 (93%)." CDS 14570..14992 /note="Hypothetical 16.6kDa protein most similar to hypothetical C. elegans ORF; DDS similarity to gi|1946954 (U97552) W05H7.3 gene product [Caenorhabditis elegans] (1..141).~Score: 389 Identity: 76/141 (53%)~~BLASTP similarity to sp|P38334|YB9B_YEAST HYPOTHETICAL 19.7 KD PROTEIN IN SRB6-RIB5 INTERGENIC REGION; Expect = 8e-21, Identities = 57/170 (33%)" /codon_start=1 /product="R29515_1" /db_xref="PID:g2494140" /translation="MSGSFYFVIVGHHDNPVFEMEFLPAGKAESKDDHRHLNQFIAHA ALDLVDENMWLSNNMYLKTVDKFNEWFVSAFVTAGHMRFIMLHDIRQEDGIKNFFTDV YDLYIKFSMNPFYEPNSPIRSSAFDRKVQFLGKKHLLS" misc_feature 14934..15045 /note="BLASTN similarity to D20534 (1..112); match: 0.98, score: 8.1e-38; database searched: est; Human HL60 3'directed MboI cDNA, HUMGS01509, clone pm2818." repeat_region complement(15246..15556) /rpt_family="Alu" repeat_region complement(15599..15887) /rpt_family="Alu" repeat_region complement(15955..16446) /rpt_family="Alu" repeat_region complement(19551..19635) /rpt_family="THE1" repeat_region complement(19642..19918) /rpt_family="Alu" repeat_region complement(20687..20976) /rpt_family="Alu" misc_feature 21491..21644 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 84.000~~BLASTX similarity to (8..59); match: 0.53, score: 2.7e-07; database searched: nr; KRAB-domain- containing zinc finger protein ZNF45 - human (fragment) >gi|186633 (M67509) ORF [Homo sapiens]" repeat_region complement(22785..23157) /rpt_family="THE1" repeat_region complement(23212..24758) /rpt_family="MSTAR" repeat_region 24816..25415 /rpt_family="Alu" repeat_region complement(25421..25610) /rpt_family="THE1" misc_feature 25502..25568 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 79.000" repeat_region 25617..25902 /rpt_family="Alu" repeat_region complement(25875..26074) /rpt_family="THE1" repeat_region complement(26080..26738) /rpt_family="PAB" misc_feature 27148..28419 /note="BLASTX similarity to P51522 (45..395); match: 0.52, score: 6.7e-136; database searched: nr; ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1) >pir||A32891 finger protein 1, placental - human~~predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 62.000" repeat_region complement(29075..29136) /rpt_family="MER5" repeat_region complement(29736..29819) /rpt_family="MER1" repeat_region complement(29838..30055) /rpt_family="MER1" misc_feature complement(30554..30678) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 61.000" repeat_region complement(31101..31395) /rpt_family="Alu" repeat_region complement(31405..31691) /rpt_family="Alu" repeat_region complement(31894..32164) /rpt_family="Alu" repeat_region complement(32277..32579) /rpt_family="Alu" misc_feature complement(32792..32928) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 87.000" misc_feature complement(33256..33322) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 73.000" repeat_region 33683..34118 /rpt_family="Alu" repeat_region 34378..34688 /rpt_family="MER1" repeat_region 35245..35532 /rpt_family="Alu" repeat_region 35731..36020 /rpt_family="Alu" misc_feature 39657..39736 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 48.000" misc_feature 40539..40672 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 41.000" repeat_region complement(40898..42495) /rpt_family="L1" repeat_region complement(41530..41830) /rpt_family="Alu" repeat_region 43115..43374 /rpt_family="Alu" misc_feature 43924..43959 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 76.000" repeat_region complement(45112..45494) /rpt_family="THE1" misc_feature 46757..46910 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000~~BLASTX similarity to (8..59); match: 0.53, score: 3.6e-11; database searched: nr; KRAB-domain- containing zinc finger protein ZNF45 - human (fragment) >gi|186633 (M67509) ORF [Homo sapiens]" misc_feature 48579..49625 /note="BLASTX similarity to P51522 (84..395); match: 0.53, score: 4.6e-149; database searched: nr; ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1) >pir||A32891 finger protein 1, placental - human~~predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 87.000" misc_feature 48808..48893 /note="DDS similarity to AA205091 zq71g04.r1 Stratagene neuroepithelium (#937231) Homo sapiens cDNA clone 647094 5' similar to SW:ZN17_HUMAN P17021 ZINC FINGER PROTEIN 17 ;(1..96); Score: 164 Identity: 89/96 (92%)." repeat_region complement(50165..50484) /rpt_family="Alu" repeat_region complement(50701..50893) /rpt_family="L1" misc_feature 50995..51067 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 57.000" repeat_region 51045..51310 /rpt_family="MER2" repeat_region complement(51475..51754) /rpt_family="Alu" repeat_region complement(51791..52008) /rpt_family="L1" misc_feature 51907..52250 /note="DDS similarity to T94161 ye28g12.r1 Homo sapiens cDNA clone 119110 5' (1..346). Score: 666 Identity: 342/346 (98%)." misc_feature 52492..52955 /note="DDS similarity to AA282963 zt15h09.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone 713249 3' (1..461).~Score: 901 Identity: 461/461 (100%)" misc_feature 52810..53007 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 47.000" repeat_region complement(54460..54750) /rpt_family="Alu" repeat_region 55286..55576 /rpt_family="Alu" repeat_region complement(55589..55867) /rpt_family="Alu" misc_feature complement(56201..56489) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 60.000" repeat_region complement(57156..57441) /rpt_family="Alu" misc_feature complement(58267..58345) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 62.000" repeat_region 58377..58639 /rpt_family="Alu" repeat_region 58742..59029 /rpt_family="Alu" misc_feature 60754..61085 /note="BLASTN similarity to Z64322 CpG clone HS9F12R (1..331); match: 0.99, score: 6.8e-125;Identities = 327/332 (98%).~" misc_feature 60915..61142 /note="DDS similarity to AA333113 EST37150 Embryo, 8 week I Homo sapiens cDNA 5' end (1..229); 97% identity.~" misc_feature 61073..61216 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 54.000" repeat_region complement(62185..62479) /rpt_family="Alu" repeat_region complement(62615..62911) /rpt_family="Alu" misc_feature 63332..63372 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 78.000" misc_feature 63332..63372 /note="DDS similarity to AA333113 EST37150 Embryo, 8 week I Homo sapiens cDNA 5' end (230..270); 100% identity." repeat_region complement(63779..64061) /rpt_family="Alu" repeat_region complement(64073..64361) /rpt_family="Alu" repeat_region complement(64450..64734) /rpt_family="Alu" misc_feature 65177..65242 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 68.000" repeat_region complement(66240..66543) /rpt_family="Alu" repeat_region 67303..67598 /rpt_family="Alu" misc_feature 67648..67774 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000~~BLASTX similarity to (3..42); match: 0.55, score: 1.6e-07; database searched: nr; KRAB-domain- containing zinc finger protein D19S19 - human (fragment)." misc_feature 67648..67693 /note="DDS similarity to |AA333113 EST37150 Embryo, 8 week I Homo sapiens cDNA 5' end (271..317); 94% identity.~" repeat_region complement(68620..68905) /rpt_family="Alu" misc_feature 69783..71276 /note="DPS similarity to Accession: gi|1769491 (U66561) kruppel-related zinc finger protein [Homo sapiens] (221..718). Score: 1555 Identity: 269/519 (51%)~~predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 89.000." misc_feature 71011..71484 /note="DDS similarity to AA418246 zv96b07.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 767605 3' 91..474).~Score: 936 Identity: 471/474 (99%)" misc_feature complement(71056..71543) /note="DDS similarity to AA418360 zv96f07.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 767653 5' similar to gb:X52354 ZINC FINGER PROTEIN KOX23 (HUMAN);~Score: 938 Identity: 481/486 (98%)." repeat_region 71654..72997 /rpt_family="MER7" repeat_region complement(72074..72359) /rpt_family="Alu" repeat_region complement(73231..73488) /rpt_family="Alu" repeat_region 74156..74440 /rpt_family="Alu" repeat_region 74806..75099 /rpt_family="Alu" repeat_region complement(75157..75468) /rpt_family="Alu" repeat_region complement(75655..75955) /rpt_family="Alu" misc_feature complement(77624..77929) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 52.000" misc_feature 77744..78141 /note="DDS similarity to D81878|HUM417H05B Human fetal brain cDNA 5'-end GEN-417H05; (1..396). Score: 762 Identity: 392/396 (98%)~~Other overlapping matches:~(77796..77947) AA454141 zx45h09.r1 Soares testis NHT Homo sapiens cDNA clone 795233 5' (1..152); Score: 304 Identity: 152/152 (100%).~" repeat_region complement(78065..78125) /rpt_family="THE1" repeat_region complement(78160..78332) /rpt_family="Alu" repeat_region complement(78762..79050) /rpt_family="Alu" repeat_region complement(79247..79541) /rpt_family="Alu" repeat_region 80505..80792 /rpt_family="Alu" misc_feature complement(81102..81399) /note="DDS similarity to AA229025 nc50c11.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone 5709 (1..300); Score: 550 Identity: 290/300 (96%)." repeat_region complement(81556..81783) /rpt_family="Alu" BASE COUNT 21056 a 17431 c 19441 g 23858 t ORIGIN 1 gatctttgat ttatcttcct aaagcagtgt atgtgagaac cttgtgttga ggaagactga 61 taacagcttg tatgtattta gcacgatgtg ccaggcctgg tgttaagtac tttttacctc 121 acttcccata ttcctaatga ggttggttgt agtgtttttc atcttactga tttttttttg 181 tttgtttgtt tttgttttgt tttgtttttg gctaatgact ttaaaagacc tccagttctt 241 gaacttcagc tagggtccca tccccagagt ttatcagaag gtataggttg ggtccaagaa 301 ttcacaggtt ttttaagttc atagtgaagc tgatgctgta aaactacata tcaaattggg 361 aagcaatgaa ttagaagagg ctcatgtttt cgtcatgtta catgtaagtt cctccttcac 421 ttttcgccat gaataaaagt tccctgtggc ctcaccagaa gcagatacgg gcaccatgct 481 tcctgtacag ccagcagaaa ggtagctaat tacatctctt ttctttatat gttaggaaaa 541 ttagaaggac attcaatgtt ttacaaaaag gcccacaaac gaaatacaga caacgttcaa 601 tttattaata cgtttcaatc cccaaatctg agagaggtga agcaggtttc ccagggcctc 661 agaagcagga ttgaagaatg ggtcccttgc tggcgacttc agtggcgtgg cattgtccca 721 ggtggccatg gaaatcaggg tgcataccct ttaaaagcag ctgtggcctc cgagatatcg 781 ttttcccagt cttttctccc cctccctgca cgacgcctct tggcagacat ccgggagaac 841 ccgaaaggcg ctcgttgcct ggttggatgc agggtacaaa ctacgttacc cagaaatctg 901 tgcccagctc attgttagcc ggcgtgcagc caatgacagc ccagaaactg ggcgtttcct 961 gctgctctgg gctgcagggg cgagacttct ggcgtcgccg tcgtgacgta tttttcctat 1021 gcccggtccg tgcattctgg ttgtgaaggc tgagttctag agatcgggtc ggctttctac 1081 gcggctctcg tggaacctag caaagaaaga cagtgaagac tgcaggacct tccttcgcgc 1141 ttttgttaca atccatgacc cctgtcgtgg gacgggcggc ctctcgcgga ggtgtctgcc 1201 ggggctgggc tcttaccgag gcctccacac acgtcctctt gtccttgtct cccccagaag 1261 cagccgcctt agtcttgtga gcgtttttac accgggtaga ttgagacttg gagtgctaca 1321 ctcagcccga gggcgtccag cgcggtggag gcgtggggtt tcggctgagc ccacagggca 1381 cagactgttc atccgcttct catggcagcg gcggtgctga tggaccgggt tcaggtgagt 1441 gggggcatcc ctcaagcgca ccccggcctg gttggtgtgt cctgggatgt tcgctctcat 1501 cgcgcacttc agggtgctca ctccgtcgca ttatgtggag gggttccgct cccctcatca 1561 gtggcataag gtgtggaacg gactcgaagg cgcaggggca gttcgtacaa atgcatgcat 1621 ggagaggaga atgagtttgg ggagttccgg gaactctctg tgcctggtga gaacagggac 1681 tagggggtca gaggtaggcc tggaatggcc gcccgaggag ctggtcctct cttctgaggg 1741 tgctgggagt catgcaaggt ttggtacaga gaagggaggt cctctgacgc agttcttaaa 1801 aggctccctc tggctgcaga gtgggtagac tagggattat agggtagacg gggagatcaa 1861 gatgaagtct actgaaatag tccaggtgag cgttgatggt ctgggccaga gtggtggcag 1921 aagaggttag agaatgtatg ggttttgggt tttgtatgta ttttataagt aaggctgacc 1981 agattctctg atgagacata gtgatgatat cctgggattc ttcacatgac tgtcttcact 2041 tgagacaggg ggcagtgaat ggaggtcatt tctagtttag tgtcctgaat ccaggccatg 2101 agttgtatgg agttatgtgg agaagtcccc acctggcgac caatgggatg agtactgcaa 2161 aagtatatca gacctaggaa caggtacgct caaatgcttg taggattgag tttggagcgc 2221 tcagggaatc tcaggtctta attacctttg ttggaaacag gtatcagagg aaggagttag 2281 gcaggaatgg ctggccgagg tgtctcgagt gtggtaaaag tcgtgggaga tttggagcag 2341 aggagggaga tgatctgact cagattttaa catgctctct ctggctttca ggttgagaac 2401 agactgtgag catgagggag cagctactgc agtagtgttg atggtgtctg gtctagaggt 2461 gtggctgtgg aggtgagaga agttggtaaa ttttctgttt tttaaagtag ggtgattgga 2521 ttttttgatg cgaaaggtga aagagaggag agtcgtggat gaccctaagg ttggtttgtt 2581 tttgctgtag ctggtaaaat tatgaaaatg ccatcaactg agatagagaa gttggcagca 2641 ggtatgattt ttagtggtta tatcaggagt taggctttgg tcatgtttag gttaagaggt 2701 ctcagttgtc atcttagttg aggcctcagg aaaacacaca ggtttttagt tctggggaag 2761 ggttctgttg aagaagttga aaaaaatggt ggccagtgtg tggagtggga tggggtatct 2821 tcaaatcatg gtggtaatgc tgttattcag taagaaagtt gaaactggac actgagagaa 2881 aactgggtct gttgcttagg cacctggacg gcagtggtgg atagtgtgaa gacgcgagcg 2941 cttcctggac actgcttgca gagtggcttg ggggatgagg gatgtcatgg aggtgcatat 3001 ggcatgggcc agtggcggtg gggatctcag atgtgaggaa ggtgtagtcc aaagagtaat 3061 gagcacgttg ggaggagtgt gaactgatgc ccttaagact ctagtattgg cagctcatgg 3121 gagaagatgg agctgaagtt tggttgtgtt tagaggcatc gtctggacaa caccaggaga 3181 actggttggt aggcgagtat ttgggaccct tcatgccaga cccagagttg agacagcatc 3241 tgtctgcaca cttgcctggt tctgctctgg gctggacaca gtggtgagag ggacatgagg 3301 agagagcaga cactagtggt gccaaggtct gttatctcct cttagttggg aggctgggga 3361 ggcgagggac tagagagtgg gtgtgtgagt gtgtagactg gggaggaggg ggttctgggg 3421 agagatgctg actgtggact cagctgtact catcatggca gagttgtgtg accttcgagg 3481 atgtgttcgt gtacttctct cgggaggagt gggaacttct tgaggaggca cagagattcc 3541 tgtaccgtga tgtgatgctg gagaactttg cacttgtggc tacactaggt aagtctgtgt 3601 tttagttatc taatgctaca tggcaaatta tcccaaagct tagtaactgc aaacaataaa 3661 aatcaattat ctctccagtt tcagtgcatt ggtaatctgg ttagagcatt ttttgttgtt 3721 tattattttg agggtttttt tttgtttttt gttttttgtt ttttcagaga gtcttactct 3781 gtcacccagg ctacagtgca gtgttgtgat catggttcac tgcagcctca aattcttggg 3841 ctcaagtgat cccccccacc tcaccctcct gagtagctgg gactataggt gcatgccacc 3901 acgcccggct attttacttt ttgtagagat aaggtcttgc tacgttgccc aggctgatct 3961 agagctcctg acctcaagcg atccttcctc cttggcctcc caaagtgctg ggattacagg 4021 cttgagccag tgtgcccagc ctgatcacac cttcttatgc aggtgctttt ggctctcatg 4081 aagttttaat caagctctca gccagtgttg tggtcttacc tgagaagaat ccacttacaa 4141 gctcactcac attgcttttt gcaagattta gtttcttgtg gcttgttgga ctgtgagcac 4201 tgttccttgc tggctgttgg ccaaagtcct ccctcaggtg gcctgtctgt attgtacctt 4261 atgacacaga agctaacttc tctttgagca agtgaccaaa agaatgttca agatggaagc 4321 ccaggattgt aacctaatct tgaaagtgtc atttcattat ttttgccata ttctattcct 4381 tggaaataag taagtccatc cagcttacaa tcaagtgagg aggcttacac aggggcaaga 4441 gtaccaagag gtagggatca ctgggaccat ctcagaggct gtgttccaca ggcctttaca 4501 cccactccgg tgtccttggt ggggtttgag tcttcttctc ttccccatgg gccactcttt 4561 ctgtccatcc agatccatga ctcttctcct tcccttcttt gtttccaaga gtaggtgctg 4621 tgggttctag ggctggctgt gttcagtgtt tctgccttca ctgggtaatc tgaacatctg 4681 ttgccctaga gctttgcagg aaaaggttag gattcgggag tttttcagtc tgctaagcag 4741 attctacaca gctcaggtgc ccactgcctg agtccgtgtc atgatctttg cacctctttg 4801 tcctagattc tgccctcatc ctctggctga catttcctgg ggcctgactg tgccaggaat 4861 ttcaggggct gacacagtca cttcttgtgg tgattacagg gctgtcctgg tacatccgcc 4921 ttggagagta ttttttctaa acattctatt ttctctgtgg agtgggtcta cctgcgccac 4981 tcacctgccc tttgtttgcg ttacagcttt cattttccca gtcccatgca gttgcgcagt 5041 tggagggggc agaaaacctt gggtgcctga cagggcagac atcactgcag ccacagtaaa 5101 ggagacctac agagggcatg gccctgctga atgggagctg gaagagggta tctgttatgg 5161 ttgggttcaa atcgaacctt caacttgttt tgtgtttgtc agtgttttgt tgccaaggcc 5221 catagtgatt cttcacatct gctttacttt cctcattctt ggcacttttt ccatttgtca 5281 tttctcacgt gtcttccatc ctttgggtgt tttcccactt ttctggcctc cgtgtttcaa 5341 ttgctctctt caacactagc ttactttaat gtgtcatgag ttctgcacat taaatagtcc 5401 gtgagaatga gctacttatt tggagtcaga ttttcgtggg catggacata cccatctgca 5461 cagagctcac ctgacaccag cggccaaggc cctgctgaaa ctctcttgca gaggaatcag 5521 gtagaggctt cacctatact ccctatactc tatgccattg tttttaggtc tttacctcat 5581 agtcagggtt cctccattct gtgaggccct atactgacaa ccagcccttc tttacactga 5641 tcctggccct cttgtcctgc caacaagcca gtggactcgc actggtgtta gacacacatt 5701 tgtgatgggg ctgctgcctc ccaccaaagt cagcatgcac ttcaccagct cttttttgct 5761 ttcaggtttt tggtgtgaag cagaacatga ggcaccttct gagcagagcg tttctgtaga 5821 aggagtgtca caggtcagga ctgctgagtc aggtcttttc cagaaagcac acccatgtga 5881 gatgtgtgac ccactcttga aagacatttt gcacctggct gaacaccagg gatcacacct 5941 tacacagaaa ctgtgcacac gtgggccgtg taggagaaga ttctcgttca gtgcaaactt 6001 ttaccagcac cagaagcaac ataatggaga gaattgcttc agaggggatg atggaggggc 6061 ctcatttgtg aagagctgta cagtccacat gttagggaga tcctttacgt gcagggagga 6121 agggatggac ttaccagata gctctggcct tttccagcac cagaccactt acaatagggt 6181 gagtccatgc agaaggactg aatgcatgga gtctttccca cacagctcca gtctcaggca 6241 acaccaagga gactatgatg gacagatgct tttcagttgc ggtgatgaag ggaaagcctt 6301 cctggacacc tttactcttc ttgacagcca gatgactcat gctgaggtga gacccttcag 6361 atgcctacca tgtggaaatg tgttcaagga gaaatcagct cttattaatc acagaaaaat 6421 ccacagtgga gaaatatctc atgtgtgtaa ggagtgtgga aaagccttca ttcacttgca 6481 ccacctaaaa atgcaccaga aatttcacac tggaaaaaga cactatacat gcagtgaatg 6541 tgggaaggcc ttcagccgca aggacacact tgttcagcat cagagagttc acactggaga 6601 aagatcttat gactgcagtg aatgtggaaa agcctacagc agaagctccc accttgttca 6661 gcaccagaga attcacacag gagaaaggcc ttataagtgc aacgaatgtg ggaaagcctt 6721 tagccgtaaa gacacacttg ttcagcacca gagatttcat actggagaaa ggccttatga 6781 gtgcagtgaa tgtggaaaat tctttagcca aagctcccac cttattgagc actggagaat 6841 tcataccggg gcaaggccct atgaatgcat agaatgtgga aaattcttta gccataactc 6901 tagcctcatt aaacatcgga gagtccacac aggagcaaga tcctacgtgt gcagcaaatg 6961 tgggaaggcc tttggctgca aagacacact tgttcagcac cagataattc acactggagc 7021 aaggccttat gagtgcagtg aatgtgggaa ggccttcagc cgtaaagaca cacttgtgca 7081 acaccaaaaa atccacactg gagaaaggcc ttatgagtgt ggtgaatgtg gtaaattctt 7141 cagccatagc tccaacctta ttgtacacca gagaattcac actggagcaa agccttatga 7201 gtgcaatgaa tgtgggaaat gctttagcca caactccagc ctcattttgc accagagagt 7261 tcacacagga gcaaggcctt atgtgtgcag tgaatgtggg aaggcttaca ttagtagctc 7321 ccaccttgtt caacacaaga aagttcacac tggagcaaga ccttatgagt gcagtgaatg 7381 tgggaaattc tttagccgca actctggcct cattctgcac cagagggttc acactggaga 7441 aaagccttac gtatgcagcg aatgtgggaa agcctatagc agaagctccc atcttgttcg 7501 tcaccagaaa gctcacactg gagaaagagc tcacgagtgc aacagttttg gtggcccttt 7561 agctgcatct cttaaacttg tttaacacca gaaaattcac acaagagaaa ggccttatga 7621 atgcagaaaa tatgtcatct tgttcatcct cataggactc acaccagagc aatgctctgt 7681 gagtaccctt tgtgagggaa ccatcagcta gcagatgagc accgtatatt cattccaccc 7741 tggggagatt cctgataagc accacatatg tgggaggctt tcatgaggtg tgttgcactt 7801 tgtaactgtc tagagctctt gatggaatta tatcactgcc agtgcctgtg gcggaagcca 7861 tcttattgct accagctgtg tgtgtcaatc actccatttt gctcagggaa ggcagacttc 7921 tgtgctttct ttcctgttcc ctacaggtaa tcatgaatat tttcaaggac ttcccccccc 7981 ccccacttca ccccctacca ttgagggtcc tcatcttttc cctcatgatt aggttctgag 8041 caaacatgat ctagctctca ccaaaaggac ctgagctagg gtctgctggg atttcctgac 8101 acgattttcc atcttcatgg acaatgttaa ctgtaaacgt gatagctgtg acttacttgt 8161 cttactgcca aatcgcccaa atttggaact gcttgtccca tgctgctctg atttatacag 8221 tgataagggc ctattgtggc agtctttact cttggggatt ttgttactgt gtagagtgga 8281 ttgagaaaag aattgggttc tgtcatgaag ggagtagcac cttttgtggg caccaccttt 8341 atgtgcctca gaggggacca aaggatggca gaaaactgtt ctcagtctag tttgacctaa 8401 tttacactat tgccctaggc ttgctaggaa agactgaaaa aatttttgtg cctaactttg 8461 tggcctggct gccaggattt cctgtgagcc cagtgaggag ggcatagttg gtgcgaaaat 8521 ctcctgtgct tgtggaagca acataagttg ggtcccttaa ctgtcacgta ctccccagca 8581 ctgttgaggg attgctgtct tctgtacctg tacaggatca agtccctgat atccagaacc 8641 ccataggcaa gtaacattga ctgtaagacc tctgtagggc aatgtgaaaa tcatgattgc 8701 tacagaagca ctgagataat ggagtgggga tgactgtggc aaacaaaagg agattcgaac 8761 attctagagg ggcatccaca agtctcggtg cagttatgat ggtgtgggat gggagtgcat 8821 agaaattctc aggaccttca gacatctgta tctcctgtgg ccaaggccac tgtcagagga 8881 aataatctaa atgtttgtgg attcctgtgt ctccctgggc agttgacctg cacagacagg 8941 aacctcagca gtgacagaag agagatccag ggatgggtct tctctgaccc agccagcatt 9001 ccgagtgact gaggtggatg tggacatcat tgcttaccaa tttttaccaa cattgaggca 9061 gggctcactc tcctaaattg tagggagagg aatatggtaa agccaaaata gttgacagat 9121 ctaactttcc tagtggaaca gtatatatag attttaatga ggaacattta tttaatagct 9181 ttatggaggt atgattaaca tgcaataaac tgcaaatatg taaagtgtaa aatgtgctgt 9241 ttcagcatgt gtgtacacct atgaaaccac cacagtcaag atatccaaca caacaaaaga 9301 ttgtcccttt ataatcctca atttttcctt atcttgtttt ccacaattca caagcaacag 9361 caagcatttt ctataatttt ataaatgaaa ccatatggtg tgtacttttt agggggtggg 9421 gctctggctt tttcagtaaa cataactatt ttaaggttaa tccattatct tgttgcatga 9481 atcaattgtt tgctcatttg tattgctgag tagtatttca gtatttcatt gtataccaca 9541 atttttcttc tttgtcaact taattgttat ggatgtttgg gtaactttca gtttttggct 9601 attacaaata aacctctgaa gatttgtgtg caaattatgg ctgaatgata ttccattcta 9661 taaatacacc tcagtttctt tatccattta cttattgaag gacggttgct ttcaagtttt 9721 ggcagttatg aataaaccta ctacatattt gtatagtttt tctgtggaga tgttttcaac 9781 tcatttggat atgtgccaag gagtgcaatt ttctctgtca tatagcatgt ttagtcttgt 9841 aagaaactgc caagctgtct tccaaagcgg ctgcaccatt tattttattt atatatattt 9901 tttgagacag agtctccact ctgtcaccca ggctggagtg cagtggtatg atcttggttc 9961 actgcaacct ccacctattt tagtagacat gggtttttac catgttggcc aggctggtct 10021 caaactcctg acctcaagtg atcaccctcc tcggcctccc aaagtgctgg gataacgggg 10081 gtgagccact gcaacgggct gtggctgtac catttttgca tccaactaac aatgattgaa 10141 cattcctctt gcatctcatc ctaatcagca tttggcattt tcagtatttt ggattttaaa 10201 atgccattca aataggtgtg tagtggtatg ccattgttgc ttttatttgc aattccctaa 10261 taacacacaa taagtatctt ttttaaaggc agggttgttt tgttttgttt tgaaatggag 10321 tcttgctctg ttgcccaggc tggagtacag tggcatgatt tcggctcact gcaacctcca 10381 cctcccgggt tcaagcgatt ctcccacctt agccttccaa gcagctggga ctacaggcac 10441 ccgccaccat gcccagctaa tttttgtatt tttagcagaa atggggtttc gccatgttgg 10501 ccaggctggt ctcgaactcc tgacctcagg tgatctgtct gcctcggcct cccaaggtgc 10561 tgggattaca ggtgtgagtc actgcgccca gcctggctgg ggttttttta tgtgagtttc 10621 acacacctct gagtaagtgg cttgcagaca gagaagttaa caactttggc tgttagcttg 10681 aacctgtgat ttatgaatac caaaaagaca aatacagaat ctaaggaaac acataacagt 10741 cctgtataca cacagcagct ggcagcccta ttcccggtgt gtcccatata cctaccattg 10801 acctgaaacc cagtttgtac gtgctcattg aacattttgt agagcagtaa atacacatca 10861 tcacacagat tgcctactca tttttatttt tatttttcga gacggagtct tgcactgtca 10921 cccaggctgg agtgcagtgg cgcgatctct gctcactgca acttccacct cctgggttca 10981 agcgattctc ctgcctcagc ctcccaagta gcttggatta caggcacccg ccaccatgcc 11041 cagataattt ttgtattttt agtagagaca gggtttcact atgttggcca ggctggtctt 11101 gagctcctga cctcgtgatc cgcccacctc ggcctcccaa attgctggga ttgcaggcgt 11161 gagccaccgc acccggcctt catttttatt ttttgacaag tgtttccatt tgttatctct 11221 atccctccaa acagttaata agcgatgttt tcaagtatga aataagaaat acatatagga 11281 gacagaagca taatgtgtcc aggttccagc acagccagtc tcaacttcat tctttgactt 11341 ctcagactct tgctcaaact ctaccgtaac cttgcttttt ccccccttca catgtcacag 11401 agagatttct gtaggactat ttctgccact agttttgaaa taacataatc actgttagtc 11461 caaactgcac catcctgtaa gccccctgcc atctcacaga ccttggtcag agtgaagcat 11521 tccacggagt gagggccttg agaaacatcc tgcccaactg cctgactttc ttatcacatc 11581 gttctgggaa aagatccaag gaaggtcact atcacatcct gccggataaa aggccaaact 11641 gcctcaggaa catcttacgc acatcctttg gccgggtgct gtggctcacg cctgtaatcc 11701 cagcaatttg ggaggccgag gcgggcggat cacctgaggt caggaattcc agactggcct 11761 gaccaacatg gtgaaaccct gtccctacta aaaatacaaa aaaattagcc gggcgtggtg 11821 gcacgcgcct gtaaagccaa ctactcggga ggcggagttc aagaatcgct tgaactccgg 11881 gaggcggagg ttgcagtggg ccgagatcac gccattgcac tcaaggctgg gcgacaagag 11941 cgagactcct ccaaaaacaa aaaaaacccc accatcctcc tgggcagcaa gtcataccct 12001 cccgccgcgc cccctgcccc ccacccctga cccctctcat ccaggcctat aattgcccca 12061 gcctgtaagc agtgaggggt tctggcacta agctagttct ccccatcaca ggtctcgtgc 12121 tggacataaa acctgcattg ctgtagagct gccaactctg cctttcttta accctcgcct 12181 tcccttcaaa acctaacagt tattattaat agatttctgt aattttcatt tcacaagtat 12241 tttcacagat aatttcaact ccctgtaaaa tgataactca cttctcagga aggagataag 12301 atcatgaaag ttcacagttt ttattgggga taatacgttc cctgtgtaca tatactccaa 12361 ctatggaatg gcacaaaatg tcctgtcagg aaatggcatg tatcagtcac tctcattttc 12421 agacaggatt gagactttta aaataaatgt aaaatatttc agacatgaag taataagtat 12481 tattagaata aaagcattgc actcttggta ggaacacaat cactgactct acagacagta 12541 atgatgtttt cttctggatc cagaggcaca gcgtcaccct ttcaatctac gtcggtatct 12601 cctgtcttct tttgtgcaaa ttcccagcca tgtagtcttt ctttgttctc ttggaccaac 12661 catattgaag cttaataaat agaagtcaaa gtttaactgt atgtgtattt gaaaagcaac 12721 atatggctga gaggcatgag tagagacaga gtgcctttgt ggtatagtgt gtccctttgg 12781 cggtgtagcc gttgagaact tgctgtgctt cacaatccga tatcataggt ccctcacatt 12841 tctgtgagta tcctagacct tacttcaaga aaatctggtt atgtcgcctt taagagcaac 12901 ccaagccccg tcctgtccgt ggagtccggc actctgttta ccagctcccc tgacggtgcc 12961 actgagcctc tatccgccat ataaggactg accaacgctc taaggcaagt ccagaagcgg 13021 tgtcgaaact tcataaccca gaagacactg cggttctcgg cagcgcgact gacccaatga 13081 atgtgcagga aggaaccttt gcgtgcgtgc gtgcgtgcgt gcgtccgtcc tcgtgctcgc 13141 gcatcgtagg agggcgggac ttccggcgtc ctcttgccgt ggttgatttg attttctctg 13201 gtgttttcac tagttccggc ctttggcgct ctatgacgtc accgaagtga cggagcggaa 13261 aagcgcgaga agcggcttgg ttccttgtac gcagaggcgg tagtgacaca ggcacaactg 13321 acagtggcag aagctcagct gacaaggact ggggacggcg gtgtccttgt cttgcctttg 13381 tcgcccccgc ccctctcttc cctggctgga cttgcggagt ccccgccgaa gaacccgagg 13441 tgggtgcccc gtcccaggcc cccccccacc tccgcccgac ccctccttcc agtgctgaag 13501 cccatagaag gggccctgca ggtcaggccc cttgtctcga agagaggggc gttcctgtgt 13561 ggggtccccg tatcagctgg tttgatgcag cctcagagct cccttcgggg atctcaggtt 13621 cagagcatgg aggtgctgcc aagtgggccg cgtcggggag cccggagtgt gggttgtttc 13681 tgcgggaaag agagggtttt gtgaattttc tcttggaatg gcagcctgag cagccggtat 13741 aaccatggaa ggttgtcaag ggaagaccag tgggaggggc aggtccagag ttagaacgag 13801 caggatgggg aattgacaca gagagaccca gaagaggcca ctgcagtgga tcccagaggc 13861 tgcaccagag ccgtggtata agagggacaa tgtgatggga ttctggatag acctggaaga 13921 tagagccaac aggttttcct gccgtatcag atatgtctgt gagcaaaaat agggagtgaa 13981 gggtcatttt tttttttttg cccgaacatc tgaaagaatg aagttgggga agtcagcagc 14041 atgagaagta ggtttcatgg agaagatcat gggttggagt ttggccagtg tgattctcag 14101 gcgtctcaag ttgaaaatgt cacatgggta gggggacaga aaggtgtgga gtctacagag 14161 gagaactggt gtagattctt caaaagaata gagggtggcc gggcgtggtg gctcacgccg 14221 gtaatcccag cactttggga ggccaaggcg agcggatcac aaggtcaggc gttcaagacc 14281 agcctgacca acatggtgaa accccgtctt tagtaaaaat acaaaaatta gccgggggtg 14341 gtgacgcgcg cctgtaatcc cagctattcg ggaggctgag gcaggagaat tgcctgaacc 14401 caggaggtgg agattgcagt gagccaagat cgcgccattg cactccagcc tgggcgacag 14461 agcgagattc cgtgtcaaaa aaaggtgcgg agcgcgggtc tcttccgcgg aaactgacat 14521 tgcgtttccg ttgtcggcct cccgctgcag gagccatata ttgaagacca tgtctggaag 14581 cttctacttt gtaattgttg gtcaccatga taatccagtt tttgaaatgg agtttttgcc 14641 agctgggaag gcagaatcca aagacgacca tcgtcatctg aaccagttca tagctcatgc 14701 tgctctcgac ctcgtagatg agaacatgtg gctgtcgaac aacatgtact tgaaaactgt 14761 ggacaagttc aacgagtggt ttgtgtcagc atttgtcacc gcggggcata tgaggtttat 14821 tatgcttcat gacataagac aagaagatgg aataaagaac ttctttactg atgtttatga 14881 tttatatata aaattttcaa tgaatccatt ttatgaaccc aattctccta ttcgatcaag 14941 tgcatttgac agaaaagttc agtttcttgg gaagaaacac cttttaagct gaatggagaa 15001 aattccaaaa taaattatat caccacaatg gtgtatactc aggaatgtgt acattgtaaa 15061 ttacttgatt aaatagcctg gaaatctttt gtgtattctc agcttatcta aacttaatga 15121 aatttctttt atatttaaaa atagtacatt ctgtctcatg tcacatatca gtagatcaat 15181 tagtatttcc ttgtgaacaa tgttatttat aaagaactca ttatcaataa taattaattt 15241 ctttcttttt tttttttttt tgagacggag ttttgctctt gttgcccagg ctggagtgca 15301 gtggcacagt ctcgtctcac tgcaccctcc gcctcccagg ttcaagcaat tctgcctcag 15361 cctcccgagt agctgggatt acaggctccc accaccatgc ctgcctaatt ttttttgtat 15421 ttttagtaga gacggagttt cgccatgttg gtcaggctgg tcttgaactc ctgacctcgt 15481 gatctgcctg cctcggcctc ccaaagtgct gggattacag gcatgagcca ccactcccgg 15541 ccatgaaata tttttactta aaaattggga ataagctttc tttttctttc tttctttctt 15601 tttttttttt cttttgagat ggagtctggc tcctgttgtg caggggctgg agtgcagtgg 15661 cacgatcttg gctcactgca acctccacct cccgggttca agcaattctc cttcttcagc 15721 ctcccaagta gctaggatta caggcatgca ccaccacgcc tggctaattt ttgtattttt 15781 agtaaagacg ggatttcacc atattggtca ggctggtctt gaactcctga cctcgtgatc 15841 cgctcgcctc ggcctcccaa agtgctggga ttacaggtgt gagccactgc gcctggccat 15901 gaaatatttt ttacttaaaa attgggaata agcttttttg tgtgtgtgtg tatgtttttg 15961 tttttttgtt tttgagatgg agtcttgctc ctgtcatgca ggctggagtg cagtggcacg 16021 atcttggctc actgcagcct ctgcctcccg ggttcaagca gttctccttc cgcctccaga 16081 gtagctggga ttacaggcat acgcctggct aatttttgta tttttagtag agacagggtt 16141 tcgccatttt ggccaggctg gtcttgaact cctgacctca ggttatctgc ctgcctcggc 16201 ctcccggagt gctgggatta caggtgtgag ccactgcgcc tggctgggaa taagctttcg 16261 acttgcccaa ttcagataaa ttgttttttt tttttttttt tttgacgtgg agttttgctc 16321 tgtcgtccag gctggagtgc agtggcgctt ggctcactgc aacctctgcc tcctgacttc 16381 aagcgattct cctgcctcag cctcccaaag tgctgggatt acaggtgtga gccaccatac 16441 ccggcccaga taggttgatc ttataacaat ccagaaacaa atgtcatagt caagatttgg 16501 tagatagatt taaaactaaa atattctgcc attggaagtg aaatgtcata gcacatacgt 16561 ggacgttatg catttagaga tgttataaaa atgtattggc agtatacata gcactactca 16621 agaagccaaa gaaacacttg tgcagtgcta agtgtcacat gtctgcttct gccagaggct 16681 aggaatagtg atcttgctat aatgtgagaa cctgaatcat gtttgtaaaa taggaggctg 16741 ggagcagttc caggccacag tgaagtgtgt tgcttctgtc actttatatt cttatatttc 16801 ctttccctca gacaatcaac tgttgatgcc tgtacttgtg aaaagttgta aggcaatttt 16861 tagggttgtt gtcaaagaaa agcttggatt acattaacat ttgtactcag ccttttaggc 16921 aatacaccag agggggctgg gaattgtctg tttgttccta tgataagtag atcttactgt 16981 aaaatagtaa atgtccattg aaaagcaagt aacacaacct gtgctaactg ggagcacttg 17041 aacaattttg ttccattctg aataactttt cagcaagtaa ttctaagctt tgtcttatat 17101 ccttgtgtaa aattgctacc ttcattttta ccctatttga tttcttaaat ggtttgttca 17161 agaaaaaaaa aaaaaaaaaa aagaatagag ggtgtagatt ggaccgggtc ctgaggatcc 17221 aatttggtag ggaagtctaa aggtcattgc ctgcaagcta tgtggtgagg gaggagagag 17281 ctgatacttg gggttcacac cataggtgga gccataagcc agatgcttac tttagaaagg 17341 cgcagacctg ccataggaag gattttgtcc tctgcctaga aaaagggagg cgtgggtggg 17401 ataaggaatg ggcttgggat ggagactaag gtttgagatg gtttatattc tgcacctaca 17461 ggcttggctg accctcaggg ccactctttg tgggaaaaga aaaaagttat ccaggctagt 17521 ggagcagctg ggttctaaac tagggcactg ctttttcccc ttctccacat atcaaggcaa 17581 gattggatga tctactgccc cagtcaaggt ctgccagcaa gctcccctca gagccttatg 17641 taacaggcag gcctggttgc agaggaccag tagctgacct acctgctcag cccttcccta 17701 caactcatta agaatatatg aaacacttat tggtataagc actaaccagt gtccttctaa 17761 atgtgtacac ataggcagtt agaagtgaag ctattcattg agttaaattc ctgcaaacac 17821 tacataattg atttgaccca aatttgccta taattatcaa cataccatgg aaaattttag 17881 tatcctctaa aattttaaaa gctaaaaata tacacctcag tttttgaaat tctactacat 17941 catttatact gaaaacatat agtaaaccat taatccaaga acagtaatag ctgatatata 18001 tgatgagcaa cgagtgtttg acaccggtaa gagtcctatg aggtgggagc cattcttaca 18061 acctatttac agatgagagc actgaagctc cagcaggttc agttcttttc tagagtcaca 18121 gtgctgctaa gagccaacag tgaagtcgga ggagctgagt ttatacaatc aacttgagat 18181 cagtttgctt tagggtaagg aagaagagat ggtggtccaa ctctgcctgt aagatcttcc 18241 catcgttgcc catttctcac ggtttccctc ttccttcagg gtcccctggc gatggcagaa 18301 atgaaccctg cacaggtgag tggagtgttt tctacctttc acctaccagt tatctcatag 18361 atggttttgg cacctccatg taaagaaaag tggtgtccag agactctctt tctgtctttc 18421 tccctctctt atgtctgaaa agtgagtgat tccaccagtt ctagtatctt aaattaggtc 18481 tgggattttt gtttgattgt tttaagttta ctgtgacacc ttacctggat ggccctggat 18541 tatgtggagt ggctcccatt tttgacagtt gacgtggtaa catctgtcag agtgtcatgg 18601 gcacaagatt aagaatgttt cagtgtttag aggagagact gaactggatt tttagggaac 18661 tgcaagtgtc tgttatatcg gataggaata gggagcagag tgggaggtgg ctagagacac 18721 acaaacatca ggcctggaat gacagcctag gtcctgggtc tctaatctaa gggaggtggt 18781 agatggggaa gatttgaagc agaagagggc agtgatagga ccctagctta aggaactttc 18841 tttggcgtct gagtggagga cagactgtgg aggtgagggt aatggcaggg agatctgaga 18901 ggatgttcct gcagcagtct aggtggatgt tggtggtgac ttgtcaagag tagcggtggc 18961 ttcaggatgt gttttgaact ggagctgccc acattttatg atgtagagag tgaggcagct 19021 gggggggtgg gatagaaggg gaatcaggag tggccccatg gtttttgttt ggaagcatag 19081 atgctccatc agctaagatg ggagaagtca gcagtattag gaattttcag agtgagtgtg 19141 aggagtcagt gtttgttcaa gccacagatg tctcagagac tcctggtgga ggagttcagg 19201 ttggagacgt ccacactatg gtggctgtcg tatggaccag gatgaagcca tggattcagt 19261 ttaggtgaca gcatctctag gttatggtgg cagtgctgtt cgaagacaga gaattggaat 19321 ggggcatgag aactgggtct cttactgggt agctgtgcag tggtggagga gtcactagac 19381 aaatgaggga atggagtgtt tgtggggatg agtgtatgtg agatggtggg gtataggagt 19441 gagaaagact tctgaaggag agtggacatg atggggttgc tgaggagggt aatagggtga 19501 ggtcggagag ttgctaagca tcagttggaa tgagatgagg atgggaattc tattagtctg 19561 ttttcatatt gctacaaaga actacctgag actgggtact ttatgaagaa aagaagttta 19621 attgactcac agttcttttt tttttttttt tttttttttg agatgtggcc caggctggag 19681 tgcagtggca caatctcggc tcactgcaat ctctccctgc caggttcaag cgattcttct 19741 gcctcagcct cctgaatagc tgggacgaca gactcacccc actacgccca gctaattttt 19801 gtatttttag tagagatggg ctttcaccat gttggccagg atggtcttga tctcctgacc 19861 tcgtgatcca tcctcctcag cctcccaaag tgttgggatt acaggcgtga gccaccgcac 19921 ccagcccata gttcttcagg cttaacagga agcataactc ggaggcctca ggaaacttac 19981 aatcctgtgt ccatgaggca ggacacagct gagagtatgt gaagtcctca catggtttgg 20041 gcacctgaca gtgaagtgag aaacagggcc atgtagggaa ccttcaaccc agggcttagg 20101 gctgcccccg gtgctaggga actttggtgt tctggggcag gactgggagt gaattggatg 20161 ctgagcgtat ttgccgtgca ccacctggtt tctgtgtgca gagtggcctg ttaggaggtg 20221 aaggaaatgc ctgtggtgtg ggctgggagg tgtctgtgga attgactgga gtggaggtgt 20281 gagagatggg attggagtgc tgtccaaaga gtgatgtgca ggcaggagga gtgtgaatgg 20341 agggcctcag gccctggccc tggtagctca agggaggagg atgagtctga gtttggttgt 20401 cttgggaggc aagatctggg tgactccatg gaaactggct ggcaggagag gatggatccc 20461 cccacgggag gcccacattc cttttctacc ttatcttcca tctgtttcct ctgtgtgctg 20521 gacacggtgg ccaatgtcaa tatggagaga aaagtcactc atgccacctg gatgtggttt 20581 ctcttccaca ttgggagggt gtagagattg aggaaggaaa ggaggtgtag gggagaggga 20641 tgagttcctg gagagagaaa agtagcagtg attgtaccca ttggggtttt ttgttttttt 20701 ttttttagat ggagtcttgc tgtgttgtca cccaggctgg agtgcagtgg catgatcttg 20761 gctcactgca acctgcacct cctggattca agcagttctg cctcaatctc ccaagtagct 20821 aggactacag gcatgtgccg ccatgcccac ctaatttttg tatttttagt agagatggga 20881 tttcactgtg ttggccaggc tggtcttgaa ctcctgacct caggtgatct gcccgcctca 20941 gcctcccaaa gtgctgggat tataggcatg agccactgtg cctggccccc atggagtttt 21001 aatttatgga tgtgagtgag tgatgccagg gacaggccaa tgacactgaa agaagatatt 21061 tattccttac gtttccctaa ggaggttaca taccacataa cacagggcca cacggtgaaa 21121 caccagattt gatcaggagg aaaattggag tgagtttagt ttagtccaca gatgctgttg 21181 gggtttccaa gggaaacaag gcagggctgg ctgtcaggat agtttggcta gttttagtaa 21241 ttccaggaca cctgggctat tgggactgtc cctagttgtg cagtacctgt ccctgggttg 21301 atttagagca tgggaaatac tgacttggtt tgagaaagtt agagatggag atggttaaga 21361 atatgtgccc aggatggtag gggagatgga aacacattta gctgttagtt tgtccctgtg 21421 tttaatgggt ggtaaatata agtagaaaat aaatagagaa cttaagtaaa tacagtttga 21481 aaagctgctc atgtattcat ctttcccaat ttggcagggt catgtggttt ttgaagacgt 21541 ggccatatat ttctcccagg aggagtgggg gcatctcgat gaggctcaga gattgctgta 21601 ccgtgatgtg atgctggaga atttggccct tttgtcctca ctaggtaagg ccctcacact 21661 tgcccagtgt cctgggttgg gctgtgttgt ctccttttac ctgaaggcag ctctgcgttt 21721 cccacagtga gaccatgggt gctgcttctt ttccttgttt cctgacatat gttccatgag 21781 agtcaggact gcaatatgtg ctgtgtgctt cctttttcct ggcagcccca tcctctgctg 21841 ttctgaggct tgcaagaaag ggctcaggat ccagaaatgt tgaaggtgac atagagaccc 21901 actggccctg tgttctattc aaatgcatga gacctctgtg ctctgttgtt cccttgctct 21961 tgccaatatt tcttggtccc ttcttgttcc tggaatttct ggcactgacc tgatcactaa 22021 tttggtgaac cactggcttg attgtgtagg atgtagaaat cttctgtgaa gttctggtga 22081 ttatcgaatg tgagatatca tcaggtgatc tctctgggaa tgtgctgtct tgtctctttc 22141 tgtgtctttc ccctaggact ggtctctttc aggtctcaca gactgtcacc ttaagcaatg 22201 gggagggccc tggtacccaa gagggtggat attactccct gaagggctgt agagactcag 22261 agggacatga tcctggttcc tgggagttgg gaaaaggtgt aatatcacag ctggggtcat 22321 attactagga atctgtctgg tgacccctgt tgtttagttg aagactggtt gccacaactc 22381 actcttcttt tccttccccc gttgtcattt ttactctgac ttctaatccc tggcttccag 22441 ccgttaccta ttttcttcaa ctgctccctc tgcagtgcca tccactgcat gacctgtact 22501 taaggtgagg tctgcatgta tttgtcagta gtccctgaac accatctcct ctgtcacaat 22561 cagatctctc tgggtttaga cacagccttc tacacactgc tcactagtca ccagcagcca 22621 tggcctgtgg aaagccattt gcagagaagt gacttggaga ccctcctcct ctctgcactg 22681 tacccctctc ttgtgttctt acccatcatc agggctctcc cattatgggt atttgccagg 22741 tccaactctg cacagcaggc cttcttctca ttggccttag tgtttgtatt acttcattct 22801 cactctgcta ttaggaaata cccaagactg ggtaatttct aaaggaaaga ggttaattga 22861 ctcacaggtc cccattgctg gggaggcctc aggaaactta cagtcatggc ggaaggcaaa 22921 ggagaactag gcatcttctt cacagggcgg caagacaaat gagtccaagc agggaaaatg 22981 ccagacacat aaaaccagca aatttcatga gaatttactc agtttcatga gaacggcatg 23041 ggggaaactg ttcccatgat tcagttctgt ccacctggtc ccacccttga cacgtgggga 23101 ttatggggat tacaattcaa gatgagattt gggtggggac agagagccta accatattat 23161 tccaccccgg ccccccccac ccaattttat gtcccttcca catttcaaca ccaatcatgc 23221 cttcctaaca gtcctccaaa ctcttaattc actctagcat taacccaaaa gtccaagtcc 23281 aaagtctcat ctgagatgag gcaagtccct tctgcctatg agcctgtaaa atcaaaagca 23341 agttagttac ttctaagata taatggggat agaggcactg gataaatgta cctattccta 23401 atggaagaaa tgggccaaaa caaagcctac aggccccatg caagtccaaa acccagcgag 23461 gcagtcatta aatcttaatg ctctgaaatg atatcctctg actccatctc tcacatccag 23521 ggcacagtga tgcaataggt gggctcccac tgtcttgggc aactctgccc ctgtggcttt 23581 gcagggtaca gaccccccct cagctgcttt catgggctgg cattgagtgt ctctggcttt 23641 ttcaggcaca cgttgcaagc tgttggtgga tctaccattc tggggactgg aggacgtggc 23701 cttcttctca tagctccact aggtagtacc ccagtaggga gtctgtgtgg gggctccgac 23761 cccacatttc ccttcctgca gtgccctagc agaggttctc cgtgagttcc acccctgcag 23821 caaacctctg cctggacatc aggcatttcc atacctcctc tgaaatctag gtggaggttc 23881 cctcccaaat atcagttcat gacttctttg cacccacagg cccaacacca tgtgcaagcc 23941 accaaggctt ggggcttaaa ctctgaagca atggctggag ctgtaccttg ccccttttag 24001 ccatggctgg agttgaagca gctgggatgc agggcgccat gtcccgaggc tgcatagagc 24061 agggggaccc agggccagca gctgggatgc agggctccat gtcctgaggc tgcactgaac 24121 agcctgggcc aggcccacaa aaccattttt ccctcctagg cctctgggcc tgtgatggga 24181 ggggctgccg ggaaggtctc tgacaggcac tggagacatt ttccccatca ccttcgtggt 24241 taacattcca ctcctcgtta cttatgcaaa tttctgcagc agccttgaat ttctccccag 24301 aaaacgggtt tttcttttct attgcatagc caggctgcaa attttccaaa cttttctgcc 24361 ttgcttcctc ttgaacactt tagcacttag acatttcttc tgccagatac cctaaatcat 24421 ctctctcaag ttcaaagttc cacagatatg tagggcaggg gcaagaagct gccattctct 24481 ttgctaaagc atagcaagag tcacctttac tccagtcccc aacaagttcc tcatctccat 24541 ctaagaccac cacagcctgg gcttcatggc ccatattgct aaatcagcat tttgatcaaa 24601 accattcatc aagtctctag gaagttccaa actttcccac atttcctgtc ttctgctgag 24661 ccctccaaac tgttccagcc tctgcctgtt acccagttcc aaagtcactt ccacattttc 24721 gggtatcctt acagcagtat ttcactacct cagtaccagt ttactatatt tgtccgttct 24781 cacactgcta ttaagaaata ccctggctgg gcacagtggc tcactcctgt aatcccagca 24841 ctttgggagg ctggggcagg tggatcatct gaggtaggag tttgagacca gcctggccaa 24901 tatggtgaaa ccccatctct actaaaaata caaaaattag ccgggtatgg tggcaggtgc 24961 ctgtaatcct agctactcgg gaggctgaag caagagaatc acttgcaccc agggggtgga 25021 agttgcaatg agctgagatc gtgccactgc actccagcct gggtgacaga gcaaaactct 25081 gtctcaaaga gatacgcgat acttggccag gcttggtggc ccactccggt aatcccagca 25141 ccttgggggg gctgaggtgg acgggtcact tgagatcagg agtttgagac cagactggcc 25201 aacatggcga aaccccgtct ctactaagaa taacaacaac aacaaaaatt agccgggcat 25261 ggtggctcat acctgttatc ccagctactt gggaggctga ggcaggagaa ttgcttgaac 25321 ccaggaagcg gaggttgcgg tgagctgaga tcaagccact gcactccagg ctcggggaaa 25381 gaatgagagt ttgtctcaaa aaaaaaaaaa aaaaaaaaaa aaagacatac ccgagactgg 25441 ataatttata aaggaaagag gttgaattga ctcacagttc cgcatggttg ggaggtctca 25501 ggaaacttac catcatggta gaaggcaaag gagaagcagg catcttcttc acaggacagc 25561 aggatggagt gagtgcaagc aggggaaatg caagatgctt ataaaaccat tagattggcc 25621 gggtgcagtg gctcacacct gtaatcccag cactttggga ggccgaggag ggcggatcac 25681 ctgaggtcag gagtttgaga ccagcctggc aaacatggtg aaaccccatc tctactaaaa 25741 atacaataat tagccaggcg tggtggcacg tgcctgtaat cccagctact caggaagctg 25801 aggcaggata atcacttgaa cccaggatgc agatgttgca gttagcagag atcgtgccac 25861 tgcactccag cctgggcggc agagcaagac tcagtctcaa aataaataaa taaaacccat 25921 gagatctcag gagaactcat gatcatgaga acagcatggg gcaaccgtcc ccatgattcg 25981 gttaccttca cctggtccca cccttgacac gtggggatta tggagattac aattcaagat 26041 gagatctggg tggggacgca gagcctaacc atattgtttc cagaccaaac cgagggtggg 26101 gctgcttatt cttgcagccc aataatgaga tgcagatgaa ctggggaaaa agagagtttt 26161 tatttctgta accggttaca gggagaaggc ctggaaatta ttgccaaacc aactcaaaat 26221 tacaaagttt tgcagagctt atataccttc taagctattt gtctacatgt gggtttgcat 26281 tcatctaaag atataagtga ttaacttctc tgtaaccaag atctgagtcc tgaagacctt 26341 cctctggagc ctcagtaaat ttacttaatc taaatgggtc caggtgctgg ggtgattacc 26401 cttatcttgt ctccttttaa atcatggagg tttggggagt ttccttagaa ccccaataaa 26461 cttattcgtg gaggcctggg gagtttcttc agacccccaa taaaatgtat ttaatcctaa 26521 acgagtcctg ttaagaattc cttcattatc ttttcatcct ttaaggccca ggaaaggcct 26581 aggcaaaact cttggtgggc ttttgttaca ttccagcctg tacatgaggg cactggctct 26641 atcagctttt aatcaactta accactcagt cagtgctgaa accgttgtca tggaagcctg 26701 cctgctcagc tgttagtgag acctggcctg ccacagtacc agggttcatg cctgtcacca 26761 attccatgat cattttcatg gggtgtctga ctgacatttg tgatggggtt gcctccgctc 26821 atcacagtca acatgcacct caccagcatt tttattctta caggttgttg ccatggagct 26881 gaggatgagg aggcaccttt agagccaggt gtttctgtag gagtgtcaca ggtcatggct 26941 ccaaagccct gtctatctac ccagaatacc cagccctgtg agacatgtag ctcacttctg 27001 aaggacattc tgcgtctggc tgagcatgac ggaacacacc ccgagcaggg actgtacacg 27061 tgtccagcac atcttcacca gcaccaaaag gagcagatta gagagaaact ttctagaggg 27121 gatggaggaa gaccgacatt tgtgaagaac cacagagttc acatggcagg gaagaccttc 27181 ttgtgcagtg aatgtgggaa agcctttagc cacaaacata aactttctga ccatcagaaa 27241 atccacactg gagaaagaac ttataagtgc agcaaatgtg ggatattgtt tatggaaagg 27301 tccacactca atagacatca gagaactcac actggagaaa ggccttatga gtgcaatgaa 27361 tgtgggaaag cctttctttg taagtctcac cttgttcgtc accagacaat ccactctgga 27421 gaaaggcctt atgagtgcag tgaatgtggg aaattgttta tgtggagttc cacactcatt 27481 acacatcaga gggttcacac tggaaagagg ccttatggtt gcagtgaatg tgggaagttc 27541 tttaagtgca actcaaacct ctttaggcat tacagaattc atacaggaaa aaggtcttat 27601 ggttgcagtg aatgtgggaa attctttatg gaaaggtcta cactcagtag acatcagaga 27661 gttcacactg gagaaaggcc ttatgagtgc aatgaatgtg ggaaattctt cagcttgaaa 27721 tccgtcctca ttcaacacca aagagttcac actggagaac ggccttatga atgcagtgag 27781 tgtgggaagg ccttccttac aaagtcccac ctcatttgtc atcagacagt tcacactgca 27841 gcaaagcagt gcagtgaatg tgggaaattc tttaggtata actctacact tctcagacat 27901 cagaaagtcc acactggata aggcccttat gaatgcagtg gatatgggaa agccttcagt 27961 caccaacata ttgtggctgg acagcaggca gtacacactg gagaaagact gaatgccgtg 28021 aacgtgggta attatgtagg tacagctctc cagtcgctat gtatcagaga attcacactg 28081 cagaaatgtg tgttcagcaa actcgggaca ttattttggt ttgactctca tctcattaga 28141 cattggagag tttacactga agaagagtct tttcaataaa gtagaaagtg gtaaagattc 28201 aacatgcaag attgtactta ttgggcttca gaatatccac actagtgaaa gtcttctgag 28261 tacagcaaat gtgtgacatt attttgctac tactccacac tacttagaca tcatgtagtt 28321 cacactggaa aaaggccacg tatgtgcctt gaatgtagcc aaaatgacga acaacaccca 28381 gaaatctgtg atttagcact gagaactagt attatatggt ttttaaaaaa caatggtgaa 28441 gtacatgcca cataaaattt gccatcttaa ctattgtaat gtcttgttta atacttgaag 28501 tacattaaca ttgttgagca aagaatatcc tgaactcttt atcttgtaaa atgaaactct 28561 ataaccacca ttaaaaaaac aactcattcc cacattcttc agtccctggc gaccaccata 28621 tttttaagtc attatgactc tgactattct tggtacttct cagaagaaga atcttagagt 28681 gtttttcatt attttttttc acttaggata atatcctcaa agttcattca ttttttagca 28741 tgtgtcagaa attttaaggc tgagcaatat ttcattgttt gcatttacca catttccttt 28801 atttctgaca tctattcatg gacacttggg ttacttctac ctctggctat tgtgaatatt 28861 gctactacaa gcataagggc acaaatatct ctttgagacc ctgctttcaa ttcttattgg 28921 gtaccccaaa aagtggaact gctggatcac atggtacttc taactttaat tctttgatga 28981 actgagaaac ttttccataa aacttgcacc gttttacatt cctaacattc cacaaaagtt 29041 gtgatttgtc ctcattctaa tccatagaag gcaacatttc taacaggttt ctaggtgatt 29101 ttgttgttgc tgatgtaggg actacatttt gagaactgtc ctatcccatt tttataacaa 29161 tgcagagctt ctctgtaaat accccttgtt tctatgttct atcctgtgag ccttttcatt 29221 ttgttaattt ttcccagtgt ttacaaccag gaaatagctt gtcatctttg atacaaaaca 29281 gaaatctttg caaaatacca attgcattct gtagcccttg gatcattttc tcccccaatc 29341 tatgtagata tttagtggtg tataatgttg tatgagaata catcatggta agaatacatt 29401 ttaaaggact ttaagtcagg tcacaaaata cagcaattta cctgtggacc tgccctctac 29461 ccaggccatc cagtcccaat gactgtgctc ttactgctgc tttccaggtg tgtacctatc 29521 atggggtgtg catgccactg accattcctc tgtgctctta ctgctgcttt ccaggtgtgt 29581 acctatcatg gggtgtgcat gccattgagc attcctctta atgggttcta ggcaacagat 29641 agaatagctt aggtgccaaa gccacagaac ctatcagggg ttgtgtcatt ttgaaaatta 29701 tgccaaattc taaatgatct tgtatgacag cagtccccca acctttttgg caccaggaac 29761 tggtttcatg cgagacagtt tttctgtgga tggcagtgca gggggctgga tggtttcgga 29821 atgaaactct tccacctcag atcatcaggc attagttaga ttcacataag gagtgcacct 29881 aggtccctcg catgtaacag ttcacagtag ggtttgtgct cctatgatgt ctaatgccgc 29941 cgctgatcag gaggtggaga tcaggcagga atgctcgctc ctgctgtgcg gcactgtgtg 30001 gcctagttcc tgacaggcca cggactggta cggtctgtgg tttaggggtt ggggattgct 30061 gttaaatggg atttggggaa ttttatttgg gttctctctt tgactagtct attgtcaaag 30121 ttattccatc taatttttct agagtaagcc aggttctgtg tcatgtaata gagccacata 30181 aacagcaaag gtgctattgt gaactgtgca tacgagggat caagttgtgt tccttatgtg 30241 aatctgagct aaaataagaa aaaaaaaaat ttagcctgtg aggtcagaag caagacagtc 30301 atgttagatt tctgtcatta ttcatagttc tgcaaaggtg gtttcaggtt ctgcatcaac 30361 tgaaacgtgc tcttccatcc agcagcattt aaaaccaaag atgctgtccc ttaattagtt 30421 aactctcaca atccatttta gtaatggttc ctcaggaaac tgactacttc aatctataaa 30481 gtaaaacaag ttcttcacat tatactctct ggtttgaata gttacttggt tttgttcttc 30541 cccacaatga ctaccttctt ggtagccgta ggtctcagag gcattttctg ttgtccctaa 30601 atgattctgc ccttggtggg aggtggcttt taggctggct ggtggctttc gttcagatgg 30661 actgaaatct aagcttgatc tagcatcaag tttggaccca gcactctcta attttttttt 30721 ttttagctgt tatagataac aaaggattga gtaatttgct ttttgcttat tagcttgcat 30781 tttcttatgc atccagtgaa ccagagattg aatattttgc tttttgcata ttagtttgca 30841 ttttcttatg catccagtgt accaactccc tgggagctgg atccatctct aaatttccat 30901 ggtaactctt acctttagta actgagcaca gacagccaca gttccatacc atgtatggcc 30961 atttggccac taaggaatca aaggtttttc atcctccacc catttttaaa ttgaggatta 31021 ttcaaaagtt aaagaaaaaa ctttattatc tcttattaat atatgtaaat tgtgttcaaa 31081 atagaaaatg aacttctact tttttttttt ttttttttga gatggagtct cactctgttg 31141 cccaggccag agtgcaatgg tgcgatcttg gctcacttca acctctgcct cccgggttca 31201 agtggttctc ctgccacagc cccctgagtt gctgggatta caggtgtgca ccaccttgcc 31261 tggctaattt ttgtattttt agtagagacg gggtttcacc atgttggcca gactggtctc 31321 aaactcctga cctcgtgatc cgcccgcctc tgcctccacc tcccaaagtg cttggattag 31381 aggtggagcc accgcaccca gctttttttt tttttttttt ttgagacaga gtcttgctct 31441 gttgcccagg ctggagtgca gtgatgtgat ctcggctcac tgcaacctca gcctcctggg 31501 ttcaagcaat tcccctgcct cagcctcctg agtagctggg actacaggtg cgcacaacca 31561 cgcccggcta attttttata ttttagtaga tatggggttt caccatatta gccatgattg 31621 tctcaatctc ctgaccttgt gatctgcccg ccttagcatc ccaaagtgcc aggattacag 31681 gcatgagcca ctgcgcccag ccgaaattct atttttatat ttgtgtattg tcaatactaa 31741 agctaatttg aataaagtct tataaacaaa tccatccaat tttaatcagg ttttgaccac 31801 acaaggtaag atttttccgt aaacctttta taccttctta caaatttttt tctatttttc 31861 tctttcccca atttttagat ctatttacct ttgtttgaga cagaatctca ctctgtcacc 31921 catactggag tgcagtggtt tgatcttggc tcactgcaac ctccacctcc caggttcaag 31981 tgattctcgt gcctcagccc cccacatagc tgggattaca ggcacacacc accatgtccg 32041 gctaattttt gtattagtag agatgaggtt tctccatgtt ggccaggctg gtttcaaact 32101 cctggcctca agtgattcac ccacctcagc ctcccaaagt tctgaaatta taggcttgag 32161 ccactacgcc cagtcctaga tctatttact tttatctaca ttattttcct ttcattttga 32221 aacgaccttt acataacccc taaactagac agaattcctt tttttttctt tctttctttt 32281 tttttttttt ttttgagaga agtctcactc ttgttcctca ggcttgagtg caatggcccg 32341 atcttggctc actgtaacct ctgcctcccg ggttcaaaca attctcctgc ctctgtctcc 32401 caggtaactg ggattaaggc acctaccatc acgcctggct aatttttgta ttttttttta 32461 gtagagacgg gttttcacca agttggccag gctggtctcg aactcctgac ctcagatgat 32521 ctgcctgcct tggcctccca aagtgctggg attacaggcg tgagccacca cgcccggccc 32581 agaattactt tttctttagc aaaaaccaca tcttcatttt ttaaatataa gcttcttcac 32641 aaaaaacaca tagtaaacat catggcactg ggctgcacaa tactttgata gagcacgttg 32701 atgtaaagac atttttgtaa gtgtttagag catgcctttt atatctaaac atgcaaagaa 32761 atgagtagcc tcctgtcgta ataaccattt actgtaaaca actgccacca gctgcttctg 32821 acactgcagc tcttgcttgt gacagccatt acgcacaaaa acgtcaagtt ctttcacagg 32881 acaaagtaat ctctggtacc ccccaaaacc aatgatatca ggtaatgcac tacaaaagaa 32941 ggcagaattt tagacctgag ataaatctgt cctcttaaaa ctcttgagtg agagagagag 33001 agagagagag agagagaggg agagagattt cctcatctgg ttagtgtaga tagcaggttg 33061 gcttcctgag ctggtcagtg cagtagtggg ttagagttct gttttatatt tggcttggcc 33121 attgttgatt tctatagtca atttcttaca tgtaaacaag gaagatagct ataatattga 33181 gatttcttgt tttcttaact ggtcttaggt tgaaaattga tttttcattc tcctccccgc 33241 tcccccctcc ccccaccatc ccattcttct gcctctgccg gatcttcagc tgggtagcat 33301 tttggttttg tttgtttttt aacctggagg atgtccccaa atatcaaaac ctaaacttct 33361 gattcttaga agtcctccct tcgatgagtt gctagtttta tttctgaaca tgaacttacc 33421 aaaacatccc ccagaaatcc cctggtctca gacataactc tctatgcttc ccattccatg 33481 gcagcaactc agcctaactc atccaaagca aaaaagcaat cccttcttgc ttaagtgaca 33541 caaggatccc cagatgaaca agtagctgta tcacccttca attaaatgta ataatattta 33601 cttctggtgg aacctctcct tccttggaac aaagggcccc caaccagcag aatttacaat 33661 taagaagaag tggccagaca cagtggctca agcctgtaat cccagcactt tgggaggccg 33721 aggcaggcag atcataaggt caggagttcg agaccagcct ggccaacatg gtgaaaccct 33781 atctctacta aagatacaaa aaattagctg tgcgtggtgg tgcgtgcctg taatcacagc 33841 tactcgggag gctgaggcag gagaattgct ttaacccggg aagagggggt tgcagtgggc 33901 tgagatcatg ccattgcact ccagcctggg cgacagggca aaaaaaaaaa ttagccgggc 33961 atggtggcgg gcgcctataa tctcagctat tcaggaggct gaggcagaag aatcgcttga 34021 agccgggagg cagaggttgc agtgagctga gatcgcgcca atgcactcca gcctgggtga 34081 cagagcaaga ttctgtctca aaaaaaaaaa aagaaaaaaa gaaaaaagaa gaagggtagc 34141 agaaattctg tgaatgagtt actaggttca aaaatgagga tgctcaatat aatgcatagg 34201 ttcagaaaaa aattacgatg aaaagaattg tcaaattttg taactgatac ttgacctggt 34261 taaatttgaa ttatttctaa ttccctatgt atatgtgcag tgagataaaa gataaatatt 34321 gtatgtgaca tactcatcat aaaattattt gttgtttatc tgaaatacat tctaaatcag 34381 gggtccccaa cccctggggc cacagatgag tactatggcc tgttaggaat ggggccatat 34441 agcagagggt gagtggcaga tgagtgagca ttgctgcctg atctccgcct cctgtcagat 34501 cggcagcagc attagattct cataggggtg caaaccctac tgtgaactgc acaagagagg 34561 gatctaggtt gtatgctcct tatgagaatc taatgcctct gaaaccatct gcccccaaac 34621 ccctctccct tatccatgga aaaattgtct tccacaaaac cggtccctgg tactaaaaag 34681 cttggggatc acagttctaa tgacataact ctttccctga taatttgctt ctacttaaca 34741 attccttgtt ttatataatt tgcttacaga attcctgcag ttacagtctt gatgccattt 34801 atgttgtgac aaaccttcgt catttgtcat attcaactta tttcattcct ttatctattg 34861 gacattcatt aatgagaaac acactcaaca gtagcattct gagcctaaag actcatgatg 34921 ataaacagtg tagaaaacat tacaaactct attataatta ttctaccatc tccatctgat 34981 tttaacaaaa aagttatttt gtctctttct ttggaactag ttgagaaaat catgcacctt 35041 ccaaagttaa tatatatttg actgtccacc attttcatta gtcaatgggc aattacatag 35101 agtcgaagac cttcaaaggg aattttaaat actcctcagt tcatcataaa ttataaactt 35161 gcattttgac attttttgca ttcaatatac tcaaaatata gacaatagca acttctacaa 35221 gagttgaaaa acattgtggg tacagtggct catgcctgta atcgcagcac tttgggaggc 35281 ctaggcggac agatcacttg aggtcaggag ttcgagacca gtatggccaa catggcgaaa 35341 ccccatctct aatagaaata caaaaaatta gccaggcatg gtgccacgcg cctgtaatcc 35401 cagctacttg ggaggctgag gcatgagaat cacttgaacc tgggagccag aggttgcagt 35461 gagccgagat tgcgccactg cactccagct gggtgacaga gtgaaactgt atcttaaaaa 35521 aaaaaaagaa aagaaagaaa aaagaaaaca ctggagaaaa cgttggattt tacaatgtat 35581 gtctgggtca gttagtctac catttcatat ccaaactcat aactaattac aaacagatgg 35641 aaccattgaa agcttgcaaa cttttaaatt atattcatat aattaaaata taagtaataa 35701 tttatattca attaagaata aatggggcca ggcgtggtgg ctcatgcctg taatcccagc 35761 actttgggag gccgaggcag gcggatcatg aggtcaagag atcgagacca tcctggccaa 35821 catggtgaaa tcccgtctct actaaaaata caaaaaaatt aactgggcgt ggtggcgtgc 35881 atatgtagtt ccagctactt gagagtctga ggcaggagaa tcacttgaac cggagaggcg 35941 gaggttgcag tgagccgaga tcacaccact gcactccagc ccagcgacag agtgagactc 36001 cgtctcaaaa aaaaaaaaaa gaataaatag attttgatat ctcaagggaa aaaaagtgat 36061 attcctattt gtctgcaggg cagagctgta gtggtcaggc agtctcctca aacaaagcca 36121 aaactgctat tatatgagct tttaggaagg ctggccttca gggactgagg actttcagaa 36181 gcagcagagt tgaaagactg atggactttc aggtatgtta gtgtaaatgc aaccatttac 36241 caactggctt tcataaacag gattactgac aggaggcact tgtggattag ttgttgccag 36301 cagcctgggg ctgccactga ttgaccatct ttagtactgg gccttacccc cattggaaaa 36361 attttagaat cttggcctaa caatgatttg ctaagtttct ggttaattac tggggtgaag 36421 cttttgctga ctagctgact tttaaagcat aggtacacat cccaagttat ctctaattaa 36481 tggtggtttc ttgtttatga ttatggattt ggggcagaga taaatttttt ccttacttga 36541 acataagcca gcaacttaat tctcattccc catttttggt ttcccaacag ccagaatttg 36601 taagaaatca tatgctatgg tttggatgtt tgtctcttcc aaagctcatg ttgaaattta 36661 atttccaatg tttgaggtgg ggcctaatag gaggtgttct ggtcatgagg gtagatccta 36721 cctgaatacc cccacttgaa gatgtgtaaa ttctcactct attagttccc aagaaagctg 36781 gttgttgaaa agagcctgat acactcccca cccactgtcc tcctctcttg caatatgatt 36841 ccttcgcatt atagctctct ttcgttttcc accatgaata gaacccgccg aagaccctca 36901 ccagatgcag atgcccaatt ttaaactttc cacacatcaa cattttgagc caaataaaca 36961 tttttttctt tataaattac cagcttcagg tattcattta tagcaacgct aactggacta 37021 agacatcata agtattatct agatatttgt ttgtgaggat atgattttca actcatgttg 37081 ccaatgagta atttaacggc cagttacagt ctgagaaaaa tgtgtctccc actggtgatt 37141 ctttgtaata tattacaacc agctgatatg taggatatgt aggccattct gataagtcaa 37201 tacaggattc aggaaaaatc gttatatata ggtcatattc ttaaactgtg cccaggtctt 37261 aattaataat tagcaatgac acagagggga cactgccatc aaaagaaaag attaatgtta 37321 cagttccctg gaaataagag gcactgaaca ccacacaggc cacaagaaac cttattacat 37381 acagagactg gaaagcagaa agagaaaaca gagtaactgg aagtatgtct ttataactaa 37441 agtggcagag aatgatgagg tagagatacc gtcgatgggc aggaaggaaa aacacatttt 37501 agtgtttgtg gttggggggt tttatatgag tttcatgcac ttgagtaagt ggcttgcaag 37561 aaaaaaagta aacaactttg gttctattag tttgaccctg tgattaatga atatcaaaca 37621 gacaaataca gaatctaaga aaacatagaa cagtcctgta tacacatagc agctgggaaa 37681 cctattccca gtgtgtccca tgcacctatc attgatctga aacccaatac atatgtgctc 37741 attgaacatt tactagagaa gtaaatacac attatcacac agcttctcta ttcattttta 37801 tctgtggcaa gtctttcagt ttatcatctg tacttacccc tctaagcagt taatgagcaa 37861 tgttttcaag tgtgtaaaaa aattacattt tggaaacaaa agcatagtgt gtccaagtgt 37921 ccagcagagc cacaggttca acttcattct ctgacttctc agaatttgct gtctctcact 37981 caaacattga gatcttatct ttttctcaca ttacagatcg atttctctag ggcttctggt 38041 tttttttctt ttggaataac ctacataatt actaatagat ttcggtaatt tttttatttc 38101 acaagtgttt ttcacatttc gactccctgt gcaaatacag ttgtaaacac aggtgaaatt 38161 ttaaatcact tctgaggaag aagataacat catggaattc cacacctttt ttgaggataa 38221 gatgtgcata tgcttcaact atgaaaacgg tagacaatgt tgtccagaaa tggcatgtat 38281 cagctgctct tgagaaataa aaataaaatt ctaagcaccc cctaactgac tgaatggatg 38341 ccctcttgac catgagaacc ctagaataac tttggaagct gaattcacca ctatagagga 38401 acgggaaatc agacacacct cattatatcc cctaccacac tacaatcatt aggttttctt 38461 ccctaagggc taaatagaaa tcaacctttt cagaagacta ctagcttatc ttccaaggta 38521 cagaacagag agaagatgag attattcatt ccttcatcct tctctgagac atcttcttta 38581 ttcccttttc ccctcacgtg tttactctat cttatgtaaa atgtagattt actgggcact 38641 aactaaagtt tcacatgtct gtaatcattt gtctcactgc caacccctct tcctttttaa 38701 ggaaaatgta taataaatac taaacctcct aagaacctct ttggaaaaac cagccacaga 38761 tgcttctgtg acttacattt ttctggatgt gcccttaggc tggtccagta atcctccatg 38821 atttgtgact tatgcctcaa tcactcattt tggtcgtcac tctcattttc agataggatt 38881 gtgattctta aaataaatgg aaaatatctc agatatgaag tagtaagtat tattagaata 38941 aaagcgttgc acccttcgta ggaacacaat cgctgactct acagtcaggt aagatgtatt 39001 ctctttgatc cagaggcata gcttcaccct ttcaacctac ttcagcgtct cctgtcatcc 39061 ttcctacaaa ttccagtcca tagccctaac tgcgtagtct ttctttcttc ttctggacca 39121 aacatattga agcttactaa acagacgcca aagcgtaact gtatgtgtat ttgaaaagca 39181 ataatatggc tggcaggcat gagtagaggc agactgcctt tatggtatgg ggtgtctctt 39241 tggacctgta gttattggga attagctgtg cttaatagag tcttaagtaa taagttgcta 39301 acatttctgt aaatatccta tactacaatc caaaaaaata ccgtatgttg cctttaagag 39361 caagccaagc cccctcctgt ctttggagtc ggggactccg tttcccagct cccctgacag 39421 tgccactggg cctttatctg ccatataagg actgatcaac gctctaagac aagtccggaa 39481 gcggtgtcga aacttcataa cccagaagac actgcagctc tcggcggcac gacttaccca 39541 ataaaggctt agaaggggac gtttgcgtgc gtgcgcaccg tcggagggcg ggacttccgc 39601 cgtcctcctg gtggtggtcg ttttggttct gtgtggtgtt tcaccaactt cggcctatgg 39661 ctctgtctga cgtcaccgaa gtgacggaac ggaaaagcgc gagaagcggc tcggttccca 39721 ccacggagag gcgggagtga gtcaactgac aagcgctggg gacagtggcg tccttgtctt 39781 gcctttgtcg ctcccgcccc gctcttccct ggctgggctg gcggaggcct tgctgatgaa 39841 cctgactgag gtgggtgtcc cgtcccaggc tcccccgccc gaccggtcct cccagtgctg 39901 aagccccctg aaggggccct gcaggtcagg ccccttgtcc cgaagagagg ggcgttcctg 39961 tgtggggtcc ccgtatcagc cgatttgatg cagcctcaga gctcccgtta gggacctcag 40021 gttcagagca tggaggcgct gccgagtggg ctgtgttggg gcgtcagggg tgtgtgttgt 40081 ttctgcggga aagagagggt tttgtgaatt ctcgcttgga atggcaacct gagcagccag 40141 taaccatgga aggttgtcaa gggagggaca gttggagggg gcaggtccag agttcgaagt 40201 ttcaagttag gaaaaccagg ttaggaatgg acacagggag acgcgaaaag gccctgaggc 40261 cactgcagta gacccaggag aattatggtt ggggctggac cagtgacagt agagggagat 40321 gtgatgagat tcttcatagg cctgaaagat agggcccgta ggttttcctg gtgcttcagt 40381 tatgtctgaa caaaggcagt gaagaaccgt ttcaagtgtt tgttttgttt tttgttttct 40441 gtttgaacgt ttggaagaat ggagttgggg aagtcagcat aaagagcaga tttccgggag 40501 aagatcacag gttggatttt ggacatggtg attctcaaat gtctcaagat ggatatgtca 40561 cacagacagg tggacataaa ggtgtggagt ctgaggaagg ggattgggtt ggcgatacca 40621 gacggatgga gggcttggac tgggccagag aatggtctga ggatcagatt gagtaggtcg 40681 gtctcgggtt ttttttgttt gtttgtttgc ataaatttaa ggagtcaagt gcagttttgt 40741 tatatggata tatttgcata gtggtcaagt ctgggctttt aatgtatcca tcgcctgaat 40801 aatgtacatt gtatccatta agtaattttt catcactcat cttcgtctca tccttctacc 40861 tgtttggagt ctccaatgtt tgtcattcca cactctttgt ccatgtgtac acattattca 40921 gcttccactt gtaagtgaga acatttggta tttgactttc tgtttctgag ttgtttcatt 40981 taataatggc ctccagttcc atccacattg ctgtaaaaga catagtttca tgcttttttt 41041 tttttttgtg gcctagtggt attaccttgt gtatatatat gccacatttt ctttacccag 41101 tcatacattg atggataatt ggttaattcc ctatctttgc tattgtgaat agtgctgtga 41161 taaaaataca tgtttttttt aatgtaatga tttattttcc tttgggtaga tgcccagtag 41221 tggaattggt ggatttaatg gcagttctat tttcagttat ttgaaaaatc ttcatactgt 41281 tttccattga ggttgtactt atttacattc tcaccaacag tgtgtaacct ttttctcttc 41341 tccatatcct gggcaacatc agttgttttt tgacatttaa atgatagcca ttctgactgg 41401 tgtaaggaga tactgtggtt ttaatttgaa tttgtctgat gattaatgat gttgagcatt 41461 tttttatatg cctgcttgcg atttatgtgt cttctttttg aaaaatgtct gccctgtgct 41521 tactgttttt tttttttttt tttttttgaa atggagtctc gctccgtcgc ccaggctgga 41581 gtgcagtggc gcgatctcgg ctcactgcaa cctccacctc ccaggttagt tcaagcaatt 41641 ctgctgcctc agcctcccaa gtagctggga ttacaggtgc ccaccaccac gcccagctaa 41701 ttttttgtat ttttagtaga gacggggttt caccatgttg gccaggctgg tcttgaactc 41761 ctgacctcag gtgatccacc tgcctcggcc tcccaaagtg ctgggattac aggtgtgagc 41821 caccgcgccc agccctgtgc ttactgttaa tgagattttt tgttgttgtt tttttgttat 41881 gttttgtttg tttgttgttg agttctttgt aaattctgga tattagtacc ctgtcagatg 41941 catagtttgc aaatatttca tcccattttt caggttgtac atttacttgg ttgattagtc 42001 cttttgctat acagaagctg tttagtttaa ttaggtcctg tctacttttg tttatgttgt 42061 ttatgctttt gaagtcttat tcatgaattt ttttgcctag accaatgtcc agaaggggtt 42121 tccctgagtt ttcctttagt attcttcgtt caggtcttac atttaaatct ttaattcatc 42181 ttgagttgat ttttgtatat ggcaagagaa attggtccag attcattctt tttcatatgg 42241 caatccagtt ttcccagcac attttattga aaagtgtgtc ctttccgtag tgtgcatatt 42301 tgttgccttt ctcaaagatc agttggctat agatatgtgg ctttatttct aggaactttg 42361 ttctgttcca ttgatctatg tgtctatttc tgcagtcaaa tgctttacca ctgagctgta 42421 caccttatgt gtctatttct atccattacc atgctgtttt ggttactaac aacttgtagt 42481 ataatttgaa gtcagatgga cttaatttga gggaaatctt gaggttgtca tttcaggtgg 42541 gctctgtgtt gagagaacag ggggctgaga cttgggggcc agaccattgg tggctccaca 42601 gacatcatac tctgtccaga acggtgcaga cctggcataa cagggattct ggattgcatc 42661 tgtaaaaagg gaggcctgga ctgggtaagc atatggcctg ggatggaggc tgaagtgtga 42721 ggtggtttcg atcctgcacc tgcaggcttg gttgaccctc agggccactc tctgttgaac 42781 aagagaaatg ttattgaggc tggtggagta gctgggttcc agatcagagc actgcttttt 42841 ccccttctct acatgtcagg gctacagggg atgatctact gctccagcca aggtctcctg 42901 gcaagctcac tttagaagat tatgtagcag gtagacctgg ttgcagaaag gaccagtacc 42961 tgacctgcct actcttccct tctctataac tcattaactt aataatgtat gaaacactta 43021 ttgatttaaa ccccaattag acagtatctt tctaagtgtg tgcaaacaga taggcagtta 43081 atagtgaagt cattatgtta aatttctaaa agtgagccgg gtgtggtggc tcacacctgt 43141 aatcccagca ctttgggagg ccgaggtggg tggatcacct gtggtcggga gttcgagacc 43201 agcctgacca acatggagaa accctgtctc tactaaaaat acaaaattag ccgggtgtgg 43261 tggcgcatgc ctgtaatccc agcttctcag gaggctgagg caggcgaatt gcttgaaaac 43321 aggaggcgga ggttgcagtg agccaagacc acgccattgc actccagcct gggcaagaag 43381 agcaaaactc tgtctcacaa tagaggagag gagaggggag gggaggggag ggcagaggag 43441 gggagggggg agggcagggg aggggagggg agaaagtggt tcataattgg tttgatgtta 43501 atttatctcg tttgctttcc ttggaaaaag ttaatatcgc atcctctaaa atcttaagct 43561 aaaaacatac atgaaaattt caaaacccca ctctttcatt tgcattgaac tcatgtacaa 43621 aaccattaat cttggaccat agccgctgat atatatagta catgctactg atgtttgata 43681 ctaatatgag ccctgtgagg gggcagctat tcttgcaatc tatttacaga tggagacact 43741 gaagctcagg tgggtttggt tcttttccat ggtcacagag ctgcagcact gaggcctgag 43801 gaactttatt caatgaattg agatcagttt gctcccaggc agggcagacg aggtggtggt 43861 ccatctcagc ctgaaggatc cttccatggc tgtcaatttc tcacggcctt tctcttccct 43921 cagggtcccc tggcgatggc agaaatggac cctacacagg tgagtagagt gtttcctact 43981 attcacccac cctggttatc tcccagatgg ttttggcagg tgacaaaacc tcttttcctg 44041 tctttctctt atgtctgaag ggtgagtggt ttcaccagtt agtttcagta ttacaagttg 44101 gctgctgggt tacatagact ggtccccgtt tttgagaatt gatctgataa gatatgtttg 44161 ggcctcttgg gcacaggatt cagaatgttt caatgtttgg aggagagact gagctaggtt 44221 tttagggaac tgcaaatgtc tgttacatct ggtaggaata cggagcagaa tgggaggtgg 44281 ctagaggcac acaaacatca ggcctggaat gacagcctag gtcctgggtc tgtaatctaa 44341 gggaagtggg agacggggaa gatttgaagc aaaataggta gtgataggac cccagtttaa 44401 gagactttct ttggcttctg agtggaggac agactgtgga ggtgagggta atggaaggga 44461 gatctgagag gatgttcctg cagcagtcta ggtggatgtt ggtggtggct tgtcaagagt 44521 ggtcatagct tcagttttaa ctagagctgc ccacatttta tgatacagag agtgaggcag 44581 ctggggaggt gggatagaag gggaatcagg agtggcccca tggtttttgt ttggaagcat 44641 agatgctcca tcagctaaga tgggagaagt cagcagtatt aggaattttc agagtgagtg 44701 tgaggagtca gtgtttgttc aagccacaga tgtctcagag actcctgatg gaggagttca 44761 ggttggagat gtccacacta tggtggctgt cgtatggacc aggatgaagc catggattca 44821 gtttaggtga cagcatctct aggttatggt ggcagtgctg ttcgaagaca gagaattgga 44881 atggggcatg agaactgggt ctcttactgg gtagctgtgc agtggtggag gagtcactag 44941 acaaatgagg gaatggagtg tttgtgggga tgagtgtatg tgagatggtg gggtatagga 45001 gtgagaaaga cttctgaagg agagtggaca tgatggggtt gctgaggggg gataataggg 45061 tgaggtcgga gagttgctaa ctatcagttg ggaggaggtg aggatgggag ttgtattaat 45121 ccgtttccat actgctgtaa agaactacct gagactgggt aatttatgaa gaaaagaggt 45181 ttaaccaact cacagttctt caggcttaac aggaagcata actgggaggc ctcaggaaac 45241 acaatcctgg cggaaggtga aggggaagcg aggcacatct tcccatgatg gagcaggaga 45301 gagggagaga atgaaggggg aagagccaaa cacttttaaa caaccagatc tcgtgagaac 45361 ttactattat gagaaaggca aggggaaaat atgccctcat gatccagtca catcccacca 45421 ggtccctccc ctgacacgtg gggattacaa tttgacatga gatttgggtg gggacacaga 45481 gccaaaccat atcaggagtg gaaatatgga ggtcagtgca gctgggcttg gaatgaggtt 45541 ttggtggaga gcaattttcc tgactgttta tgggacagag tgtaccatga ggcaggacac 45601 agctgagagt atgggaagtc ctcacatggt ttgggcagct gacagtgaag tgagaaacag 45661 ggccgtgtag ggaaccttca acccagggct tagggctgcc cccggtgcta ggggactttg 45721 gtgttctggg gcaggactgg gaatgaattg gatgctgagc gtatttgccg tgcaccacct 45781 ggtttccgtg tgcagagtgg cctgttagga ggtgaaggaa atgcctgtgg tgtgggctgg 45841 gagttgtctg tggaattgac tggagtggag gtgtgagaga tgggattgga gtgctgtcca 45901 aagagtgatg tgcaggcagg aggagtgtga atggagggcc tcaggccctg gccctggtag 45961 ctcaagggag gaggatgagt ctgagtttgg ttgtcttggg aggcaagatc tgggtgactc 46021 catggcaact ggctggcagg agaggatgga tccccccacg ggaggcccac attccttttc 46081 taccttatct tccatctgtt tcctctgtgt gctggacacg gtggccaatg tcaatatgga 46141 gagaaaagtc actcatgcca cctggatgtg gtttctcttc cacattggga gggtgtggaa 46201 gagattgagc aaggaatgga ggcgtagggg acagcgatga gatcctggag agagaaatgt 46261 agcagtcatt ttacctattg ggttttaatt tatggatgtg agtgattgat gctggggaca 46321 ggccaatgac tgaaagaaga tatttattcc ttgcatttcc ccaagggggg tacataccac 46381 ataacacagg gccacatggt gaggcaccag atttggtcag gaggaaaatt ggagtgaggg 46441 gaaagtttag tttagtccac ggacgctatt ggggtttcca agggaaagca atgcagggct 46501 gggcgtcagg atagtttggc tagtttaaat aattccagga cacctgggct attgggactg 46561 tccctagttt tgcagtacct ggccctgggt tgatttagag tacgggaaat attggcttgg 46621 tttgagaata tgggcccagg gtggtagggg agatggaaac acatttggct gttagtttgg 46681 ccctgtgttt aatgggtggt aaatataagt agaaaataga gaacttaagt aaatacagct 46741 tgaaaagaag ctgcttatgt attcatcttt cccaatttgg cagggccgtg tggtctttga 46801 ggacgtggcc atatatttct cccaggagga gtgggggcac cttgatgagg ctcagagatt 46861 gctgtaccgt gatgtgatgc tggagaattt ggcccttttg tcctcactag gtaaggccct 46921 cacacttgcc cagtgtcctg ggttaggctg tgttacccct ttttacccaa aggcagctct 46981 gcgtttccca cagtgagacc gtgggtgctg cttcttttcc ctgtttcttg gcatatatgc 47041 tgtgggaggc agggctgggc cgtgtgtttt ataccccctt tttcctagca tccccatccc 47101 tgctgctctg aggcttacaa gaaagggctc aagagccaga aatgttgaat ttggcataga 47161 gaccccacca gccttgtctg tccctggtca ggttactctg tccaaacaca tgagacctgt 47221 gttctgttgt cccctttctc tcactgatat ttcttggtgc cttcctgtgc ttggaatttc 47281 aggcactgcc ctggtcactg attttgggga ctgctgagtt gactatgcag gacgcaggac 47341 tcttctgtga agttcttacg attatagaat gtgggatgcc atgggcttga tctctctggg 47401 catgtgctct ttactctttt tctctttctt tcccctagag agggccccag gcacctgaga 47461 gggtggatat tacttcaaca atggcattag aggctcaggg acatggccct ggtgcctggg 47521 aattgggaaa agggttgatg tcacagctgg ggtcaaatca ccctgggaac ctgtcctgtg 47581 ttcaccattg tttgttgcca gggccactgg ccacagattt ctaccttttc ttccccatta 47641 tcatttctag tatgactcct ggcccctgca cctccaatgc cccacagtct gtacttctgg 47701 ggccaccacc cttcaaccat tcttttccac tgctcccttt gaatgcctgc caacacgtgg 47761 cctgtattta acatggtgtg aggtctgcat gtatccctga acacaagctc ctctgtcaca 47821 gtctgatctc tctgggtttg gagagaacct tctacacact gcttgccagt ctccagcaac 47881 catggcctgt tgatggctgt tttcagagga gtgagttgga gaccctgcct tctctctctg 47941 taccataccc ctcccttgta tgcttaccat catcagggct ctcccattat ggggccttgc 48001 caggcccagc cctgcatagt ggaccctcct ctcactggcc ttaatgtcca tgatgtcccc 48061 aatcccatgg ccttatttgt gtgtgtgcct gacacacatt tgtgatggag ctgcttcctc 48121 ccaccagagt caacatgcac ttctccagca tttctgttct gacaggttct tggcatggag 48181 ctgaggatga ggaggcacct tcacagcaag gtttttctgt aggagtgtca gaggttacag 48241 cttcaaagcc ctgtctgtcc agccagaagg tccaccctag tgagacatgt ggcccaccct 48301 tgaaagacat tctgtgcctg gttgagcaca atggaattca tcctgagcaa cacatatata 48361 tttgtgaggc agagcttttt cagcacccaa agcagcaaat tggagaaaat ctttccagag 48421 gggatgattg gataccttca tttgggaaga accacagagt tcacatggca gaggagatct 48481 tcacatgcat ggagggctgg aaggacttac cagccacctc atgccttctc cagcaccagg 48541 gccctcaaag cgagtggaag ccatacaggg acacagagga cagagaagcc tttcagactg 48601 gacaaaatga ttacaaatgt agtgaatgtg ggaaaacctt cacctgcagc tattcatttg 48661 ttgagcacca gaaaatccac acaggagaaa ggtcttatga atgtaacaaa tgtgggaaat 48721 tctttaagta cagtgccaat ttcatgaaac atcagacagt tcacactagt gaaaggactt 48781 atgagtgcag agaatgtgga aaatccttta tgtacaacta ccgactcatg agacataagc 48841 gagttcacac tggagaaagg ccttatgagt gcaacacatg tgggaaattc tttcggtaca 48901 gctccacatt tgttagacat cagagagttc acaccggaga aaggccgtat gagtgcaggg 48961 aatgtgggaa attctttatg gacagctcca cactcattaa acatcagaga gttcacaccg 49021 gagaaagacc ttataagtgc aatgattgtg ggaaattttt taggtatatc tccacactca 49081 ttagacatca gagaattcac actggagaaa ggccttatga gtgcagtgta tgtggggaat 49141 tgtttaggta caactccagc cttgttaaac attggagaaa tcacactgga gaaaggcctt 49201 ataaatgcag tgaatgtggg aaatcattta ggtaccactg caggctcatt agacaccaga 49261 gagtccacac gggagaaagg ccttatgagt gcagcgaatg cgggaaattc tttcgttaca 49321 actccaacct cattaaacat tggagaaatc acactggaga aaggccttac gagtgcagag 49381 agtgtgggaa agcctttagc cacaagcata tacttgttga gcaccagaaa atccacagtg 49441 gagaaagacc ttatgagtgc agcgaatgcc agaaggcctt tattagaaag tctcacctgg 49501 ttcatcacca gaaaatccac agtgaagaga ggcttgtgtg ctccatgaat gtggggaatt 49561 ctttagctaa aactccaacc tcattaaaca tcagagattt cacaatggag aaagtttacc 49621 attgactatt gtaattgggt agtaatgtta tataaattcc acatttttat gcaactaatc 49681 tccagaacat ttttcctctt accaagaagt aaaatgctgt acccattaac aacaactcat 49741 tccccttccc tacttcccca gaaatgtctc aactatattt ctatactcta tggtacttat 49801 atgaggtacc aatagatatc tatgaatttg atatatattt gtacctcata taagtggatt 49861 ctacagtatt tatcttttga gactggctta tttcacttag gataaggtct tcacggttca 49921 cccatgttgt ataatgtgtc agaatatcct tcctttttag gtgaaataat attctatggt 49981 atttatatac cacatttatt tatccattca tctgttagtg gatacttggg ctacttccac 50041 cttttgccta ttgaaataat gctgctatga agatgagtgt acaagtgtct attcaagatt 50101 ctactttcaa ttcttatagg gtatatactc agaaatggtg gtgctggatc atataggatt 50161 tctatttttt ttttttgttt gtttttgaga cagagtcttg ctctgtcacc caggctggag 50221 tgcagtgctg tgatcttggc tcactgcaag ctccgcctcc caggttcatg ccattctcct 50281 gcctcaccct cccgagtagc tgggactaca ggtgcctgcc accacgcctg gctaattttt 50341 ttgtattttt agtagagacg gggtttcacc gtgttagcca ggatggtcct gatctcctga 50401 ccttgtgatc tgcctgcctt ggcctctcaa agtgctggga ttacgggcgt gagccaccgc 50461 gcctggccag gatttctatt tttaatattt ttgggaaaat ttttccatag tacctgtgcc 50521 attttacatt cccaccagca gtgcacaagg attgcaatct atatacatcc tcaccaacat 50581 tgttcatttt ctatttctgt ttttggggtt ttttgtagtg ccttttgttt tggatagcag 50641 ctatcttgtt ggatgtgagg tggaatctat agtgtctttc atttttattt tgtgaatgat 50701 tgatgatgtt gaggatcttt tcatgtgctt gttaggcatt tgtgtatctg gaaaaatatt 50761 caagtctttt ttttccattt ttaatgggac tatttgcttt ttgttgttga gttgtagttc 50821 tttatacatt ctggatatta actccttacc aaatatatgc tttttacata ttacctccca 50881 gtccataggt tgctttttcg ctctgttgat tgtgtccttt gatgaaattt taagttttga 50941 tgtactgttg actctttctg tctgtgggtt ctgtattcat ggatcgaagc aaccatggat 51001 caaaagtatt tggagcatcc atggattgca gtgatcatta atcaaaaata tttggaaaac 51061 aaaaagggta gttgcatctg tactaaacat gaacagacat tttttcttgt cattattccc 51121 taaactatat agtataataa atatttacat agcatttaca ttgtattaga agttataaat 51181 aacctaatga taatctatat aggaagatgt gtgtaggtta tattcaaaca ctatgccttt 51241 ttatgtgagg gacctcttga gcatcagatg attttggtat ccacaagggg tcctggaatc 51301 agtcccccac agacaccaag ggatgactgt agtgcatttt atctattttt acttctgtta 51361 cctgggcttt tgatgttata tattaaaaaa aattagtatc aaatccaatg ccaagcattt 51421 tccctatgct ttattctaag aattttatat ttgaaggtct tacatttagg tctttttttt 51481 ttttttcttt tggaggcaga gtcttgctct gtcacccagc ctggagtgca gtagtggaat 51541 ctcagctcac tacaacctcc gcctcctggg ttcgagccat caccccacct cagcctccca 51601 agtagcttgg attacaagtg tacaccacca cacctggcta atttttgtat ttttagtaga 51661 gatggggttt tgccatgttg gccaggctgg tcttaaactt ctggccttaa gtgatccccc 51721 tgcctcggcc tcccaaattg ctgagattac aggcaggagt tgtaatgcac tgtgcctggc 51781 tacatttaga tctttaatct acttggggtt catttttgca tatggtttaa ggcaaaagtc 51841 cactttatgt ggctatccag ttttccaagc accatttttt gaaaagagca tctttcctct 51901 gttgagtagt cttggcacac ttgtcaaaat catttgtcca tatatgccat ggtttatatg 51961 tggattctct attttattgg tcatatgtct gtctttatgt cagtaccaca cattttaggt 52021 gtgtgtgtgt gagactcagt gttgaggaca aggctagtgg gctttcacac tccagactgc 52081 tgtattccag cccaaattac tcaaattagc caatccatgg ggaacatgga aaacgtagct 52141 aatgcaatcc gcttgcctta cctaagttgt cccctgcagc ctcaggttgc tgttactgtg 52201 tttcagatgc aaccctctgt gggaccctac ccaagttctc tcattcttag ctataggtaa 52261 taaattgttc tgattttgtg tatccaagtg acattgggtt gtttcttgct atcagaagaa 52321 cccagaaaag tattatgaat ctagtgaatg ttggaaaatc tttagccatt agcataacct 52381 catttcgtgc cagcatgttc accctagaga aaagtagaag tgaaggcaat gtcatctttc 52441 cttgttaaca tgataactca gtagagcaat gctttggaga ttagcttttt agggagagag 52501 ccagcagttg agcctcctgc atctgaacat ccacactagg gatattgtct gtactgccag 52561 atatatggga agattttgtg aactgtgttg cagttttcaa cttgactggg gcctttccca 52621 gagttatgcc cctgccagtg cctatttaaa aatgtcatct cttttctacc aactggcaaa 52681 gagccatggt gtgtagcatt ttagtcacat taaaatgcag ttatggcagc atgctgtgtt 52741 ctctatctga agaattcact agtcacttga acactttgga gtcctcaccc ccctcctatg 52801 attgattagg gcatagacat cagccttgga cagaagacat ggactggctt tggcaggcag 52861 attgcagaaa tttcctttct ccaggggaaa gtattggtta tctcattgat agtggtagga 52921 ggcagataca ttgctaggca gactaaggac gggtccctgg tgaaacccaa acttcaagcc 52981 aacgacagtt taaagcctga aaattgagct gccagttcca agtagagtcc atgactggag 53041 tgagaacttc ctcaatgcct tttagccaat caaatggtgc tttttccagg cccacccatg 53101 gaccaatcag tatgcagtct ccattctgag cccataaaaa ccctgggccc agctacacat 53161 tgggctaccc actttcaggt cccctcttgt tgagagcttt tctgccactc aataaagttc 53221 cctgccttgc tcactctctg gtgtccacat aacatcattc ttcttggtca tgggacaaga 53281 actcggaaac tgccaaatgg cgggtgtgaa aggagctgta acactgtagc cctcctgcct 53341 tccaccagcc ccaggcagcc accccatgtg acaggaagca gcggcagcag ggccaggcca 53401 gcccatgagc catgggctgg agcagggtgg caggaccaaa caagctgtga cacaccccca 53461 ttcactgaag tgtgtggatg gtgggaacag acaagctgta acacaaatga gctgtaatgc 53521 ttccttgggg ctcagacctt gggattccct gagcaaaagc tgtaacaccc cttggggctc 53581 tgtggttgct ggcgtctctg agtttttggg cgctgccatg tcccccttgt ccagatacca 53641 gcttccaagg cagaagccgg tcgcagcacg cctggaccag ctgtaggcca agcacagagc 53701 catggtgggc acaggatccg gctggtaaca tgagccaagc acagcctgtc ggactgagtt 53761 agttgagtga gtccagcagg ccaagtgatg tctgggcaga agtgcttcag ccatggaggt 53821 ttctgcctgg tgaagtggca ctgaaagtat cttgtgtcat catgacactt gggatggaat 53881 tttccaacct gccagtcacc cacactgtga actccttctc acccctaatg cacacacata 53941 ccctggttgg ttttgtgata ataaaggtca cattgtttaa gctaccttaa ctcttggaaa 54001 tctagtcacc gtatgcagtg ggttttgaac agaattggtc tctgctagaa taaaagcatg 54061 aatggttttt tgtgggacct ttactttgtg agctccagag ggactagtag gaagcaaaag 54121 atcagctcgt atgcagattt gggccaattt gcattgccat gagaagcctc ctgggaaagt 54181 ctgaaggact tctgcacaaa atttcaagcc ctgagtacaa agattatttg tattcagaaa 54241 agtacaattt gaggagaaaa cagttgcctt gatgtttaag gcattgggca acagtatgca 54301 ttgggtatcc ctacccatat ttcccatatt tcccctgcat tgatgaggga tagctgtttc 54361 ttgtgacctg ctggaagcta tgggtcctga agtaaaacac tatctaggta tatgttcctg 54421 gctgttgacc tttagggcta gatggaagcc atattctttt tttttttttt ttttcttgag 54481 acgtagtctc gctctgttgc ccaggctgga gtgcagtggg atgatctcag ctcactgcaa 54541 cctcgcttcc tgggttcaaa caattatcct gcctcagcct tccgagtagc tgggactata 54601 ggtgcacgcc accacacccg gctaattttt gtatttttat tagagatggg gtttcaccat 54661 attatattgg ccaggctagt ctcaaactcc tgacctcgtg atccgcccac ctcagcctcc 54721 caaagtgcca ggattacagg agtgagccac tgcacctggc cagaagccat attctataat 54781 aaatagtggt tggattaata ggacatggga gagactgcag tagagtggat gagtcccctc 54841 taaaggagca ctcacaaatg ccctggtagc tatggctgtg tggggtgggg tattacagga 54901 attccaaaga cctaaggagc tgtgtacctc ctgtagccaa ggtcaatgtc agaacaacta 54961 aatcaatgat tttatgaatt cctgtagcct ccctggacta taaatttcaa accatagttg 55021 catcatctac atagtgatgg gccttgagcc ttccctaaag ataaaaagcc tacatagccc 55081 gtgctgcccc atcacctgtg tgtcggaaca tcttccttgt ttcaagctac atgagtgctt 55141 ttttattctg ttgaagtgtg tcatgtcatg cctggtgaat taataaatct gtcctctgca 55201 actgacaggg ttcttccact gtgaaacaag agggttgtat ataggttgcg tttaactaac 55261 agagttaatt aaaccttttt actttaaaaa ttactcagtc ttgggccagg cgcggtggct 55321 cacgcctgta atcccagcac tttgggaggc cgaggcaggc ggatcacgat gtcaggagat 55381 cgagaccatc ctggctaaca cagtgaaacc ccgtctctac taaaaataca aaaattagct 55441 gggcgtggtg gcaggcacct gtaatcccag ctactcagga ggctgaggca ggagaatcac 55501 ttgagcccag gagttggaga ttgtggcgag ccgagattgc gccattgcac tacagcctgg 55561 gcaacaagag tgaaactcca tctttttttt tttttttttt ttttttgaca cagagtcttg 55621 cactgtcacc caggctggag tgcagtggtg tgatctcagc tcactgcaag ctctgcctcc 55681 caggttcaca ccattctcct gcctcagcct cccgagtagc tgggactaca ggtgtccgcc 55741 acgacgccca gctaagtttt tgtattttta gtagagacgg ggtttcaccg tgttaaccag 55801 gatggtctcg atctcctgac ctcatgatct gcccgcctcg gcctcccaaa gtgctgggat 55861 tacaggcatg aaccactgtg cctggccact ccatcttaaa caaatttaaa aaatagttta 55921 tctctctatt ttttaattta ctgttgtttg tgggggtttt gaagatgtat caatgatttt 55981 ggttgagtag ggcactttag ctttacttct gagtgcatgc agcagtaaag tttttttttt 56041 atgatttcct tggctttaaa cagttcaagt ggttttctca agtgtgttag ggtgcacact 56101 gttagttatt agtggaggtt ttggtaaagt tgtgctggga acagaatgcc agatgagctt 56161 gtcttcaggc ttcagcagta ctggtggtga gctatgtatg tttatccttg tactttgcgg 56221 tgctgaatgt tggtacctgt gttggcagtt ctaggcaggc cgattcttgg gcctctggtt 56281 ggctttctta gatgctggtt gtgatagcag tgtaccaagc atgtgagtgc actcttgagc 56341 ccctgggctg ctggtgtggc atccgtgatg gtggtggcag tggtgcgaaa ctcttctggg 56401 tcgcacttgc tgtgctcatt agcagtggtt gcagcgggct ctgtgggcca gccactagac 56461 cagcagatgg cgcttgtagg caggagatgg ctgaagtggg tgcagtaggg tatttaggcc 56521 taacctcagc gccctaggag tgctcaggtg tttcacttgg tggactaggt tgtgcaatct 56581 ctgggggatt ttaaaagttt attttgcaat tgagaatggg atgtttagaa tttttgtctc 56641 ctactgatta aaggtagagt atggggaaaa aaacaggcaa aactaggagt ttaaagaaaa 56701 ggagagagaa gaaaaagaaa tgaagataca ataatacaga aagaacagag ggaaaaactt 56761 tatttatatt taatacctat gatgtgacat gcccggagtt aggagcattc ataagcacca 56821 tctcatatta tcctccaact aagctactca tgtcagaaaa aattaaaatt aacaaaggtc 56881 aacagggagt aagaaataga aaatggaaag gaaacaattt tggagtgaga cctacaaaat 56941 gatctcagaa tttaccctct gataagtctt tctttgatcc cagagtgcac aaaatgttac 57001 atgaaaaaac aaattaatat aattacatct aaatttcaag ctttctattt cataaagatt 57061 atgagtcaaa actgagtgag ttcagggact cagagaagac atttaaaata tctaaatgtg 57121 cccagtttta cggttttagg catgcgagtt ttagattttt ttgctttttt attgagacag 57181 ggtctctttc acctaggatg gagtgtagtg gcgtgatctt cactcacagt agcctccacc 57241 tctcaggttc aagcaatcct cccattcagc ctccccagta gctgggacta caagtgcgca 57301 ctaccatgct tggctaattt ttaacttttt tggtagagac ggggatctca ctatactgcc 57361 caggctggtc tcgaactcct agactcaaat gatccttcca cctccacctt tcaaagtact 57421 gggattacag gcatgagcca ctgtgcccag ccaaagtttc agattcttct taaaggaata 57481 aagtcagaga tcttagaaat gaactaaagc acgagcatac aataaacaag agagaaaaat 57541 cagatggtta tcaagaatat gaatggatat tcaaactcgg tgtcagagaa atgcaaatta 57601 gatattattg catgttactt taagttctat aaattactat actatggaaa gaaatatggg 57661 gtgatgtagc tgcaatcatg tatatttcat ggaagagtca aagactgtag cagtttcata 57721 aagcaatctg cacaggtact gtaaaaaaaa tgattttcta gaatatcatg tgacacagaa 57781 agcctgtttc tggtatatct ctcagtcaag tccataaatt cttatgtgag agaagttttc 57841 tcacaatact gttcaaggca gcagggacat ggagacaata tgttgtagca ttaagtgtaa 57901 gaggaggacc aaagacatct atgagtgaag tagaagactg aatgtccaca ctaaagtaca 57961 acagggcagt tactagcatt taagtagact ttgacacagg agtatatata catattaaac 58021 acacattttg agtaccaaca aagggagaaa gaagtctata gtacagtaca attaaactac 58081 acaaaaagta acagtgctcc tctttctaaa acacatgtaa atacaaaaac acacaacaga 58141 cctttagaat tgttgtctac aagtggtggg gtgatgaagt gtgaggaaca ggaatgagtg 58201 gaaacacgtc ggaacattga acaagacgag gtccttgtgg ggacataagg agagaatgtc 58261 cactcactgt ccagagcctc ctgggccatt gtcaccatac agccgctcat tacatgaaag 58321 tggaatttca aaaggtagcg aaacacctaa taaaatatgt tgaaagggct gggcatggtg 58381 gctcatgcct gtaatcccag cactttggga ggccaagccg ggtggatcat gaggtcagga 58441 gttcaagacc agcctggcca acatggtgaa accctgtctc tactaaaaat acaaaaatta 58501 gccaggtgtg gtggtgcaca cctgtagtcc cagctactcg ggagctgagg caggagaatt 58561 gcttgaaccc aggaggcaga ggttgcagtg agctgagatc gtgtgactgc actccagcct 58621 gggcaacaga gcgagactct ctctctctcc ctctctccat atatatatat atatatacat 58681 acacatacac aaagatatgt ggcctgtcac caagtgttag atggatgttt gggccaggcg 58741 aggtggctca tgcctgtaat cccagcactt tgggaggctg aggcgggtgg atcacctgag 58801 gccaggagtt caagaccagc ctggccaaca tggtgaaacc ccgtctctac taaaagtaca 58861 aaaatcagcc aggcgtggtg gccatgcctg taatcccagc tactcaggag gctgaggcag 58921 gagaatcgcc tgaacctggg aggcagaggt tgccgtgagc cgagatcgtg ccattgcact 58981 ccagcctggg cgacagagag agactctatc tcaaaaaaag aaaaaaaaag tttgtttagt 59041 ttctgttggt aaatatctag aaatagaatt gtaggatggt agggatgtat tgttggtaaa 59101 tgattaacat gtttttaatt ttcagttttt tcagagacgg ggatctcact atattgccca 59161 aactggtttt gaacttatga actcaagtaa tcctcccgtc ttaacttcca tgtagctggc 59221 actactgaac agctttaact cttccagacc ttcagttggg tggttttttc ttttcctttc 59281 gtttccttct ttctctctcc ttccttcctt ctttttgtaa tttgataaac caaatttcaa 59341 gactttggcc aaaaaacagt cttctccttt actgaatatc tcagtgctcc attttatttt 59401 tccctcacca attctgcagg agagatcaca agactatgtc catatcttta tttgtggggg 59461 tatgattttc agctaacata gctgctaagt gatttaattt acaattatag tggtagaaaa 59521 atagctctaa cttctgaggt tttgtaattt attgtaaaaa gcggaataca ggacatttaa 59581 agtgacctga caaggtcaat acaggactga ggactaattc attaaatacg tcttacgtgt 59641 cttagaggac cacagacagc tagcaaccct actcctaatg tgtcccatgc acttagtatt 59701 gtacctggca tcgaatgagt gttcatcaaa tatgttagag taataaatac gacagcacag 59761 tttccctatt cagctttact tttctggtaa ctcttttaat atatagtctt tacctctccc 59821 ttcaagctgt taatgcacaa tgttttcaag tttgaaaaag aatcagattt tggacaccaa 59881 acttgagtgt gtccaaggtt ccaggacaac cttagctttt ttcctcctat cacaaatatg 59941 tttctctata actatttttg ccactttctt tggatatcta cataatcgtc attaatagtt 60001 tcagtaatag ccatttcaca agagttattc agagacaatt tcaactccct gtgcagacac 60061 tgttgataca atttcaggcg aaatgataaa tcactttcca ctaatgagat gagatcatgc 60121 attcccaaac ctttatctca ggagtaagat gctttctttg cacaagggct ttgactggtg 60181 gaaggccaca ctattctggc agataatggt agataagtcg cttttccttt tcagttaggt 60241 ttcagtctct taaaataagt ggaaaatatc tcagatataa agtaataagt atcatgtgaa 60301 taaagtgttg ctgggtaggc tgaaagacaa taactgatta ttcagaaagg aaagatatta 60361 ctcctcaaac tccccatcat gccatgtctg ggtcttttca atgtgcccag cagagcacat 60421 tgaagatcat aaaacagtct ccagagcttc gttctgtgtg tatttggcaa gcacaaaaga 60481 gctgaccgct atgcgtggaa gaccagttct ttatggttgg ccatgttcct tcggatgttc 60541 gtcttttggg aacttgctgt gctttacaga atctgaaagg acaggtctct gaaacatttc 60601 tctgacttgt tctagactgc cttccgggaa aatctggatg cttgccgttt aaaagccaat 60661 gaagcccctc cacgtccttg gagccccgca actgcatttc tcaaagcctc aggaggatcc 60721 tcctggcttt catcctgaac gcgagttaac ttaagttcag aagcggggca ggcagtggcc 60781 tgggaactac attacccaaa agacacagcg gcggacacaa gcagcgagtg tagccaatga 60841 aggcctagca gagcggcgtc tacgggggtt cgcaatgcgt gtgggcggga cttcctgcaa 60901 cgcctcctgg ggttgtcaat atggctgcgt tgggatctgt tcaccttcag gctgagtcga 60961 gactgaggtg aaaaagcgga aaaacgcgag aaaaggtttc cccgttgtac agaggctaga 61021 gtgaggctcg gttgaatcgg ttgcaggcgt tggtgcctct gtcagcgtcc aggtcactgc 61081 cgctcccgcc ccgctcttcc ctggctgtgc tggcggaggc tgcgccgatg aacctgactg 61141 aggtgggtgc cgcgtcccag ggcgccccgc ccgatccctc ctccgagtgc cgaagccccg 61201 aggaggggcc ctgcaggtca ggcccctgtg tcccaaagag aggagcgttc ttgtgtgggg 61261 taccggtgtc cgcggtgctg tgaggcgggg gagctcctgt cagggacctg cacgtgcgag 61321 gcttagaggt gctgcagagc gggcggcact ggggagcccg agcgtctttg tctccacgga 61381 gctgagggtg aggcggagtc tcgcctggga gggcagcgga gctgctgaaa gccgtagaag 61441 gctgtgcagg aagggttatg cccagaggta gacgtttaaa gttgggaaaa acaggatgtt 61501 aaaccaggac aaggggaccg ctgaggagac cccttgagtt atggtaggga tgggccagag 61561 gggttgcagt agaggggaag tgtgatgaga ttctggatgg atctcagaga tagagtcgac 61621 aggatttcct gctgcatcaa atgtagcagt gagggtagtg aagggtctat tccatggatt 61681 ttgttttggc ctgaaaatct gaaagaatgg agttggagaa gtcagcttag ggagcaggtt 61741 tcagagagga ttatgagttg cattttggag ggctgagttt tggagggcag atcatgggtg 61801 ggtccataag cctgacgcta cctttagaaa ggcgcagacc tgtcatagga gggattctgg 61861 ctcatatctg caaaaacgga gttctgaatg ggttaaggaa ctggcctagg ctggagactg 61921 aggtgtgagg tggtttagat ttctgcacct gcagacccag cagcccccca ggcccactct 61981 gtggaacaat aggaaggtca tccagactag tggggacatt gggtccagac gggggtaggg 62041 ctttctcaca tctagtcaag ggcctccagc aagcttccag caactgaatt tgtagcaggc 62101 aggcctggtt gctgatggga ccagtaactg acctgcttgc ttatcccctc cctgcaactt 62161 atttatttat gaatgtgtaa aacatttttt ttttttttga gacagagtct cgctccatcg 62221 cccaggttgg agtgcagtgg cgcgatctcg gctcactgca agctccgcct cccgagttca 62281 agcaattctc ctgcctcagc ctcccgagta gctgggacta caagtgcccg ccaccacgcc 62341 aggctaattt tttgcatttt tagtagagac ggggtttcac cgtgttagct agtatggtct 62401 ctatctcctg acctcatgat ctgcacgcct cagcctccca aagtgctggg attacaggca 62461 tgagccaccg cgcccggccg aatgtataaa acatttattt aaccactaat gaaccagtat 62521 ccttctaaat atgtacacat aggtcggcag ttagaaatga atgtaaccaa tagttaaatt 62581 tattcactta ataattgttg aatgttgatt tttctttttt cttttttttt ttgagacgga 62641 gtctcgctct gtcgccaggc tggagtgcag tggcgtggtc tcggctcact gcaacctctg 62701 cctcccgggt tcaagcgatt ctcctacctc agcctcccga gtagctggga ttacagacat 62761 gcgccaccat gcccatctaa tttttgtatt tttgatagag acagggtttc accatggttg 62821 gccaggatgg tcttgatctc tgcacctcgt gatccgccca cctcggcctc ccaaagtgct 62881 gggattagag gcttgagcca ccatgcccgg ctgaatgtta atttttctat ttgcttgcaa 62941 tggaaagttt tatgtatcac atttccttaa caccgtaaga agtgggaaaa atacacaccc 63001 aaaatttcaa aacccttcca tgtcatttgt accgaaaaca tgtaaaaaat atttaatcca 63061 gtagaaagaa attgctgata catagtgtgt actacttata tttgatgccc atataagccc 63121 tattagttgg caatcctttt aatctctatt ttacacataa gggcaccgag acttaaggaa 63181 gttcagtcct catcaagaca ctgaaccctg aggacttact gtgtaccctc agataacttt 63241 gctctagttg cagggccaga ggtggttgta cagtcagcct gtaggatgct gcaatgctgt 63301 cttaatcctc atggcctgcc tcttcccaca gggttcatag cagtggcagc aatgcttatg 63361 gatgctggac aggtgagtgg agagtgtttc cagctttcac ccatcccaga tggtttcagc 63421 atgctgatat agggaatggt gttatcctga gactgttcac cttttcttct ctctcccttg 63481 ggtccaagga gagcctgtag ttccaccaga cctgggttcc aaactcagtg cctggttata 63541 tagagtggtt ctagttctca caactgatgg tttgagattt ggcaaggcat ctcaaacaca 63601 ggatctagaa tgttcaaatg cttggtggtg agacttagat gggatgctta gagaactgta 63661 gcttaatgtt ctagctggca aggataagga gcacagcagc agggcaggag gtcggggcat 63721 gcagggatca ggcatggaat ggcagcctga ggttgtctag gtttttgttt ttgttttgtt 63781 ttgtttttga gacagaatct tgctctgtct ccaggctgga gtgcagtggc gcgatctcag 63841 ctcactacaa cctccgcctt cctggttcaa gcgattctcc tgccccggcc tcccgagtag 63901 ctgggattat aggcacccgc caccatgccc ggctaatttt tgtatttttt agtagagatg 63961 gggtttcacc atgttggcca ggatgttctc aatctcttga cctcgtgatc tgcctgcctc 64021 ggccctccca aggtgctggg attacaggct tgagccacca cacctggcct tgtttttttt 64081 tttttttcca gacagaggct aactctgtcg cccaggcggg agtgcattga tgtgatctcg 64141 gatcattgcg acctctgtct cccaggttca agcaattctc ctgcctcagc cttccaagta 64201 gctgggatta caggcatgaa ccaacacacc cagctaattt ttgtattttt agtagagaca 64261 ggatttcacc atgttggcca ggctggtctc gaactcctgg cctcaagtga tctgcctgcc 64321 tcagcctccc aaagtgctgg gattacaggt atgagccacc gtgcccagtg gttctaggtc 64381 tttaatctga ggggggtagt agctagggaa ggtttgagaa agaaaaaaga ggtgatttga 64441 cctagattct tttttttttt ttttgagacg gagtctccct ctgtcgccca ggccagagtg 64501 cagtggcgtg aactcggctc actgcaagct tcgcttcctg ggttcacgcc attctcctgc 64561 ctcagcctcc caagtagctg ggattacagg cacctgccat catgcccggc taactttttt 64621 ttgtattttt tagtagagcc agggtttcac cgtgttagga tggtctcgat ctcctgacct 64681 catgatctgc ccgcctcagc ctcccaaagt gctggaatta caggcctgag ccactgcacc 64741 cagccagatt tgacctagat tctaagaggc ttcctctgcc tgcagaatgg aggacagagt 64801 gagagttagg gaagaaccaa ggatacctgg gtgataattc ctgccacagt ccagtggagg 64861 ttggtggtgg ctggacacga gtggcggctg tggaggtggg aggggtgggt ggcttctgta 64921 ggttttgatc tagagctgct cacatttttt gatgtacaga gcgaggaagc tcgggaggta 64981 ggatgcaaga gggagtcagg aatggcccca ttgttttggc cttgttgaga aggttggatg 65041 ttccagcaac taagatgtgg taaattttgt agtaggggga gtttaataga gcatatgaag 65101 aacgaggttt gggctaagtc aaaaatgttc cagagactcg agtagaggtg ccagactggc 65161 agttgaacac acaggaatgg agttcaatgg aggtgttcag gatggagaca tctactatat 65221 gatagccagc gtgtacacca tggtaaatct gtgggtcaga tttaggaata gacatcttta 65281 gattctggag gcaatgctgt tagggagaag gaaggggaag gaaggggctg aggactggat 65341 ctttctgttg ggcacctgga tggggttgct gggcatcaca aagacagatg agggagtgga 65401 tggactgagg atatgtgtgt gtgtatttgt gtgtttatgt tggagatggc ctttggaaaa 65461 taaaggcttt tgagggggag agtgcacata aaagttggat ggtgaggcat ctccacaaaa 65521 gaggtgagat gatagtgagg tctgggaggg gcttagtacc ctattatgga ctcaacaggg 65581 ttttgtggca aatattggtt gtacctgggt agtttcacgc gaactctgag actattttgg 65641 cagaagatgg gatcatctag aagcattgct agtcctgggg tggggatggt gttggggagg 65701 tcagtgtatc tgggctttgg attagagttg ggggatagag aaagtcgtga ctgcttatgg 65761 gacaacatat acatgtagaa gggggagaca gccaagatct ggaactcaaa ggaggtgtgc 65821 aagaatatgt gggtttggaa agctgaggtc aaagaaagaa acaggattgg gttgggagac 65881 gttcaaagca atgtttccta aacattgcat tctggtcatt ctaggggcaa atacgattct 65941 ggttgcattt gtcatgcact gcaggatttc atgtgcagag tggcatggta acacgccatg 66001 gaaatactcc cagttgggct gtggtggttg ttagtgttgc caagagtgga ggtgttggat 66061 acatgaggga ggtgccatcc aaagcctgag tggaggagca tgaatagatg gttccacctc 66121 tggcactggg cacttaaggg aggagggtga gtccaagttt ggttatctga gaggcaagat 66181 ctgggtgact ctagggaaat tgagtgacaa aagaaaggta ggcccccatg tgccaggtgt 66241 tttttttgtt ttgttttgtt ttttgagatg gagtctagct ctgttgccca ggatggagtg 66301 cagtggcgcg atctcggctc actgcaacct ctgcctccca ggttcaagca gtactctgcc 66361 tcagcctccc gagtagctgg gattacaggt gcctgccacc atgcccggct aatttttgta 66421 tttttagtag agatggggtt ttactatctt ggccaggcag gtcttgaact cctaaccttg 66481 tgatctacct gcctcagcct cccaaagtgc tgggattaga ggcatgagcc accatgcccg 66541 gccatgtgcc aggttttgat ggcgcttttc agtctctttc ccccacccat ctttgttcta 66601 tactggagtc agtgatcagt ggcaatgagg ggagagcagg tacaggtgcc atgaagacct 66661 agtgtctctt cctccttcag aagtgggtgg aggctagcag ggtgtggatg tgtgtgggtg 66721 tggtgaggag cactgaaggt cctggagagg gaagtgtatc agtgattgta cccacaggat 66781 ttcaatctgt gaacaagagt gagtgattac agaaaagcca atgacattga aataattttt 66841 tttttcaatt ggtgagaaat catacctgga tgaaatgatt tttattcctt tcatttcctg 66901 agtgcaggga gcatgccaca tcacataaag ccacagagga aatagcggat ttggtcaggt 66961 ggcagatgca cagttgaagg gagagcatat atcagtggct ttattggggt tttggctgga 67021 aaggcatgca ggggagagtg aagagcttaa gactgggtag tttgcatgat tttggcagcc 67081 ttggggtata gggactatcc ctccttttgt ggtacaagcc ttgtgttgat ttagggcagg 67141 agaaagaatc atgtgtgatt gttagatagg aggcagttca gtctatggga tctggattat 67201 aagagaaatg ttaaaatgct ggttgttatt ttggtcctct aattcttaga tttcaagtag 67261 ataaatacag atctaaggaa acagaataag aagttgctca tgggccgggc gtggtggctc 67321 atgcctgtaa tcccagcaat ttgggaggcc aaggcaggca gttcacgagg tcaggagttc 67381 tagaccagcc tgaacaacat ggtgaaaccc catctctact aaaaatacaa aaattagcca 67441 gtcatggtgg cacgcgcctg taatcccagc tactcaggag gctgaggcag gagaatcact 67501 tgaatccggg aggcggagct tgcagtgagc cgagatcaca ccactgcact ccagcctggc 67561 aacagagcga gactccatct caaaaaaaaa aaaaaaaaga aagaaaaaaa aagttgctca 67621 cagactaaac tgttataatt ttggcaggat tatatggttt ttgaggacgt ggccatacat 67681 ttctcccagg aggagtgggg aattcttaat gacgttcaga gacacctgca cagcgatgtg 67741 atgctggaga actttgcact tttgtcctca gtaggtaagg ccctgacacc tacgtcagtg 67801 tcttgtgctg ggcaatgttt tttcactttt ttccttggca tctccgtctc acaccaggtc 67861 atggatgctg catcctctcc tggtttcctg gcatatgtgt tgtggcagct acagctgggc 67921 tgtgtgatct gtatcaattt tccttaagaa gcttagctct tgtcactctg aagccttata 67981 aggctcaaaa gtcagaagtc ctcaggtttc tcaagaatat cctaatgacc ttgcctgtcc 68041 ctggttagtt tactctgtcc aaatgcatga cactttctct gtgcaaggct ttctccattc 68101 ttcttgctga catagggata ccctgtgtca ggaatttcag ggacctacct cgtcactgct 68161 tgtgaggact gtgggcttat ttctgcagga ctctccacta gaagttctgg taactttaga 68221 atacagcatg tcctgggttt attgctctgt gtacatgctg ttgacaccct ccctttccct 68281 gtctttcccc tagcttggct aacgccacac ctagatagtt gctcaccttg agcaaggtag 68341 aggtccctgg gtgcctgaca gagtggacat tactccattg aaggcaagag atgctcaaaa 68401 ggacttggcc ctggtggatg ggagctcaga aggggtgtga tgacagggct gggttcacat 68461 cacactggga atcaatccag tttgatgccc ctgccagtgg ccacaccgca tgtctctctt 68521 ttcttcccta ttgtgatctc tcttttgact ttcatctcct ggctcccacc tctgacccta 68581 cgtctctggc ctccacctct cacctgttcc cgttttttgt tttttttttg tttttgtttt 68641 tgtttttgag atggagtttc gctcttgcaa tggtgcgcgc gatcttgact cactgcaacc 68701 tctgcttccc gggttcaagc gattcttctg cctcagcatc ccaaatagct gggattacag 68761 gcatgcgcca ccacacctgg ctaattttgt atttttagta gagacggtgt ttctccatgt 68821 tggtcaggct ggtctcgaac tcctgacctc agctgatccg cctgccttgg cctcccaaag 68881 ttctgggatt ataggtgtga gccactgcac ccagccacct gttcctttta ctgttccctt 68941 tgtagtgccc tcaacatatg acctgtactt aaggtgttgt gaggtctgga tgtatatatt 69001 tcagcagtcc ctgaacacca gctctactgt cacatcagat ctctctgggt ctggacacag 69061 tcttctacac attgttcacc agaagctatg gcctgttgaa agccattcac tgaggagtaa 69121 cttggggact ccacttctgt gctctgcact gtaccactcc ctatgttgtt ccccatcatc 69181 aggactcttt cgttatgggt ttctgtcagt cccggttctg cccagtaggc tttcctctca 69241 ctggctttaa tgtcccatgc ctgtcaccaa tcccatggtc ttatttgtga gttggtctga 69301 cagacatttg ttatggggct gcctcttccc tccaaagtca tcatgcactt catgagcatt 69361 tctgttttag gttgttggca tggagccaag gatgaggagg caccttccaa gcaatgtgtt 69421 tctgtaggag tgtcacaggt cacaacttta aagccagctt tgtccaccca gaaggcccag 69481 ccctgtgaga catgtagctc acttctgaag gacattctac acctggctga gcatgacgga 69541 acacacccca agcgtacagc caagctttac ctgcaccaaa aggagcatct tagagagaag 69601 ctcaccagaa gtgatgaagg gaggccttcg tttgtgaatg acagtgttca cctggcaaag 69661 aggaacctca catgcatgca gggtggcaag gattttactg gtgattcaga tcttcaacaa 69721 caggctcttc acagtgggtg gaagccacac agggacactc atggtgtgga ggcctttcaa 69781 agtggacaga ataattacag ctgcacccaa tgtgggaaag acttttgcca ccaacataca 69841 ctgtttgagc accagaaaat ccacacagag gaaaggcctt atgagtgcag tgaatgtggc 69901 aaattgttta ggtacaactc cgaccttatt aaacatcagc gaaatcatac tggagaaagg 69961 ccttataagt gtagtgaatg tggaaaagcc ttcagcctca aatacaatgt tgttcaacac 70021 cagaaaattc acactggaga aaggccttat gagtgcagtg aatgtgggaa agcttttctt 70081 agaaagtctc acctacttca gcaccagagg attcacacca ggccaaggcc ttatgtgtgt 70141 agtgaatgtg ggaaggcctt ccttacacag gctcaccttg ttggtcacca gaaaattcat 70201 actggagaac ggccttatgg atgcaatgaa tgtgggaaat actttatgta cagttcagca 70261 ctcattagac atcagaaagt tcacactgga gaaaggcctt tttattgctg tgaatgtggg 70321 aaattcttta tggacagctg cacactcatt attcaccaga gagttcatac tggagaaaaa 70381 ccttatgaat gcaacgaatg tgggaaattc tttagatacc gttccacact cattagacat 70441 cagaaagttc acactggaga aaagccttat gagtgtagtg aatgtgggaa gttctttatg 70501 gacacttcca cactcattat tcatcagaga gttcatactg gagaaaagcc ttatgaatgc 70561 aacaaatgtg ggaaattctt taggtattgc ttcacactga atagacatca gagagttcac 70621 tctggagaga ggccttatga atgcagtgaa tgtggcaaat tctttgtgga cagctgtaca 70681 ctgaagagtc atcagagagt tcacactgga gaaagacctt ttgaatgcag catttgtggg 70741 aaatccttta gatgtcgctc cacacttgat acacatcaga gaattcacac tggtgaaagg 70801 ccttatgagt gtagtgaatg tgggaaattc tttaggcaca actcaaatca tattagacat 70861 cggagaaatc actttggaga aaggtctttt gagtgcactg agtgtgggag agtttttagc 70921 caaaattccc acctcattcg gcaccaaaaa gttcacacta gggaaagaac ttacaaatgc 70981 agcaaatgtg ggaaattttt tatggacagc tccacactca ttagtcatga gagagttcat 71041 actggagaaa agccttatga gtgcagtgaa tgtgggaaag tctttagata caactccagc 71101 ctcattaaac atcggagaat tcacactgga gagagacctt atcagtgcag tgaatgtgga 71161 agagtcttta accaaaattc tcatctcatt cagcaccaga aagttcacac cagataaaga 71221 atgtatatat aaagcagatg gggaaagact tcacacagaa atctactctg atttagcact 71281 gggacctacg ttttaaaaaa agtattcttg tagaatacag ataacataaa atctaacatc 71341 ttaaccatgt taaagtgtat agttcagtac tgttaagtca ttcacattgt gcaatgaata 71401 tctagaagtc ttttcaactt atgaaactaa gtctatacct tttaaaacct tattcctcac 71461 tccatccagc ctcttgacaa gcaccgctct gtatgaattt tactagtccg ggtacctcat 71521 ataagaaaac ttaagttttg gtcttcttgt ggtttatttt gtggcttatt ttgcttaacg 71581 ttatattttt aaggtttcat gttctaatcc attagaattt ccatcctttt taaaggctga 71641 ataaaattct gttagtcatg tgttgcttaa cagtggggaa gtgtcctgag aaaagtgtta 71701 ttaggtgatt ttctttcttt ttttggtggt gggggggttg cgtgaatgcc taggctgtat 71761 ggtatatcct atagcacctt gctacaaact tgtatagcat attactgtac tgaatactgt 71821 aggctgttgg aacacatggt aagtaattgt ttttaagtat atctaaacag aaaaggtaca 71881 gtaaaaatac agtataaaag aaaaaatgat agactcacag agaacttacc atgaatgaag 71941 cttacagtac tgcaagttgc tctaggtgag tcagtgagtg gtaagtgaat gtgaaggcct 72001 aggttgttac tgtgctgtag actttataga cattgtgtac ttagacgaca atacattttt 72061 atttttatta ttatttttga gacagaatct tgctctgttg cccagactag agtgcagtgg 72121 tgcaatcttg gcttcctgca acctcctcca cctcctggtt caagcagttc tgcctcagct 72181 tcccaagtgt ctgggattac aggcatgcac caccatgccc cgctaatttt tgtattttta 72241 gtagagaacg gggtcttacc atgttggcca ggctggtctc aaactcccga cctcaagtga 72301 gccactcgct ttggcctccc aaagtgctgg gattacaggc atgagccacc gtgcccggct 72361 ggacaaaatt aaatttatag aaatttttct ttaatatatt aaccttagct gaccattttt 72421 tactttataa gcttgtattt ttaaaaactt tttgactttt gtaataatgt tttgcttaaa 72481 acattgtaca actgtacaaa aatatttttt atatcctaac ttcttaaatt ttttttgtta 72541 aaaactaaga tacacacatt tgtctaggcc ggcacaggat caggataatg tcattttatt 72601 ccaccttcac aacctgttcc agaagatctt ctgggacagt aacacacatg gagctgtcat 72661 ctaaaataac aatgcctttt tctggaatac ctcctgaagg acctgcccga ggctgtgtta 72721 cagtttgtgt gtatgtacac ctgtatattt atagacatac acacaactcc atatatactc 72781 tactgttgga gaacacctaa tgataaaaag tgtagtatgc taagaacata agctagtaac 72841 tcgtaagttt tatctactgt acattattgt atatgttata cttctatatg actggcctca 72901 cagtaggttt gtttatacca gcatcaccat gaatatgtga ttaatatatt gccttactac 72961 tgctatcatg tcattaggca ataagcattt ttcagcttca ttgtaatttt atggaaccac 73021 tgtcatgaat gtggcccatt gttgactgaa aagtgaggtg catgattata tatgtatgtg 73081 ccacttttcc tttcattagt ggacatttgg gttgtttcca ctcagctgtt gttaatagca 73141 gatgagcatg aatgtacaaa tgtttctttg aggctgtgct ttaaattcct tttgaggatt 73201 tactcagaag tagaattggt gctttctatg tttttttttt gtttgtttgt ttgtttgttt 73261 gttttttgag acgggagtct cgctctgtcg ccaggctgga gtacagtggc ctgatctcag 73321 ctcactgcag cctctgcctc cggggttcaa gcagttctcc tgcctcagcc tcccaagtag 73381 ctgggactac aggcatgcac caccatgccc agataatttt tgtatgttta gttgagacgg 73441 ggtttcaccg tgttggcaag gatggtctca atctcgacct catgatccac tcacctgtcc 73501 ccctatgttc atttaaaaaa tatttttaat tttttaaatt tctgactcac aatgttccca 73561 ttctatgtta attctgtttt aaaatttttg aggactcttc tatagcagct gcatcatttt 73621 ttaatttaac tttattatta attttttttt ttagtagttt tactgggttg agtttttttt 73681 ttttctttag ggttatttgg ttttgagatg taggagtttt ttagtatatg tggggacttc 73741 agaaagtttg tggaaacatg aaagtaaagg atacaaaaag aaaacaagtt ttatttctca 73801 acataagctc catcaagttc aagacaattt tataagtgat gataccaggc atttagtcca 73861 tccctaaaga agtgaggttc ctgggaattt aaccttgtct atgcaatctt ttttaatatt 73921 aactaaagaa aaatgggtgc cctttaaaga ttttttaaaa gtaggaaaca agaagtcaga 73981 aggagccaaa tccggactgt aaggtggatg ccttagggtt atcagaattc ttgtaaagtt 74041 gcagttattt gatgagagga atgagcagga gcattgaagt gaaggactcc tggtgaagct 74101 ttcccatggg ttttctgcta aagctttagc taactttctc aaaacactct cattgcgggc 74161 gccgtggctc acgtctgtaa tcccagcaat ttgggaggcc gagttgggcg gatcatctga 74221 gctcaggagt ttgagaccag cctgaccaac atggagaaac ccccctctct actaaaaaat 74281 acaaaaatta gcctggcgag gtggtgcatg cctgtaatcc cagctactca ggaggctgag 74341 gcaggagaat cgcttgaacc cgggaggcgg aggtggcagt gagccgagat cacaccattg 74401 cactccagcc tgggcaacaa gagtgaaact ccatctcaaa tagtaattaa taataaagcc 74461 atgtttcatc tgtacaattc ttcaaagaaa tgcttcagca tcttgatcct acttgtttaa 74521 catttacatt gaaggttctg ctcttgtctg cagctgatct ggattcagtg gctttggcac 74581 ccattgagtg gaaagttcgc acaactttaa tttttcagtc agaattgtgt aagctgaacc 74641 aattcagatg tctgtggtgt cagttactgt ttctgctgtt aatcatcagt cttcttcaat 74701 aaggtcatga gcaagatgaa tttcttcctc gcaaattgtt attaatggtt tgccattgtg 74761 ggctgcgtgg tcaacatcat ctcatctctt cttaaaacgt tatcaggccg ggcgcggtgg 74821 ctcacgcctg taatcccagc actttgggag gccgaggctg gtggatcatg aggtcaggag 74881 atcgagacta tcctggctaa cacagtgaaa ccccgtctct actaaaaata caaaaaatta 74941 gccgggcgtg gtggcgggcg cctgtactcc cagctactcc ggaggctgag gcaggagaat 75001 ggcgtgaacc cgggaggcgg aggttgcagt gagtggagat cgtgccactg cactccagca 75061 tgggcgacag agtgagactc cgtctaaaaa aaaagaaaac aggttatcag tttgtaaact 75121 ctgatttctc tgggtcattg tgcccataaa cttttctttt tctttttttt gagacggagt 75181 ctcactctgt cgtctgggct ggagtgcagt ggcgcgatct cagctcactg caacctccgt 75241 ttcctgggtt caagcgattc ttctgcttca gcctcctgtg tagctgggat tacaggcatg 75301 cgccaccacg cccggctaat ttttgtattt ttagtagtga tggggtttca ccatattggt 75361 cagaatggtc ttgaactcct gaccttgtga tccgcccaaa ctcggcctcc caaagtgctg 75421 ggattacagg cgcgagccac cgcacccggc cggttttacc atttttagag ccaagcttta 75481 ctatatattt gatatttgtt ctttcttcaa ccttagctga attcacattc ctctgataga 75541 aggtgttttc aaactgatgc cgttcttagt gcctcaaact agatcctgtt catacttgtt 75601 agaacaagtt attacaaatt cactttggtg taaaaaattg aaatccatac ataatttttt 75661 tttttttttt tgacagagtc tcactaacgc taggttggag tgcagtggca tgatctcggc 75721 tcattgcaac ctccgcctcc tgggttcaag caattctcct gcctcagcct cttgagtagc 75781 tgggattaca ggtgcccaca atcacgccca gctaattttt gtatttttag tagagatggg 75841 ttttcactct gttggccagg ctgctctcga actcctgacc tcaggtgatc cacctgcctg 75901 ggcctcacaa agtgctgtga ttacaggctt aagccaccac ccctggccaa ttttttcata 75961 atatacattt ttttctcatt tttcatgaaa cttttgaaga cccctcatat tctagatatt 76021 ccttctcaga tatgtggttt tcaaatactt tctcccattg agtctttttc cttttcactc 76081 tgtccattat gtcctttttt acacaggaat tttgaattta aatggagtct aatatatctg 76141 ttttataagt ctttgatgca tttgagttca ttttagcaac ttaattcttt tgcgtgtgga 76201 tatccagttt ttttttttta acatcaaaag aataatgttt ttgcctagca ttaaggccct 76261 tggtagaggc ttgtcagtta caattttgga gcagcagatt aagtccacac tcccaaccat 76321 tttccttatc aggctctcaa actctgggcc acaatatgta agacccaatc accccaggat 76381 caggaatcag atatctaggg acagcttctg tgcccaggag cttgtaaaat tattccattg 76441 gtcaatgcac aggggtccct gaaaacctag ctaaccccaa tttacatggc acacacaagc 76501 tgccccctaa gctccagctt gctgttatct tgggttccct cataactctt gcagccctgc 76561 ctatgtcctt aggtttcaag ctgtaagtag caaagtggtc tacattttat gattatcatt 76621 gtgacatgtc ctgacatcag aaaaacacct ttgtatgtta ttactataca accagcagaa 76681 tattatgagt gcagcaaatg ttagaaagta ttcagcctaa cttcactgag caagagtaag 76741 ttcatcctgg agaaagtcct taggaatgca ggcaatatac tttttttcct ttgtcaacag 76801 gtcaaaaaca gcaaagctct atcgagcttg tcttactcac cctatttttt tgttgctctg 76861 ttttgtttta ggcttttagc ctgaagccat ggttttgttt ctgtctctag tggtaggtgg 76921 acaagaggaa tgagatgaga aaggagcttt actggcccag ctagaaacaa actaagaacc 76981 catgactgta ttctttccct tggatgaccc tgtgttagct tgttgaggga gatctcagcc 77041 tgaaattgaa tctcacatcc aaacatccac gcaagggaga tttgttgtaa ttgtcagata 77101 tatggtaaat ttttgtgaat gatgttgcac tttctgaccc tgcctggggc ctttccagag 77161 ttaagttgct gaaagtgtgc attacagaag actcctgcta ttagctgtca tggtgccaca 77221 atgtgcatca ccttagtcac cttaaattac ttagagagtg ataaggtctg gacttctggt 77281 taaatgtttt taaaaaatgg ggggtggggg ggtggtgcat agattgctgt gttctctacc 77341 tttatctgga atattcagtc atttgttccc tttgggggcc tcattcccag tcccctgact 77401 ggtttgggtg tggacatcac ccagctttgg acagagaaca cacgccaact tcagctggca 77461 gcttgtagag atttcctttt ttcagaggta ttattagttg tctgatactg ataatgttga 77521 tgataaattt tctaccttcc aagcttccca acccagtcaa tttccaccta agcattgctg 77581 ttttcttctg atgataaagg tcatattgtt taagctacat ttactcttgg ggttctcttc 77641 actgtgtgct gcgggttgag aacaaaatta ggctttgcca gaatgaaaaa gtgaatggtt 77701 tttggggcct tcaacttttt gtgctcttga agaaataaga agacaaaata gctttcaatc 77761 cacatcaggc ccaatttgca ttgcttcggg agttcctggg aaagtgacgg acttctatcc 77821 aaaatcgcgc cgtgaatttg attattggta gttctacagt cagcttgagg gttgttggtt 77881 tgacagttgt cagagcatgt tgcagctgta tgaggtgggt atctgtacat atggatgtcc 77941 catattctcc agcattgcat aggaatagct ggtgtctaga tcctgcctca ggagctatgt 78001 gtcctgaatt taaaaatcag gtatgttata tccctggggc atgtcagaca tacaacaaca 78061 gtgcattagt ccattttcca ttgcttacaa caaaatacct taaactgggt aatttgtaaa 78121 gaaaataaat ttcttactgt tgttttgttg tgctttcttt tttttttttt tttttttgtt 78181 tactttgttt ttagagacag gatctcactc tgtcccctaa gctggagtac agtggcatga 78241 tcatagctca ctgccacctg gaacccctgg gctcaagtga tcctcctgcc tcagcctctc 78301 aagtagctgg gactacaggt gagcaccacc acacttggct actgttatta ttattttgac 78361 aagatttaat gtagaaagta tagcacctgt tatctaccag gaatagatga ggatgatcag 78421 gtacatttat gtttgaatct gagcgtttaa gtgtatggga agatattaga cactaccttt 78481 cctcataaga cactcagtga tctgaattga ggaacttcta gtttttgctc tcccacttct 78541 tggaaacccc tatctacttt cctatgtatt tgactacttg aggtagctta tacaagtaga 78601 atcatacaaa tgtattgttt tgtgactagc ttacttcact tacaataagt cctcaagttc 78661 ttccatgttg ctgtgtgtgt caaaatttcc ttccttgcta aggctgaata atattccatt 78721 gtatatatat tttctttatc cagtcattca tcaatgaata cttttttttg tgtgtgtgtg 78781 tggtagagtt tcgctcttgt tgcccaggct ggagtgcaat ggcgatctca gctcactgca 78841 acctccacct cctgggttca agcgattctc ctgccttagc ctcctgagta gctgggatta 78901 caggtgccca ccaacacgcc tggctaattt tttgtatttt tagtagagat ggggttttgc 78961 catattggcc gggctggtct cgaactcctg acctcaggtg atctgcctgc ctcggcctcc 79021 caaagtactg ggattacagg cgtgagccac tgcacccagc cacatcaatc aacactttga 79081 ttgtctccac attctggcta ctgtgaacat gggtgtggaa ctatcttcat gaagccctac 79141 tttaatttct ttttgtatat acccagaagt aaaattgctg gatcacatat aattctattt 79201 taaaattttt tcttagtttt aaattatctt ttggagacag ggtattgctc tgtacccagg 79261 ctggagtaag tggcacagtc acagttcact gcatccttga catcctgggc tcaagagatc 79321 ctcctgcctc agatttccaa gtagctagga ctaaaagtgt gccaccacca tgcctggtga 79381 atttttttta ttttttattt tttgaggcag gtcttcctct ggcactcagg ctggagtggt 79441 gcagtggtgt aatctcagct taccacagtc caataccagg ttcaagtgat cctcctacct 79501 cagcctcctg agtagctgag accacaggtg cacagccacc atggtggctg catcatttta 79561 cattcttatc aaaaggcaca tgtttctaat ttttccacat ccttgacaac acttgttgtg 79621 ttcttctgtt tcttttaatt gtagctactc tgttcgaatg gattttgatt gaaaacattt 79681 gaggtatggc ttttaggaga cttgatctta atttcctagc ctttttgacc tatagtttta 79741 ttgtggttta ttgtggaaaa aggtgccctg tagctgccta tccaggtcag tctaatcacc 79801 tttcatcatt caggatttat catctgaggt tcacaccaac atgtatacaa gtggcatgtc 79861 ttagtctgtt ttgtgcttct ataacagaat accacaggct gggtaattta caaagaagag 79921 acatttattg gctcgtagtt tggaggctgg gaagtccaat accaagatac tagcatctgg 79981 tgagggcctt ctcgctgcat cataacatga cagaagccat caaatggtag aagagcaaag 80041 agacagcaag agggaataag aacccattct tgcgatgata gcattagtcc acctatgagg 80101 gtggagccct catgatctga agcctcttaa aggttccacc tcttaatatt gttacagtgg 80161 caattagatt tcagcatgag tttggagaag acaaaggttt aaaccataac atggtgtaag 80221 tcacataact ctgggtcata tgcatatgaa agaattcgaa atttctctat gaacacatgc 80281 tagtcttatt tgggtttaga gaaatcttta gttgcagctg ttaattcatc ctgggtccaa 80341 ggtttacatc tgacagttgc ctacatactt ggctcttgaa agcatagatc tttagaggta 80401 attggcattt tggaatttta ggatattttt tgggaaaata aagggggcct gggcaaagga 80461 tagaagaggg aggaggtgca gaaggagaaa gattaggcag ggtacggtgg ctcacgcctg 80521 taatcccagc actttgggag gccaaggtgg gcagatcacg aggtcaggag attgagacca 80581 tcctggccaa catggtgaaa ccccgtcttt actaaaatac aaaaaattag ctgggcgtga 80641 tggtgcgtgc ctgtagtccc tgctactcag gaggctgagg caggggaacc acttgaaccc 80701 gggaggcaga gattgcagtg agccgaaatt gtgccactgc actccagcct gatgacagag 80761 caggactctg tctcaaaaaa aaaaaaaaaa aaagagaaag attagaaggg taaaggggaa 80821 gtagaagagt tagatataag gcagcaacta gaggaatagg agggggagga tattgcttaa 80881 gtttgccaaa ttgttcaatg gctttattca aggaaatttt tactgaaaca attttttatt 80941 cagaaccttg gagattgtac aaatacgatt cattccaata aaaaactact gcttactggg 81001 cctgagaaat tttgttttct ttcttttcta atgcatctct taaattaaca gtttttgttt 81061 tcttttcaac tgaaatgttc ctcctagtag ccactgaagg caaaaattat ccgggtgaag 81121 ttatgccact tagaaaccta aacactagaa ttgtgattgt aatcagagta catataacat 81181 gcaggagact ttcctggaaa ggagacgatt tcaatttaga cgatccaata agtgctggca 81241 ctgtctgaga ggagtgtaat gattatgtac cttaggcctg gttctacagc atgataggtg 81301 gctacttccc cacaagaaca cgacaatctt cacacctcag catgcagaaa atctgagaat 81361 actctttctg aatgaatgcc taattctctg gacaaaaaag aaaaaaaaaa aagaaaaaga 81421 agataagaga atcttatccc attttatcag tgacccaaag caaagtctgt ccaaatggtc 81481 accagtctgt tgagaatcag aaaacccttg gtctgtgata ttagctgaaa caagtcacta 81541 gagcctatgc ttgagttttt tttgtttttg tttttttaac ggagtttcac tcttgttgcc 81601 caggctgaag tgcaatggcg tgatcttggc tcactgcaac ctctgcctcc tggattcaag 81661 caattctcct gcctcagcat cccgagtagc tgggattata ggcgcatgcc tggccaattt 81721 tgtatttttt agtaaagatg gagtttttag taaagatggg gtttctccat gctggtcagg 81781 ctgatc // LOCUS AC004076 41322 bp DNA PRI 29-JAN-1998 DEFINITION Homo sapiens chromosome 19, cosmid R30217, complete sequence. ACCESSION AC004076 NID g2822142 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 41322) AUTHORS Lamerdin,J.E., McCready,P.M., Skowronski,E., Adamson,A.W., Burkhart-Schultz,K., Gordon,L., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Poundstone,P., Christensen,M., Georgescu,A., Avila,J., Liu,S., Bruce,R., Quan,G., Montgomery,M., Ow,D., Nolan,M., Trong,S., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 500 kb ZNF gene family- containing human contig in 19q13.4 JOURNAL Unpublished REFERENCE 2 (bases 1 to 41322) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (29-JAN-1998) Joint Genome Institute, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT Map and sequence oriented from centromere to q telomere. Cosmid R30217 overlaps cosmid R28253 to the left and F18750 to the right. FEATURES Location/Qualifiers source 1..41322 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R30217" /chromosome="19" /map="19q13.4 between D19S303 and ZNF134" /cell_line="5Hl2-B" /clone_lib="LL19NC032 R chromosome 19-specific cosmid" /note="cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid 5HL2-B, which carries chromosome 19 as its only human chromosome." misc_feature 1..1431 /note="BLASTN similarity to AC003002 (80114..81544); match: 1, score: 1.8e-294; database searched: nt; Human DNA from overlapping chromosome 19-specific cosmids R29515 and R28253, genomic sequence, complete sequence [Homo sapiens]" repeat_region complement(10..98) /rpt_family="MSTC" repeat_region 383..680 /rpt_family="AluSc" misc_feature complement(989..1286) /note="DDS similarity to AA229025 nc50c11.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone IMAGE:1011572. Score: 550 Identity: 290/300 (96%)." repeat_region 1281..1318 /rpt_family="POLY_A" repeat_region complement(1443..1755) /rpt_family="AluSp" repeat_region 1917..2204 /rpt_family="AluSx" repeat_region complement(2735..2777) /rpt_family="(CA)n" repeat_region 3566..3652 /rpt_family="FRAM/FAM" repeat_region 3671..3719 /rpt_family="MER4D" repeat_region complement(3720..4019) /rpt_family="AluSp" repeat_region 4047..4324 /rpt_family="MER4D" misc_feature 4960..5217 /note="DDS similarity to multiple ESTs:~(4960..5217) AA446441 zw60d10.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 774451 5' similar to contains LTR3.t2 LTR3 repetitive element ;~(1..259); 98% identity.~~(4962..5216) AA195132 zr34b08.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 665271 5' similar to contains LTR3.b1 LTR3 repetitive element; (1..253); 99% identity.~~(4962..5216) W03459 za06e09.r1 Soares melanocyte 2NbHM Homo sapiens cDNA clone 291784 5' similar to contains LTR3.b1 LTR3 repetitive element;(1..253); 96% identity.~and others..." repeat_region complement(5599..5875) /rpt_family="LINE2" repeat_region complement(6952..6980) /rpt_family="AT_rich" repeat_region 7067..7364 /rpt_family="AluSg" repeat_region complement(7488..7593) /rpt_family="MIR" repeat_region 7718..8134 /rpt_family="LTR3" repeat_region complement(8649..9102) /rpt_family="LINE2" repeat_region 9114..9386 /rpt_family="AluJo" misc_feature 9573..9676 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 90.000" repeat_region 10469..10758 /rpt_family="AluSg" repeat_region 11041..11377 /rpt_family="MER74" misc_feature 11448..11576 /note="DPS similarity to (U66561) kruppel-related zinc finger protein [Homo sapiens] (1..43); 48% identity.~~Other overlapping matches:~(11508..11634) predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 83.000" misc_feature 12108..12167 /note="DPS similarity to (U66561) kruppel-related zinc finger protein [Homo sapiens] (45..63); 26% identity.~" repeat_region 12861..12900 /rpt_family="MER4D" misc_feature 12973..13537 /note="DPS similarity to (U66561) kruppel-related zinc finger protein [Homo sapiens] (64..249).~~Other overlapping matches:~(12973..15108) predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 87.000" CDS 13033..15108 /note="hypothetical ZNF-like protein" /codon_start=1 /product="R30217_1" /db_xref="PID:g2822143" /translation="MCSSILKDILHLAEHDGTHPEQGLYTCAAEHDLHQKEQIREKLT RSDEWRPSFVNHSAHVGERNFTCTQGGKDFTASSDLLQQQVLNSGWKLYRDTQDGEAF QGEQNDFNSSQGGKDFCHQHGLFEHQKTHNGERPYEFSECGELFRYNSNLIKYQQNHA GERPYEGTEYGKTFIRKSNLVQHQKIHSEGFLSKRSDPIEHQEILSRPTPYECTQCGK AFLTQAHLVGHQKTHTGEQPYECNKCGKFFMYNSKLIRHQKVHTGERRYECSECGKLF MDSFTLGRHQRVHTGERPFECSICGKFFSHRSTLNMHQRVHAGKRLYKCSECGKAFSL KHNVVQHLKIHTGERPYECTECEKAFVRKSHLVQHQKIHTDAFSKRSDLIQHKRIDIR PRPYTCSECGKAFLTQAHLVGHQKIHTGERPYECTQCAKAFVRKSHLVQHEKIHTDAF SKRSDLIQHKRIDLRPRPYVCSECGKAFLTQAHLDGHQKIQTGERRYECNECGKFFLD SYKLVIHQRIHTGEKPYKCSKCGKFFRYRCTLSRHQKVHTGERPYECSECGKFFRDSY KLIIHQRVHTGEKPYECSNCGKFLRYRSTFIKHHKVCTGEKPHECSKCRELFRTKSSL IIHQQSHTGESPFKLRECGKDFNKCNTGQRQKTHTGERSYECGESSKVFKYNSSLIKH QIIHTGKRP" misc_feature 13592..14095 /note="DPS similarity to (U66561) kruppel-related zinc finger protein [Homo sapiens] (250..417).~" misc_feature 14147..14314 /note="DPS similarity to (U66561) kruppel-related zinc finger protein [Homo sapiens] (418..473)." misc_feature 14366..15116 /note="DPS similarity to (U66561) kruppel-related zinc finger protein [Homo sapiens] (474..726)." repeat_region 15272..15565 /rpt_family="AluSx" repeat_region complement(15566..15720) /rpt_family="L1MB3" repeat_region 15721..16037 /rpt_family="MER1B" repeat_region complement(16055..16255) /rpt_family="L1MB3" repeat_region 16256..17064 /rpt_family="MER7B" repeat_region complement(17065..17363) /rpt_family="AluSg" repeat_region complement(17364..17658) /rpt_family="AluSg" repeat_region 17659..17849 /rpt_family="MER7B" repeat_region complement(17853..18015) /rpt_family="L1MB3" repeat_region 18024..18050 /rpt_family="(TAA)n" repeat_region complement(18051..18348) /rpt_family="AluY" repeat_region complement(18358..19222) /rpt_family="L1MB3" misc_feature 19980..20439 /note="DDS similarity to N20479 yx39g09.s1 Homo sapiens cDNA clone 264160 3'. Score: 900 Identity: 455/460 (98%)." repeat_region 20474..20521 /rpt_family="POLY_A" repeat_region complement(20832..20969) /rpt_family="MSTC" repeat_region complement(21068..21397) /rpt_family="AluSp" repeat_region complement(21402..21590) /rpt_family="MSTC" repeat_region 21846..22198 /rpt_family="MSTC" repeat_region complement(22532..22641) /rpt_family="L1MA9" repeat_region complement(22738..22856) /rpt_family="(GGAA)n" repeat_region complement(22871..23168) /rpt_family="AluSp" repeat_region complement(23254..23400) /rpt_family="L1ME" repeat_region complement(23450..23642) /rpt_family="L1ME" repeat_region complement(23638..23887) /rpt_family="L1MC3" repeat_region complement(23938..23987) /rpt_family="L1MC3" repeat_region complement(24013..24074) /rpt_family="Alu" repeat_region complement(24085..24385) /rpt_family="AluSx" repeat_region complement(24420..24659) /rpt_family="AluSx" misc_feature complement(25012..25944) /note="DPS similarity to (U36898) pheromone receptor VN6 [Rattus norvegicus]. Score: 237 Identity: 86/310 (27%)." misc_feature complement(25276..25393) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 92.000" repeat_region 26256..26462 /rpt_family="L1MD2" repeat_region 26568..26856 /rpt_family="AluSc" repeat_region complement(27477..27712) /rpt_family="AluY" repeat_region 27852..28142 /rpt_family="AluSp" repeat_region complement(28487..28532) /rpt_family="POLY_A" repeat_region complement(28535..28833) /rpt_family="AluSg" repeat_region complement(28836..29534) /rpt_family="L1PA13" misc_feature complement(29070..29149) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 42.000" misc_feature complement(29569..29692) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 61.000" repeat_region 29791..30089 /rpt_family="AluY" repeat_region 30090..30235 /rpt_family="L1MD3" repeat_region complement(30325..30624) /rpt_family="L1MA7" repeat_region 30729..31413 /rpt_family="L1" repeat_region complement(31414..31711) /rpt_family="AluY" repeat_region complement(31835..31867) /rpt_family="AT_rich" repeat_region complement(32086..32319) /rpt_family="L1MD2" repeat_region 32606..32909 /rpt_family="AluSp" repeat_region 32910..33253 /rpt_family="L1PA10" repeat_region complement(33249..33413) /rpt_family="L1" repeat_region complement(33413..36754) /rpt_family="L1M3/4" repeat_region complement(36761..36923) /rpt_family="L1ME1" repeat_region 36931..37313 /rpt_family="MLT1B" repeat_region 37421..37777 /rpt_family="MER21B" repeat_region complement(37801..37821) /rpt_family="AT_rich" repeat_region 37867..38159 /rpt_family="AluSc" repeat_region 38545..39092 /rpt_family="L1M4" repeat_region 39128..39710 /rpt_family="L1" repeat_region 39718..40001 /rpt_family="AluY" repeat_region 40453..40519 /rpt_family="(TAGA)n" repeat_region 40585..40625 /rpt_family="(CAGA)n" repeat_region 40656..41160 /rpt_family="L1" BASE COUNT 11685 a 8459 c 8967 g 12211 t ORIGIN 1 gatctgaagc ctcttaaagg ttccacctct taatattgtt acagtggcaa ttagatttca 61 gcatgagttt ggagaagaca aaggtttaaa ccataacatg gtgtaagtca cataactctg 121 ggtcatatgc atatgaaaga attcgaaatt tctctatgaa cacatgctag tcttatttgg 181 gtttagagaa atctttagtt gcagctgtta attcatcctg ggtccaaggt ttacatctga 241 cagttgccta catacttggc tcttgaaagc atagatcttt agaggtaatt ggcattttgg 301 aattttagga tattttttgg gaaaataaag ggggcctggg caaaggatag aagagggagg 361 aggtgcagaa ggagaaagat taggcagggt acggtggctc acgcctgtaa tcccagcact 421 ttgggaggcc aaggtgggca gatcacgagg tcaggagatt gagaccatcc tggccaacat 481 ggtgaaaccc cgtctttact aaaatacaaa aaattagctg ggcgtgatgg tgcgtgcctg 541 tagtccctgc tactcaggag gctgaggcag gggaaccact tgaacccggg aggcagagat 601 tgcagtgagc cgaaattgtg ccactgcact ccagcctgat gacagagcag gactctgtct 661 caaaaaaaaa aaaaaaaaaa gagaaagatt agaagggtaa aggggaagta gaagagttag 721 atataaggca gcaactagag gaataggagg gggaggatat tgcttaagtt tgccaaattg 781 ttcaatggct ttattcaagg aaatttttac tgaaacaatt ttttattcag aaccttggag 841 attgtacaaa tacgattcat tccaataaaa aactactgct tactgggcct gagaaatttt 901 gttttctttc ttttctaatg catctcttaa attaacagtt tttgttttct tttcaactga 961 aatgttcctc ctagtagcca ctgaaggcaa aaattatccg ggtgaagtta tgccacttag 1021 aaacctaaac actagaattg tgattgtaat cagagtacat ataacatgca ggagactttc 1081 ctggaaagga gacgatttca atttagacga tccaataagt gctggcactg tctgagagga 1141 gtgtaatgat tatgtacctt aggcctggtt ctacagcatg ataggtggct acttccccac 1201 aagaacacga caatcttcac acctcagcat gcagaaaatc tgagaatact ctttctgaat 1261 gaatgcctaa ttctctggac aaaaaagaaa aaaaaaaaag aaaaagaaga taagagaatc 1321 ttatcccatt ttatcagtga cccaaagcaa agtctgtcca aatggtcacc agtctgttga 1381 gaatcagaaa acccttggtc tgtgatatta gctgaaacaa gtcactagag cctatgcttg 1441 agtttttttt gtttttgttt ttttaacgga gtttcactct tgttgcccag gctgaagtgc 1501 aatggcgtga tcttggctca ctgcaacctc tgcctcctgg attcaagcaa ttctcctgcc 1561 tcagcatccc gagtagctgg gattataggc gcatgcctgg ccaattttgt attttttagt 1621 aaagatggag tttttagtaa agatggggtt tctccatgct ggtcaggctg atctcgaact 1681 ccggacctca ggtgatctgc ctgcctcggc ctcccaaaag gctgggatta caggcatgag 1741 ccaccgcgcc cagcctatgc ttgagtttta ccctattatt atcctcttca tgacaatgct 1801 ttaagaaagt acatcactaa actgttacat ataagatagc aaaaagagat gaaaagaaat 1861 ggtgtgatac tatcaaggat ggcaatgtca cgggagccaa ttaaaaaaaa aaaggaggcg 1921 gggcgcggtg gctcacacct ataatcctag cactttgggc gtctgaggca ggcagatcac 1981 ctgaggtcag gagttcgaaa ccagcctggc caatgtctct actaaaaata caaaattagc 2041 caggtgtggt ggcacgagcc tgtagtccca gctactcggg aggctgaggc aggagaattg 2101 cttgaacctg ggaggtggag gttgcagtga gctgagatcc tgccattgca ctgcagcctg 2161 ggcaacagag tgagactctg tcttaaaaaa aaaaaaaaaa aaaaaaaggt gggggggact 2221 aaacagagtc taaatagcca agaaacccag tgcaaagaca gtagtgttca aacttgttgc 2281 taacttaaca tgtcatcatg gatttgacaa ggtgtgaaga gcaaagcata aaggacctct 2341 gtggttacta tacctgtttg atgactatgg cccatggtga tgaacaggtt agacgttatt 2401 tcactgggac ctctgtaagt atgatacatt gtaacataat ttctacaatg gtcacatcta 2461 attatatcaa tcctttgttc actgcacctt cattagtaag aagtatgctt aatggtagca 2521 ttcatggact gcaaacagac aacttggaaa taaatgaaga cccaaatgag aacaaacata 2581 gcctattaaa tcagaccaca ccgtagtaaa ggagtcatcc acaatctctt gcactcagca 2641 gagcttgaaa ggcaagcaga ggtgtaaaaa gcttcatagt agataaaagg taaagtatta 2701 ggtatgccat gactggaggc tatagtcttg aggagtgtgt gtgtgtgtgt gtgtgtgtgt 2761 gtgtgtgtgt gtgtgtgaag gggttattta gaaggttgac atcctatgtg attggctagt 2821 gatgcactga ctttctccag ttggtcctaa gttgaacata tgggcaacaa ttagaaagct 2881 gtcagttctt agtcatgtct tagccatttt aagcgagttg ttatgtggga tgttgtctgg 2941 cttcctggcc ttgttgctag acatagtatt ctgacttggg agtggttact gtagatacca 3001 cattagattt ctgaatgggt tggtgtagat tatgggtcag agtcaatttt tatatttggt 3061 gtagccgttg tccatttgta tagcaaattt ctcacaccta aataaggaac tatccacgtt 3121 agtgggattt cttattttta actggttatg ttgaaaatta gtttaatatt cattaatttc 3181 ccacaaccct ttcaaaggtt tgtttttgcc agaccttcag ttgggcagag ttaatttttc 3241 cttttttttt tttttttaaa ctggttggca aactaaattg taacgctttg gccaaaaaat 3301 agccctttcc ttttctgaat ataaagagaa tgggttttaa tttttggtct tttctttgtt 3361 cttccttcac taatcctgca gaagagacca taaggcatta tccaaatctt catttgttgg 3421 gggcatggtt ttcaggtagt attgccacca cctgatggca atactattgc tactcaattt 3481 ttctctcact gctactcaat tgcagtgaga gaaaaatagc tctaaatgcc gactttttgt 3541 agtgcattgt aaaaagctga ataaagccag ggacggtgtc ctgtacctac agtcccagct 3601 gctctagagg ctgaggtggg aagatagctt tagctcagga gctgaggcct tatcataagg 3661 ccttataaga ctacctggag gcttcatctg catgatataa ccttggtctt cacaacccct 3721 tgtctttttt tgtttttgag acggagtttc gctcttgttg cacgggctgg agtacaatgg 3781 cgagatctcg gctcaccgta acctccgcct cccgggttca agcgattctc ctgcctcagc 3841 ctcccgagta gctggcatta caggcatgtg ccaccatgcc cgtctaattt tgtattttta 3901 gtagagatgg ggtttctaca tgttggtcag gctggtctcg aactcccaac ctcaggtgat 3961 ccgcccacct cggcctccca aagtgctggg attacaggcg tgagatacca cgcccggccc 4021 acaacccctt aacccagaca accatttcta ctgattccag gtctttagat acaaactctt 4081 tcaaccaatt gccaatcgga aaatctttga acccacctga cctggaagcc ccgcccctcc 4141 cttccagttg cgcctttcca gaggaatcca atgtacattt tacatgtatt gattgatgtc 4201 ttacgtcttc ctaaattgtt aaaaccaagc tgtagatggg ccaccctggg catatgttct 4261 taggatatct tggagccgtg tcatgggcca ttagtcactc atatttgtct cagaataaat 4321 ctcttcaact atttcacaaa gttccaccct tttcgtcgac agaaagatat tcctccttca 4381 actctccatc acatcctgtg tgggtctttt ctttcttgtc ccaacagaag cacactaacg 4441 atcaaaaaaa agtctccaga gcctcagtat gtgtgtactt gacaagcaca aaagagctga 4501 ccgctatgaa tggaggacca gttcattatg gtcagccatg tcgcttctga tgttcttcct 4561 ttcgggactt gctgtgcttc agagaatctg aaacgacagg tctctgaaat atttctcaga 4621 cttgtcctag actgctctcc gggaaaatct ggttacttgc agtttaggag tcagcaaagc 4681 ccctccacgt ccttaagagc cccggaactg catttcttaa cgctcctcca ggagcctcca 4741 gcttttttcc gtaaggtccg cgcgcccagc attgtgaggc gaggcaggca gcagcgtggg 4801 aactacatta cccagaagac actgcgggcg acacaggcag caacgtgaga actacattac 4861 ccagaagaca ctgcgggggc agggcagcga gtgtagccat tggtctagca gagagacgac 4921 taatgaggtc tcaattgtgt gggcgggact tctggcggcg ccctcatggt tgcgttagca 4981 tggctaccta gggatctgtt cactgattta gagggtccca gagctctggg tcgggactga 5041 ggtgaaagag cggaaaaacg cgagaagcgg tgttccttct acacagaggc tagagtgcgg 5101 atcggctgag tcggctgcag gcgcccctgc ctccgtcagc gtccaggtga ccgccgttcc 5161 cgccccgctc ttccctgggt ggactggagg aggccgcgcc gatgaacctg accgaggtgc 5221 gtgcagcgtc ccaggccgcc ccgcccaacc cctcctccca gtgctgaagt cccaaggagc 5281 cgccctgcag gttaggccct tgtgtctcta agagaggggc gttcttgtgt ggggtgcccg 5341 ggactgggac tgcggtgcga tgaggcgggg gagctcccgt cggagaccga cacgtctctg 5401 gggcttggag gcactgcaga acgggcggca ctggggagcc cgagcgtctt tgtctccaag 5461 gagcagaggt tgaggcggag tctcccctgg aagggcagtg gagccgcaga gagccataga 5521 gggctgtgca gggaggacga atgggagggt cacgcccaga gttagaagtt taaagttggg 5581 gaaaacagga tgttgaacgg gggcaggggg accgctgagg agaccccttg agttatggta 5641 gggctgggcc agaggggttg cagtagaggg gagatgtgat gagactctgg atggatctca 5701 gagatagggt caacaggatt tcctgctgca tcaaacatag ctgtgaggga aagtcggtag 5761 tgaatggtgt cttccatagg ttttgttttg gcctgaaaaa tctgaaagaa tggaattaga 5821 gatatcagct taaggagcgg gtttcagaca gaagatcctg ggttggattt tgggcgacat 5881 cacatgggta ggtggacgta atttccaaag gtgagggaag gtagctggcc tggagcttta 5941 aatgagttag aagttgtgca cttgagaagt tggcctgtgg atcagattga gaaggaagaa 6001 tcctacatcg ttttttcata tgggctgagg taggggaggg ctcagatttg gaggaaagac 6061 catggtaggg agggcatctc tgatactccc tgcagaaggg agcagatatg ggctaagagt 6121 ctggtctgtt tctgttgaaa aagaggcctg gatggataag gagttgccct ggaatacaga 6181 gactgaggca tgagataatt gtagtcactc cacccttcta actcagctgg ccctgagggc 6241 cactctttag ggaacaagat aaggccatgc agcaactggg tccaaaactg ggcagggctt 6301 tctcacatcc tccaccacag ttaagggcct ccagcaagct ccctttactg cattatatgg 6361 caggctgacc tgtttgctga acggaccagt aactgattga ctgctcagcc ccatcctaca 6421 actcattaac ttagggatgt aaaaacaggt aattgactta aacagtaagg aactatatca 6481 atcttaaata accaagcaag tcagaatcta gtttaaagag agtttattca agttcacatg 6541 ttgatgatag ccatctggga ccatagattc aagttgtcct gaatatacac ttcaattacc 6601 acagtgacaa gtgtttgctt tttttttaag gaaaaaagaa gcagtttctg agttgtttac 6661 caataactta tgttaaaata agataagttt ttattggcta tacattctta tttatattat 6721 agattacaga aacatgaaga taataggtga gacagctagt caggaacaga atgtcttaaa 6781 acagttgtcc tcaggcatag gttcagggat gtgactgaca tctcatatat tcttgcctct 6841 ctgagcctga tacattttgt gtaggtcaga caggtctggg ctattttttt ttttcctcag 6901 ccaatatcct tctaaatatg tacacatagg caattggaaa tagatgtaat caataattta 6961 aatgtattaa aatatttatt gttagaatgt taatttacct gtaactctgc ttatagtaca 7021 aagctttata tatatatcac atttccttaa aacattaaga agtgtgggct gggcacagtg 7081 gctcatgcct gtaatcccag cactttggga agcgaaggtg ggcggatcac gaggtcagga 7141 gtttgagacc agcctggcca acatggtgaa actctgtctc tactaaaaat acaaaaatta 7201 cctgggcgtg gtggcgggcg cctgtaatcc cagctactta ggaggctgag gcaggagaat 7261 cgtttgaacc cgggaggtgg aggttccagt gagctgagat tgtgccattg cactgcagcc 7321 tgggcaacag ggcgagacac tgtctcaaaa aaaaaaaaaa aaaagggaaa aatactcaaa 7381 ttttcaaaac ccttctactt tgtgctgaaa acatgtacaa aaccattaat cctatagaaa 7441 gtaatgctta tacatagtgt gtactactta tgcttcttgt caatatgagc cctgtgaagt 7501 gggaaccctt ttaatctcca ttttacacat aagggcactg aggcttaggg aggttcagtc 7561 cttgtccaaa gacacagagc tgctaagagg cagcactgaa cttaggactt gctgtagctt 7621 cagattactt tgctccagta cagggcagag atggttgtac agctcagcct gtaggatgcc 7681 acagtactgt cctgatactc atggcttgct tcttcctgtt acaggccgaa agaatgaggg 7741 tcgtgatcaa ctcagtatgc cactggaggc tatatgagta aacagcaaac tgtttctcat 7801 gaaagcagga tgttggcaaa ctgacaaact gcgtctgcca cccagaagga atgctgaagg 7861 cagtcacgac ccaggcacaa gtgtttcttg tgattaggca taattgaagc ctgttaacaa 7921 taatgtgaac ttgtgatcaa ttaagcagct gaccagtcgt tacctcctcc tccctttcat 7981 cctacccaat aaatgtggag ggctgtggaa gctcaggggc tgcctttgct cactagaagc 8041 agggagctct cttcttcccc agttcgcctt ctttaaaaca gtttctttcg tcttaagttt 8101 tcatttctac gttcgtcccc cttcattcag tctcgtaatg atggtctcaa gtagtaacag 8161 taactgtcgt agtgatggtc tcaagtagta actgtacaag tctgccacat cttcccacag 8221 ggtccattgc agtggcagca atgcataagg accctgcaca ggtaagtcaa ggatttctca 8281 ggctctctac cccaggcccc caccacttat ctctccgtta ttttaggcat cctgatatag 8341 ggaaaggtgg tgtcctgaaa cttcatgttt ttcccttcct tgggtctgaa gactgtggtt 8401 ccacgacaac caggagtcag aactcagcct ctggttatag agtggtccta gttccgacaa 8461 ctgatgggat aagatgtggc agggcgttca ggcacgggat atagaaagtt caaatgctta 8521 ctggtgcgac ttagctggga tgtttagtga actgaaaatc catgttgtat ctggtcgtga 8581 tagagtatgg cagcagggta ggaggttggg gcatacaggg catgcgggat cggcatggaa 8641 tggcagtcta aggttctggg cctttcatct gagggcgatg gtaactaggg aaggtttgag 8701 gaggggggaa aaagaggtga cctgacccag gttttagtgg ctccctctgc ctgcagaatg 8761 gaagaccacg tgcaggtgag ggtggagcca ccgatacctg ggtgatgggt ccagcaacag 8821 cccaggtgga ggttgttggt gactgaacga gagtggcggt ggtggaagtg ggaggagtgg 8881 ggggcttctg tatgtgtttg atctagagct gcctgtattt tataacgtag agagcaagga 8941 agcttgggag ctgagatgga agagggggtc aggaatggcc ccacagtttt ggccttattt 9001 ggaaggatgg atgtcccagc aactaagatg tagtaagttc tgtagtaggg ggagtgttta 9061 tatagtagag catatgagga accaagtttt ggccaagtca aaaatgttcc agtggccagg 9121 catggtggat cacacctgta atcccagcac tttgggaggc agaggtggga agattgcttg 9181 aggccagcct gggcaacata gggagactct gtccctacaa ataataataa atagctgggt 9241 gtggtggcgt ctctctgtgg tctcagctaa tcggcgtgct gaggcaggag ggtcacctga 9301 gcctgggaag taaaggctgc agtgagccat gattgtgccc ctgcactcca gcctgggcaa 9361 cagagtaaga caaacaaaaa caaaaatcca agccaaaaca aaaacaaaaa tacacaaaaa 9421 atgtttcaga gactttgagt ggaggtgcca ggctggcagt tgtacacaca ggaatggagt 9481 tcattggagg tgttcaggaa ggagacatcc actacatgat agctgaactg ttgtgaaact 9541 gggtcagatt taggaagcaa tggtgttagt aggaaggaag gagaaggaac agactgagga 9601 ctgggtgtct ccactgggca cctggatggt gttggtgggt gtcacaaaga cagatgagag 9661 agtggatcga ctgagggtat gtgtgtgtat gtttgtgtgt tatgttgggg gcagcctgtg 9721 gagtataagg aagggctttt gaggggagag tggccatgag atggatggtg aggcatctcc 9781 acaaactcac cagggtttca tggtgagtct tgcttggacc tgggtagttt catgcaaact 9841 gtgagactac tttaggggaa gatgggatca gcaaggagca ttgctaggcc tggggtgggg 9901 atagcattgg ggaggtcagt gtatctgggc tttggattag gattaggggg atagagaatg 9961 ttgtgactgc ttatgggaaa acatgtacat tgcaaaaggg ggagacagcc aagatctgga 10021 actcagagga ggtatgcaag aactgtgagg ttgtcacttg ggttttgggg gggctgaggt 10081 taaaccagaa aacaggactg ggaagggaga ccttcagaac agggctcggt gctgccccag 10141 gcgctaggga acattatatt ctgggcagcc caggagcaaa tagggttctg gtgcactgca 10201 ggactccatg tacagagtgg catcttggga ggccatggaa atgctcctgg ttgggccagg 10261 gctgctgtta gtattgcaag agtggaggca ttagatacat gagagaggta ctgtccaaag 10321 cgagatgcga aggtgaggag gagcatgaat gcgtggtccc agccctggca atgggcacgt 10381 aagggagaag agtgagcctg agtttgggtg tctgggaggc aagatctgga gaccccaagg 10441 aaatggggtg acaaaagaag gtagggccgg ccgggcgcag tggctcatgt ctgtaatccc 10501 agcactttgg gagaccaagg caggtggatc acgaggtcag gagttcgaga cccgcctgcc 10561 caacatgggg aaaccccgtc tctactaaaa atacaaaaat tagccgggag tggtggcagg 10621 tgcctgtaat cccagctact caggaggctg aggcaggaga attgcttgaa cctgggaggc 10681 agaagttgca gtgagccact gcactccagc ctgggtgaca gagcaagact cttgtctcaa 10741 aaaaaaaaaa aaaaaaaaaa agagggtagg gcgccctggg ctaggcattg atgatgcttt 10801 tgctttctcc ccacccatct ttgtgctgtg ctggactcag caatcagtag cagtgggggc 10861 gagagaagtg cgaatgctac caggatgtgg agtctcttcc tccttgggaa gtgggtggag 10921 gctaagcagg ggatggatgt gtgtgggtac tggggaagca ctgaagattc tggagaggga 10981 agtgtatcag tgattgaccc gctggatttt catttttgga aaagagtgag tgatgacaga 11041 gaggctaatg gcattgaaat aattttaatt actttcattt ccttagtgca gggggaatgc 11101 cgtgctgcac caggccacgg gaaatactag atttggttag gaggcagagg catgactgaa 11161 gggaaagcat atatcagaga ctgtattggg gttttggcag gaaaggcatg cagggtggga 11221 tgaggagctt aaaactcagt ggtttgcata agtctggcag ccttagggct aagagattat 11281 cccttattgt gtggtgacag ggcttgtgtt gattaggaca ggagaaataa tcatgtatga 11341 gcattagata gtggttcagc atatgggatc tggatcgcaa gggaaatgta aaaacattgg 11401 ctgttagttt ggccctccaa ttcttagatt ttaagtagat aaatacagat ctaaggaaac 11461 acagaataag aagttgttca tagactgaag tgtcataatt ttggcaggat tgtatggtct 11521 ttgaggatgt ggccatatat ttctcccaag aggaatgggg gatccttaat gatgctcaga 11581 gacacctgca cagcaatgtg atgttggaga actttgcgct tttgtcatca gtaggtaagg 11641 ccttcatacc tactctggtg tcttgtgctg ggtgctgttt ttttgctctt ttccatgaca 11701 actctgtctt tcccacacca gattgtgagt gctacgtcct gtcctctcct ggtttcctgg 11761 caaatgtgat aaggcagcca gagctgggct gtgtatactg taccacctcc ccttaagcag 11821 ccccagcttc tgttgctctg aaactttaca agataaggct cggaagtcag aaatcttcag 11881 gtttctttaa gatatcctaa tggcggtgcc tgtccctggt ttgattactc tgtccaaatg 11941 tgtgaaacct ctgtccaagg gtttccctgt tccccttgct gacatacttg gagacagtgt 12001 ctgtgctagg aatttcaggc acctccctgg tcaccacttg tgaggattgt gaagttattt 12061 ctgcaggacg ctctactggg tgttctggta gctttagaat gcagcatgtc ctgggtttat 12121 tgctctgtgt acatgctgtc ccctcccttt ccctgccttt cccgtagctg gacttatgcc 12181 gcagctagat agttgctcac cttgagcaag ggggaggtct ctgggtgcct ggcagagtgg 12241 atagtcctct attaatggca agagacactc agaaggactt ggccctggtg ggtgaaagct 12301 gggaaaggat gtgatatcaa ggctgggttc aaatcacatt gagaacttac cttgtgacca 12361 ctattgtttg atgcagggcc actggccact cgtcactctt cctttccttc cctgttgtta 12421 tttttactct gacttctgac ccctggcttc cacctctgac ccctcctctc cagccttcat 12481 ctctccacca ttctcttcca cctctccctt tgcagtgctg tctaatgtat gacctgtact 12541 ttaggtgtta tgaggtctgc gtgtatcctt cagtagtcca tgaatactcg ctctcacaca 12601 atcagatctc tctggatctg gacacaaccg tcatggcctg ttgaaagcca tttatagagg 12661 agtgagttgg agaccccacc tgccctccct gcatggtact tctcccttgt gtccttacca 12721 ccaggattcc cccactacac acacttcaca ggcccacctc tgaacactgg gccttcctct 12781 cactggcctt atttccttgc ctgtgaccag tcctgtgccc tcatttgtgg gtgtgtctca 12841 cagacatttg tgatggggct gtctcctccc tccggagttc acgtgcactt caccagcatt 12901 tctgttctta caggttgttg gcatggagcc aaggatgagg aggtaccttc caagcagtgt 12961 gtttctgtaa gagtgttaca ggtcacaatt ccaaagccag ctttgtccac cctgaaggcc 13021 cagccctgca agatgtgtag ctcaattctg aaggacattc tgcacctggc tgagcacgat 13081 ggaacacacc ctgagcaagg gctgtacaca tgtgcagcag agcatgacct gcaccaaaag 13141 gagcagatta gagagaagct caccagaagt gatgagtgga ggccttcatt tgtgaaccac 13201 agtgctcacg tgggagagag gaacttcaca tgcacgcagg gtggcaagga ttttactgcc 13261 agctcagacc ttctccagca acaggtctta aacagtgggt ggaagctgta cagggatacc 13321 caggatgggg aagcctttca aggtgaacag aatgatttca actccagcca aggtgggaaa 13381 gacttttgcc accaacatgg gctgtttgag caccaaaaaa cccataatgg ggagaggcct 13441 tatgagttca gtgaatgtgg ggaattgttt aggtacaact ccaaccttat taaatatcag 13501 caaaatcatg ctggagaaag gccttatgag ggcactgaat atggaaagac ctttattaga 13561 aagtccaacc tagttcagca ccagaaaatt cacagtgaag gctttctttc aaaaaggtct 13621 gaccccattg aacatcagga gattctcagt agaccaacac cttatgaatg cacccagtgt 13681 gggaaggcct ttcttacaca ggctcatctg gttggtcacc agaaaaccca tactggagaa 13741 cagccctatg aatgcaacaa gtgtgggaag ttttttatgt ataactccaa actcatcaga 13801 catcagaaag ttcacactgg ggagaggcgt tacgagtgca gtgaatgtgg gaaattgttt 13861 atggacagct tcacactcgg tagacatcag agagttcata ctggagaaag gccttttgaa 13921 tgcagcatat gtggaaaatt ctttagtcac cgctccacac tcaatatgca ccagagagtt 13981 catgctggca aaaggcttta taagtgtagc gaatgtggga aagcctttag cctcaaacat 14041 aatgttgttc agcatctgaa aattcatact ggagaacggc cttatgagtg cactgaatgt 14101 gagaaggcct ttgttagaaa gtcccaccta gttcagcacc agaaaatcca cactgatgca 14161 ttttcaaaaa ggtctgacct cattcaacac aagaggattg acattaggcc aaggccttat 14221 acatgcagtg aatgtgggaa ggccttcctt acacaggctc atctggttgg tcaccagaaa 14281 atccatactg gagaacggcc ttatgaatgc actcaatgtg cgaaggcctt tgttagaaag 14341 tcccacctag ttcagcatga gaaaatccac actgatgcat tttcaaaaag gtctgacctc 14401 attcaacaca agaggattga cctcaggcca aggccttatg tgtgtagtga atgtgggaag 14461 gccttcctta cacaggctca tctagatggt caccagaaaa tccagactgg agaacggcgt 14521 tatgaatgca atgaatgtgg gaaattcttt ttggacagct acaaacttgt tattcatcag 14581 agaattcaca ctggagaaaa gccttataaa tgcagcaaat gtgggaaatt ctttagatat 14641 cgctgtacac tgagtagaca tcagaaagtt cacactggag aaagacctta tgagtgtagt 14701 gaatgtggga aattttttag agatagctac aaactcatta ttcatcagag agttcatact 14761 ggagaaaagc cttatgaatg cagcaactgt gggaagtttc ttagataccg ctctacattc 14821 attaaacatc ataaagtttg cactggggag aagcctcatg agtgcagtaa atgtagggaa 14881 ttgtttagga ctaaatcgag ccttattata catcagcagt ctcacactgg agaaagtcct 14941 tttaagttaa gggaatgtgg gaaagacttc aacaaatgta atactggtca gcgccaaaaa 15001 actcacactg gagaaaggtc ttatgagtgt ggtgaatcca gcaaagtgtt taaatacaac 15061 tccagcctca ttaaacatca gataattcat actggaaaaa ggccttagtg gagtgaatgc 15121 aggaaagtca ccaaaactgt cacctcattc agcaccaaaa ggttcacatc ggaccaagaa 15181 cctattaata tatgtaaatc taatgttgaa agagttcaga tggaaatctg cgaggatttc 15241 ctgctgggaa ctacattaaa aacatttatg tccaggcgtg gtggctcacg cctgtaatcc 15301 cagcactttg ggaggcagag gtgggtggat cacctgaggt caggagtttg agaccagccg 15361 ggccaacatg gtgaagcccc atctctacta aaaatacaaa aattagctgg gcatggtggc 15421 aggtgcctgt aatcccagct attcagggga ctgaggcagg gagcatcact tgaacctggg 15481 aggcggaggt tgcagtgaac tgagattgtt ccattgcact ccagcctggg tgacagggcg 15541 agactctgtc tcaaaaagaa aaaaattata attgtggcaa attacaggta acttaaaatc 15601 taccatctta acccatattt aactgtactg ttcagtagtg ttaagtcatt cgcattgttg 15661 tcaattaata tccagaagtt ttttcaactt aatgaaacta aaacattata ccctttaaac 15721 caggggtcca caacccccag gctgcaaact ggtaccagtc tgtggcctgt taggaaccgg 15781 gccagcatca ctacctgagc ttcacctcct gtcagatgag cagcattaga ttctcatagc 15841 agcctgagcc ctattgtgaa ctgtgcctgt gagggatcta ggttgtgtgc tccttatgag 15901 aatctaacta atacctgatg atctgaggtg ggacagtttt attctgaaac tgtctgcccc 15961 ctcctctgtc cctgatgcgt ggaaaaattg gcttccatga aaccagtccc tggtgccaaa 16021 gaggttgggg accgctgctt taaactactc ctcttcctca ttccctacgc caaccagccc 16081 ctgacaatac tactgtatga attttactat aaatatctca tatgaggaaa cttaagtttt 16141 agtcttttgt gacttatttt gaggcttatt tccctaaatg tcttcaaggt ttatccatgt 16201 tgtagcatag cttatgattt tcatcttttt taaaggccga ataaaatccc attgtagtca 16261 tgtgtcgcct aacaagagga ctgtgttctg agtaatcatc ttaaggtgat tttgtcattg 16321 tgtgaacaaa gagtgtactt acacaaacct gaagggcaca gcctattacg cacctaggct 16381 atatgatatg gtccattgcg cctcagatat aaaccttaac agcatgttac agtattgaat 16441 attgtaggta attgaaacac aatgataagt aaatgtgtat ctaaacatat ctaagtttcc 16501 aaaaggtact gtaaaaatgt atgaaaaatg gtaaaaactt tatagggcac taacatgaac 16561 gaagcttgcg ggacccctct ggttgagtca gggagtgagt tgtgagtgaa tgtgaagctc 16621 taggacatta ctgtacccta ttgtagatgt tataaacatt gtacacttag gctactctaa 16681 attaattttt tctttagtaa taaattaacc ttagtctact gtaacgtttt tacttcatga 16741 acttttaaaa ttttaaaaag ctttttgact cttttgtaat aatagcttaa aacacaaaca 16801 cattatacag cctgtataga agtactcctt tatattctat aaactttttc tatttaattt 16861 tttaacttcg taaaaaataa gttaaaaatg aagacacaga cacacacatt agtcaaggcc 16921 tacagagggt gaggatcatt aatatcactg tcttttgcct ccacatcttg tcccactaga 16981 aggttttcag gctcaataac agggagttgt tatctcctat gataacaata ttctggaata 17041 ttttctgaag ccttgcctca cacttttttt tttttttttt ttttgagaca gagtctcgca 17101 ctgttgccag gctggagtgc agtggcacaa tctcggctca ctgcaacctc tgcctcctgg 17161 gttcaagtga ttctcctgct tcagcctccc gagtagctag gattacaggt gcctgccacc 17221 acgctcagct aattttttgt atttttagta gagatggcgt ttcactatgt tggccaggct 17281 ggtctcgaac tcctgacctc atgattcgcc cgcctcagcc tcccagagtg ctgggatgac 17341 aggcatgagc caccgcgcct ggctttattt tttttttgag acagagtctt gcactgttgc 17401 ctgggctggt gtgtagtggt gtgatcttgg ctcactgcaa cctccgcctc ccgggtacaa 17461 gcgattctcc tgcctcagcc tcccgagtag ctaggattac aggtgcccac cagcatgccc 17521 ggctaatttt ttgtattttt agtagagatg gggtttcact atgttggcca ggctggtctc 17581 caactcctga cctcatgatc cgcccatctc ggcctcccaa agtgctggga ttacaggcat 17641 gagccacagc gcccggcctg tgctctttat atgactggca gaccagtagg tttgtttaca 17701 ccagtgtcac cacacacact tgagtaatgg gattgagtta cagtggctgt ggtggcacta 17761 ggtgatagga gttgtttagc tctgttataa tacgggatta ctgttataca tgtggtccat 17821 ggtagacaga aatgtgtggc atatgactgt ggtatatgta tatgccactt ttcctgtatt 17881 ctttcttcag tgtacatttg ggttgcttcc actcttctgt tcttgttaat gccactatga 17941 gcatgaatat acaaatacct cctgaggccc tgctttaaat tcttcttgga catataccca 18001 gaagtggaaa tggtgcttcc cgttaataat aataattata ataataataa ttattattat 18061 tattttgaga cggagtcttg ctctgtcacc caggctggag ttcagtggca caatctcggc 18121 tcactgcaag ctccgcctcc cgggttcacg ccattctcct gcctcagcct cccaagtagc 18181 tgggactaca ggtgcccgcc accacgcccg gctaattttt tgtattttta gtagagacgg 18241 ggtttcaccg tgttagccag gatggtcttg atcttcctga ctttgtgatc catccgcctc 18301 agcctcccaa agtgctggga ttacaggcgt gagccaccac gcctggccac ttcccgttaa 18361 ttctatttta aattgtgggg ggaattattt tccatagcag ctgcaccatt ttacattctc 18421 accagtgcag gggttctagt ttttcacatt attgccaaca ctttgttttt taatcatgat 18481 gggtgtaaga tgttatttta ttatggtttt ttaatttgca tttcagtatt ggtgttgagc 18541 atcttttcat ttgcttgttg actaattgta taggttcttt ggagaaatat ctgttaaatg 18601 tctttgccca ttgaagaaag tttttgtaaa ttttcagata acattgtatg cttttgttgt 18661 gtacaacatg atgtgaagta tatatacttt gttgagtggt taacatgtac gtattatata 18721 tgtacatatt ctgtatacgt attttaaata ttaacccctt gcagatatgt gtttttcaaa 18781 tactgtctac cattcagtct gcttcctttt cattctgttc atgtcttttt tttttttttt 18841 tgcacaggag ttttcaaatt tcaatggagt ccaatatatg ttatgtttgt ttcctatgct 18901 tttggtatta tacccaagaa attattgcta aatccagtgt catgaacctt ttttcatgtt 18961 attttctagg agttttatag ttttaagttg tacttttatg tctgatttat ttcagtttgt 19021 tttagtaaat ggaggaaggt aagagttcaa cttcattctt ttgcatatag atatccagct 19081 ttttgaaagt catttgttga aaagactcct taataagttt ttgcgctctt gtcaaaaatc 19141 tgagcataag tatgaggact tccgtctggc ctctttattc cactatatgt atgcatttgt 19201 gctagtacca aggtgttttg atacataatt cttttaaaag aataatgttt ctgtccactg 19261 ttgaggccct tgctagaggc ttgacagtgc tacatttgga acagctgatt aagtacactt 19321 ccaaccactt tccttaacag gctgtcaaac tctgggtcaa tatatatcag ccccagtcac 19381 cccaggagca ggtatcagac atctggggac aactcctatg ccctgaaatg tgtaaaatta 19441 ctcaaattgg ccaagtatag aaagcccaga aaatctagct aacccgaatc cacttgccac 19501 acacaagctg cctcctacaa ctccagtttg ctgtattact atctatattc cttcacaact 19561 ccggtgcagc cctgaatcat gtcctctggt tctgagttgt aagttacaac atgctccacc 19621 tttcatctat ctgtcatgtc atgtcctgac atcaagaaaa cttctaaatc ttaaaactac 19681 atacccagaa gggtatcatg agtgcagcga atgttggaaa gtcttcagcc attaacctca 19741 ttcagcacca gtaaaccatc ttggagaaag accttaggaa tgcaggcagg caaagtgctt 19801 ttgccagcag ataaaatagt aaagctgtgg atattagctt ttgtgaggga gacctcaacc 19861 agaaatggaa tcttacatcc aaacatccac acaagggagg tttgttgtaa ttgtctgcta 19921 tatgagaagc ttttgtgaat taccttgcac tttctgacct gcctgggatc cttgccagtg 19981 ttaagtcact gaaagtgtgt actacaaaag acttccatcc actattagct gatatcacag 20041 tgtgtatcac cttaaaatgc ttagggaggg cagatagctg tgctctctac ctttatctgg 20101 agttattgag tctgatccct tcgggcgagg cctcattccc acttccatgg ctggtttggg 20161 tgcagacatc atccaacttt ggacagagga tacaggctgg cttgagctgg cagcttgtaa 20221 agattttcct ttcttcatgg gtattattgc ttgtcttata atgttggtga tagatttttc 20281 accttccaag ctgcccaaac ccagtcaatt tccacgtgac tatactggtt ttagtgatga 20341 taaagtcatt gtttaagcta catttactca ttgcagtttt attctctgtg tgctgtgggt 20401 tgagagcaaa attggagtct gccaaaatga aaaagtgagt ggttttgtga ggttttcaat 20461 tttatgtact tccaaggaaa tagcaaaaac aaaaaaacca aaaaaaaaaa aaaaaaaaca 20521 actcaatcca caaacgcctg atgtatattg cttcagaagg ttcctgggaa aaagaaggat 20581 ttctatccaa aatcctttac cctgaatttg aagattattt atggttgcac agtcagcttg 20641 cagccaggac aattggtttg acaactgtca ggatttatgg cagctgtgag agctgggtat 20701 ttgtacccat gcatgtccca tattctccag tattgtttag ggatagcttg ctcctgtgat 20761 gtgccttatg agctatgtgt cttgaaccca aaaatgatgt aggttatgtt cctggatgta 20821 aaatgtaaga gtgcattagt ttgttgtcca ttgcttataa aataatacct gaaactgggt 20881 tatttataaa gaaaaatcta tttcttacag ttatggaggt tgaaaggtcc aaggttgagt 20941 ggccgcattt ggtaagagcc ttcttgctgg tagactgtgc aaaaagtccc aaggtggtac 21001 agggcatcac atggtgaggc tgctgagtgt gctcacatgt taacgtgcta gcgcaaatca 21061 ttctctctct tttttttttt tttttttgag acggagtttc gctcttatcg cccaggctgg 21121 agtgcaatgg cgcgatcttg gctcactgca cctccgcttc ctgggttggc tcaccgcaac 21181 ctctgcctcc cgggttcaag caattcttct gtctcagcct cttgagtagc tgggattaca 21241 ggcatgtgcc accacacccc gctaagtttt tatttttagt agagcagggt ttctcccatg 21301 ttgatcaggc tggtctcgaa ctcctgacct caggtgatct gcccgccttg gccccccaaa 21361 gtgctgggat tacaggcaag agccacggcc cctggccctt tctcttctta taaaaccacc 21421 agttctccca ggataaccca ttagtctatc aacccataaa ttgattaatt cattcctgag 21481 gatagagccc tcatgattca atcacctctt aaaggtcgca cctctcagta ctaccacact 21541 gtagattaag cttcaatgtg acttttggag gagatggtca aaccatagca agcagtttgg 21601 gaaccatagt ctgtacagaa acagtgattg tattaatatg aagtatgaga caccagcaaa 21661 ctagagtgga atggatgagc gccttgtaaa gagacatcca tgaatgtttc agtaactatg 21721 gctatgtgag atgagggtat acagaaactc tgaggaccta cagacgtgtg tatctcctgt 21781 aggcaaagtc attgtcagag caagtaatcc aaaggtgtta tggattcttg tgtctctggc 21841 gttataaaac gtatgttgaa ttttaattgc cattttaaca gtgtggagaa atgggacctt 21901 taagaggtga ttaggtcatg atggctctgc ccttataaag gaatcaatgc ctttatcata 21961 agagtgggtt tgttaacccg agagtgacgt ttttataaaa tcagctgtct ctgtctttca 22021 tgctcacttg cccttctgtc atattatgac agagtaacaa ggccctcacc agatgccaag 22081 cagataccag tgccataccc ttggacttcc cagcctccag aactgtatga aacaaactac 22141 tttataaatt acgcaatctg tggtgttttg ctatagtagt ggaagacaga cacagacata 22201 aagcctgtta attcaaacca ctactgcaca ggtaggagcc tcatgactga atgaagggag 22261 attctatact tctgagataa gtcttttttt tttttaaacc taaacataat tcctggtgac 22321 tcaggtggac ataaccttcc ttgtctccag ttcttcttcc ccgagattct gaatttccaa 22381 caacattgct tccagttttt gatactataa aatcagagtt gacccacgta agacatgagg 22441 agaggctaat gatggaatca acacagcttt gccagaggtt ataaaagagg attgagaaat 22501 ccagatttta tttggaattt ttcttttcta acagctttat tgaggtatta gtgacattca 22561 gttgcacatt gttaaagtgt aaaatatata aaaacaatca ccaaagtcat gataataaac 22621 atatccattg cctcaatagt ttgtttgtac ccttttctat tctctctcgc tcacagtcct 22681 acccaaattt ttatgctcag gtagccactg gtttacttta tgtcactata tacatctttc 22741 cttccttcct tcctccctcc ctcctgctct tttctttttc cttttttctt ttcttttttc 22801 tttgtttctt tccctccctc cttctcttct cctctcctct cctcctctcc tcctcctctc 22861 ctctcctctc ttctcttctt tctctttctc agaatttcac tcttgttgcc caggctggag 22921 tgcaatggca cgatctcagc tcaccgcaac ctccaccttc tgggttcaag cgattctcct 22981 gcctcagcct cctgagtagc taggattaca ggcatgcgcc accacgcctg gctaattttg 23041 tatttttagt agagacgggg gtttctccat gttggccagg ctggtctcga acgcctgacc 23101 tcaagtgatc cacctgcctt ggcctcccaa agtgttggga ttacaggcgt gagccaccat 23161 gcccggccta taaattttct agtatcttat acaaatgaaa tcctacatta cattcatatg 23221 gagcattttt tgatatttat acatgttaag cgtattcatt ttttgttgat gaagagtatt 23281 gcattgaata taccacaatt tattaatcta ttaataaata ttttgattgt ttctagtgtt 23341 ggactaaaaa actaacaaag acccatgaac attcatatac aagtgtttat atgaatatat 23401 ttagcagaat tgcatttgtt atacctcctt gtccatgatg gttatggtgt attttaaaga 23461 ttaaccattt tcatagatgt gttgtggtca gtgtggtttt taaatttgca tttctctaat 23521 aattaatgat gctgaataag tttttatatg cttatttgtt atctgtatat ttggccaaat 23581 ttctaaatct tttccctttt tttaaaaaaa aatcttattt ttctatttta atagagtttt 23641 aattttagaa cattttaaga tgtacagaga aattgcaaat atagtacaga gatatctcat 23701 ttacaagcac acctagttgt tactaatata tagtcatata ttagtataca gcattattta 23761 caattagtga aacagttttg atacattatt aaactaaagt acatctttga ttgagatttc 23821 ctcagttttt accaaaatac atgttctgtt ccagaattcc acctaggata ccacattgca 23881 tttattttat tattttttat tcttttattt attttttttc atgtgtctct aaatgtaacc 23941 acattacagt tagtactcac atttctttag gcttgtcttg cttatgagaa tactcattta 24001 ttttgatatt gatttttaat tttttttttt ttagatagtc ttgctctatt gtccaggctg 24061 gagtgcagaa gcactttctt tctttctttt tttttttttt ttttgagacg gcgtcttgcc 24121 ctgtctccca ggctggagtg cagtggtggg atctcgactc actgcaacct ctgcctactg 24181 ggttcaagga ttctcctgcc tcagtctcct gagtagctgg gattacaggc acctgccacc 24241 atgtctggct aatttttgta tttttaatag agatggggtt tcaccatgtt ggccaggctg 24301 gtcttgaact cctgacttca agtgatctgc ccacctcagc ctctcaaagt gctggggtta 24361 ccggcgtgac ccacggtgcc cagccacttt tagttatttt tatttttatt ttgagtctcc 24421 ctccgtcacc caagctggta agtaattctc gtgcttcacc ctctggagta gctgggacta 24481 caggtgtgtg ccacatggtt ggctaatttt tgtatttgag acagggtttt tgccatgttg 24541 gccaggctgg tctcgaactc ctggcctcaa gtcatcttga gtggatcact tgaggcattt 24601 gggagtgcct cagcctccca aaacggtggg attacaggcg tgagccactg tgcccggcct 24661 gcacatgtct ttcttacatg tggactctcc aggggattta tgctgttcaa gaagcacctg 24721 aatcaaactg tgctttaaaa atatttttaa aacattccat gattcttcat agtacaaaaa 24781 tgagaaatct cccctaaatc tctaaagaat tcccttttta aactttttga tttcctgaac 24841 ttgtctgaag gacttaacac agttatatag agtgttcttt gtgtaacagt atccatgact 24901 ttcattttta gctctggtat tagatcttcc tggacaagag cattactggc catgtgtatg 24961 ttgctatcac aaataactag tagttaatat ttcctaaatc ttagcaagag ttatgataaa 25021 tagctgaatt ccatgaagag aaaagactca tggcatgaca accagattag gaaagagtgt 25081 tttccttgtc ctgcaggcaa aacagaactg agagatatga gtatcactca tgatgaggac 25141 aaaagggctg cgtgctggga aacatgaggc gaccaacaca gagttggtca ctatccactg 25201 gcctgggttt gcaactacag ttgtccaaat tgtcagaaaa ctatggactg aatagaaaac 25261 aaaaaaggag ctcaccagga ccatgatggt gtgtgtggct ctggcttcct gggaaggtct 25321 gcaggagagt ctgttgctgt gattgtgttg gacttgctgc ttgtgtctgt agaggaagaa 25381 gaccatggag ccactggccc agaccatgaa gcccaaactc ataaaatcag gggaaaaata 25441 taagactgca tgtaatgagc taaatctctt tgatgctttg taagaacagt atccataatt 25501 gttttttgca ctactgtttt tgctattcag tgggccattc actaatagaa gaacagatgc 25561 attcatcaag acatgggggg cccagcagag gagacaacag aagtcaataa accttgggga 25621 tctaatcttg atctccatcc acctgcatat actggggttg agcttaatgg cttggaatcc 25681 attgagaagg cagatggtgc tgagggaaac tcttgtgccc accctgtgat aataaaagac 25741 aaacttacat ccagtgtcat tcagcaaata tttcaatcca aaagctgcca ttgtctgagg 25801 tatcccttta aagaaaagga ccatggagtt agccaaggcc agttggctga gaatcaagtc 25861 cgtgggtctc agcttgtgtc cagtgaacaa aattaagtta taaaaacaaa ggagaaagga 25921 atttcccagg atcccaactc cagtctgaat gaggaagcta atccctgatt ttacttttcc 25981 aaaagccatt tcatcaaaat ctaggggatg ttgattttca ttgaggtctg aagaatcagt 26041 agaataaaaa agcagaaaga agtatcttgt catcagtgga gacagaagtt ttaatgtgtc 26101 tccaaccata ttttctcatc ttaaacagaa attaagattg tccctgttcg tgcaaaggca 26161 attgcttgtg tgtccaactg cagagaagca ggtgaatgaa tgagggctca accgcacaac 26221 agatccacaa atcaaaccta taaaactcag ggcttacaca catgcaacaa gtaaatacaa 26281 gcaaaactag ggaaatatat aaaaacagaa cacagaagag tcaatatcat tagagtgata 26341 ttatactata gtttgcaaaa tgtaatcatg acggaaccct gggtaatatg gacagggctt 26401 tttctgtatt atttcttatg actgcttgtg agtctacact tatttcaata aaaccttcaa 26461 ttaagtagcc tggagatatg aaaagcaaaa ataatacacc ataaaacctt ttccacacta 26521 atattttagt aaacaccatt gttattgatg cattgaaaca aattttaggc cgggtgcagt 26581 ggctcacacc tgtaattcca gcacttcggg aggccaagat ggagaccatc ctggccaaca 26641 tggtgaaacc ctgtctctac taataataca aaatactaat aatacaaaaa ttagccgggc 26701 atggcggcgc gtgctagtcc cagctactcg ggaggctgag gcaggagaat cgcttgaacc 26761 tgggaggcag aggttgcagt gagctgaaat caaaccattg ccctccagcc tgggcgacag 26821 agcgagactc tgtctcaaaa aaaaaaaaaa aagaaattaa aaaaacgcta caatacaatt 26881 ggtaacaagg ccttcagcac agtacaaatg ctaacttgaa gaatttttaa aaactataat 26941 ttttggaaca atcaacattg catcatggta tattttattt taagcacata tatttctgtt 27001 attttccacc ggacaggtct agagattgtg aattgactcc gtagaaataa ccatgaccac 27061 cttgagccca cacattttta tctcaatgcc atttctcaca agtaaatgca aaagatcctg 27121 aaaataaata ctaatcccag aaacacctaa gatatcctta aaacttgtta aataggaaat 27181 tcaatatgtt ccagaaagct tgggtttatt tttagggaaa gataattcaa ttgggaggga 27241 atttcattgc acaatattca tatatatcaa tcacaaaaga aatagtaact gcggccaggc 27301 acggtggctc acgcttgtaa tctcagcact ttgggaggct gaggcgggtg gattacaagg 27361 tcaggagatg gagaccatcc tggccaacat ggtgaaaccc cgtctctact aaaaatacaa 27421 aaattagctg ggtgttgtgg tgcgtgcctg tagtcccagc tactcgggag gctgaagcct 27481 cctgggttca cgccattctc ctgcctcagc ctcctgagta gctgggacta caggcgccca 27541 ccaccacgcc cggctagtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtttagtaga 27601 gacggggttg caccgtgtta gccaggatgg tctcgatctc ctgacctcgt gatccgcccg 27661 cctcggcctc ccaaagtgct gggattacag ggatgagcca cctcgcccgg ccataattta 27721 gattcttttg gtgtctctat ggaattgcta tcatcaggga aaattatgag gtattttctc 27781 ccatgaataa ctacatctct ttaggggtgg agcaatcaat aacacagcag tatcaagatt 27841 ctcaggcgag aggtggctca cgcctgtaat cccagcagtt tgggaggccg aggctggtgg 27901 atcagctgag gtcgggagtt cgagaccagc ctgaccaaca gggagaaacc ccgtctctac 27961 taaaaataca aaaattagtc agaggtggtg gcgcatgcct gtaatcccag ctactcggga 28021 ggctgaggca ggagaatcgt ttgaacccca gaggcggagg ttgcagtgag ccgagatcat 28081 gccattgcac tccagcctga gcaacaagag tgaaactcgg tctcaaaaaa aaaaaaaaaa 28141 aactcaggct ctgaggtatg aaaacctttt agattcccac aaaccacttg ttgacaaatg 28201 tttttcctat ttatatgaac tcatcattta ttttcaagat agttcctgat tctttgcttt 28261 aatctcttat tttgctccat tgtcctagtt gtataattta tttccaatat gcatttggtt 28321 tcctattttt tcagcttctc cttataaagt accattttta ctatgactat tttgaaagct 28381 tcttgtatgt ataaagttag tgtgattatt gactttaaca taccacctcc ttctcctcct 28441 agcctcagaa agaagtgttg gtaactggtg agaattttca gacagatttt cctttttttt 28501 tttaatttaa tttaattttt ttttctttct ttgctttctt tttttttttt ttttgagaca 28561 gagtctcact ctgttgccca ggctagagtg cagtggcgct atctcggctc actgcaaact 28621 ccacctccca cgttcaagcg attcacctgc ctcagcctcc tgagtatctg ggattacagg 28681 cgcctgccac tgcgcctggc taaatttttt ttgtattttt agtagagatg gggtttcacc 28741 atgttggccc ggctggtctc gaattcctga cctcgtgatc cacccacctt ggcctcccaa 28801 agtgctggga ttacaggcgt gagccaccgc gccaatttaa tttaattttt taaagttcca 28861 ggatacaggt gccggatgtg taggtttgtt acataggtaa acgtgtgcca tggtggtttg 28921 ctgcacctat caacccgtca cctaggtatt aagctcagca tgcgttagct atttatcctg 28981 atgtgctcca tcccccgacc cctttgacat attccagtgt gtgttgttcc cctccctgtc 29041 tccatgtgtt ctcatcgctc aactctcact tataagtgag aacatttggt gtttggtttt 29101 ctgtttctgt gttagttttc tgaggataat ggcttccagc tccatccata tccctgcaaa 29161 ggacgtgatc tcattctttt tatggctgca tagtattcca tggtgtatat gtaccacatt 29221 ttctttatca ctgatgggca tttgggttga ttccatttct ttgctattgt gaatagtgct 29281 gcaatgaaca tatacatgca tgtatcctta taacagaatg ataataatag aatgatttat 29341 attcctttgg atatatacac agtaatggga ttgctgggtc aaatggtatt tctggtttta 29401 ggtctttgag gaattgctac actgtcttcc acaatggttg aactaattta ccttcccacc 29461 aacagtggaa aagcattcct atttctccac caccttgcca gcatctgttg tttcttgacg 29521 ttttaataac agcccactga acagcaaaaa ccttagtgtg aaaaacaatt atggagtctg 29581 ctcctgggca atgcaagaaa tatttggcac cttatataca attatatact tttctcctga 29641 ctttttgagt tttggcttct tggtgtgggt cagtggctcc atagtctttg tcctatacag 29701 acacaagcag caagtccaac acatttgtag caagagactc tcctcaaatc agtataagta 29761 aaattaggaa aatattaaga tgatagtgag ggccgggtgt ggtggctcat gcctgtaatc 29821 tcagcacttt gggaggccga tgcaggcgga tcacgaggtc aggagataga gaccatcctg 29881 gctaacacgg tgaaaccccg tctctactaa aaatacaaaa aattagccgg acattgtggc 29941 gggcacctgt agtcccagct actcgggagg ctgaggcagg agaatggcat gaacccagga 30001 ggtggagctt gcagtgagcc gagatcgcgc cactgcactc cagcctgggc gacagagtga 30061 gactccatct caaaaaaaaa aaaaaaagat gatagtgaat atatcaaaat caatatcatc 30121 attgtgatat tatactgtaa ttgccaaatg taaccatggg tggaaactgg gccatgtgta 30181 gaggctctct gagttatttt ttaaactgct tgtgagtcta aaattaaatt aaaattattg 30241 ttaataaaat ggtcagtaaa aatgctcagg aatatgtaag gcaaaaataa ggcattataa 30301 aaccttgtcc gtcattatgt tttaatacac atctcctcac atacttatca tttctttctg 30361 atgagaacat ttaaaatcta ctcttttggc aattttggaa tagacattaa caatagtcac 30421 cacgccgtgc aacagatcac tagaacacat tccttgtgtc taactgaaac ttcgtatgct 30481 ttcacaaaaa cctcccgttt ttcagtttgc cctctatctc tttagcctct ggtaaccacc 30541 attctactct ctacttttat aaactcaact tgcctaaatt ctgcatacaa gtgagaccat 30601 gaccatgtgg tgttggtctt tttggacttt ctttgaatgt gtcctgcctt cagagatgtg 30661 ttgacctaca agaaaaatgt attgaagcat accactataa tgtgtgcaca tttctgaatg 30721 tcatgtataa gatacctatt tttgccactc ctattcagta tagtactaga ggtcctagcc 30781 agagaaatca ggcaagaaaa atacctagct aactaaataa aagatattcg ggtgagaaaa 30841 tataaagtaa aattgttttt ctttgttgta atatgatctt ataaactctt agaactgggg 30901 aacaaattta atatagttgt aagacacaaa atcaatatac aaaaatcagt agcattgcgg 30961 tacaccaata atgaaatagc caaaacagaa atcaagaagg caattccatt tgcaatagct 31021 ataaaaagca aaataaaata aaatagaaca ctgagaaata tttaatcaag aggggaaaga 31081 catcaaggaa aactacaaag cattgataaa ataaactgaa gaggacacaa acaaatggaa 31141 agacatccca tgctcatgta ttcaaagaat acatattgtt aaaataacag acagtactac 31201 ccaaaacaat ctaaagattt aatgcaagcc caatcaacat acaaatgcca ttttttcaca 31261 gatatagaaa aaaattctaa aatttgtatg gatcttcaaa agagcccaaa tagccaaagc 31321 aatcctgagc caaaagaaca aacctggagc atcacactac ctgacttcaa caccttacac 31381 agccatagta tccaaaacag catagtattg gtattttgtt tttgtttttt gagacggaga 31441 ctcgctgtgt cgcccaggct ggagtgcagt ggcgccacct cggctcactg caagctccgc 31501 ctcccgggtt cacgccattc tcctgcctca gcctccccag tagctgggac tacaggcgcc 31561 cgccaccacg cctggctaat tttttgtatt tttagtagag acggggtttc accatattag 31621 ccaggttggt cttgatctcc tgacctcgtg atctgcccgt ctcggcctcc caaagtgctg 31681 ggattacagg cgtgagccac cgcgcccggc ctaggacttc tttttaacat atgtgttgag 31741 attccaatct ttgtggttgc tatttctttg atgattaatt tatagattac aatgattatt 31801 actatgcagt caaatcacaa tgtattttct atagaaatta atacaaaata tatatttaaa 31861 ataaaatgag ttagtttagt cttccaaaaa tttttgtagt ttagagatta tttaaattat 31921 ttgagttagc atttgtactt gctttgttct atggtgaaag ctttgctacc aattacattg 31981 taggtttatt tgtttcaatt catcaatacc caatggtgtt taccaagata tgattaagac 32041 aaggttttac aatatcttat tttcatctta catatcactg ggctatttaa ttgaaaattt 32101 tattgaaata attatagatt cacaagcagt tgtaggaaat aaaacagaaa gagcctggcc 32161 catatcgccc agtttttcct catggttaca tttgcaaatt atagtataat atcacaataa 32221 tgttattaac tttgatagat cttatatctt ttaaatattt cccttgtttt atttctattc 32281 ctttgtgagt gtagaacccc tgagttttat ggattttata atttcatttt aattgaacat 32341 tatttaggaa ttatataaag tatttatgtt tgtaaaaacc caaagatata tcacagaata 32401 attccataga atcctaactt tcatttctgt ttccccacct gtctgtccct atatacatat 32461 gcctctgctg gcttcaaata attcagcatc ctttgtacac tcttctgacc catgcctttt 32521 tcacttaagc atataccctt aagtaaaacc atatatgcag ttcaacctta atgcaaacta 32581 tgtgttcata caaagtactg agctagactg ggcgcagtgg cacacgcctg taatcccagc 32641 actttgggag gccgaggtgg gcaggtcacc tgtggccagg agttcgagat cagcttgacc 32701 aacacggaga aactccgtct ctactaaaaa tacaaaaaat tatccgggcg tggtggcgca 32761 tgcctgtaat cccagctact caggaggctg aggcaggaga atcacttgaa cccgggaggc 32821 agaggttgtg gtgagccgag attacaccat tgcactccag cctgggcaac aagagtgcaa 32881 ctccacctca aaaaaaaaaa aaaaaaaaag aatgagatca tgtcctttgc agggacatgg 32941 atgaagctag aagccattat ccttagcaaa ctaatgcagg aacagaaaac caaacaccac 33001 atgttctcac ttataagtgg gagctgaaca atgagaacac atggacacag ggaggggaac 33061 aatacacact ggggcctgtc gagggatggg gctgggggag ggagagcatt aggaaaaatg 33121 gctaatatat gctgggctta atacctaggt gatgggttga taggtgcagc aaatcaccat 33181 ggcacatgtt tacctattta acaaacctcc acattctgca catgtacccc agaactaaaa 33241 attaaaataa aaaaaactcg ttttgttgat cttttgtact ttaaaatctc tatttcattt 33301 atttctgctc tgatctttat ttccttctac caattttgag tttaatttct ttgttttagt 33361 tttcttcttt gttttagttt taggtttttt gctttagttt cttttgggtt tttttctttg 33421 ttaatgtatt ctctgaatga tctgcccatt gctgaagatg aggtgttgaa gttctctggt 33481 attactgtat tgaagtctat ccttttgatc tactaatatt tgctttatgt gtttgtgcgt 33541 gccctaatgt tggggtcata tatatttaca attgttacat ctttctaaat tgactttttt 33601 cattatataa ttactttctc ttttttagtt tttaacttga aaaactgcat tatttaaaaa 33661 taattatttt acctgatata aatatagcta gacctgttcc attttggttt tcatttgcat 33721 ggcatatctt tttctatccc ttcactctca gtctgtacat ctttataggt gtagtgagtt 33781 ttcttgtagg cagcatatag ttgggtcttt ttttttttca atccattcaa ctactctatg 33841 tcttttagtt gaagaatttc atccatttac attaaatctt atatttatag ttagggactt 33901 actacttcca ttttgttatt tgttttctaa gtcttctcta attttcttac tccctttttt 33961 gtggataagt gattttctct ggtagtatga tttaatcctt tgctttttgt ttttagtatc 34021 tctactataa atttttgcat tatggttgcc atgagacata aaaagtatct tacagttaca 34081 gcaagttatt ttaaactgat gacaacttga ctttgatcac aatgaaaaga aataagcaaa 34141 gaataaacta aaaatctcta cactataact catcactcct tttcttttcg ttgtctcact 34201 ttatatcttt ttatatatct cctaaaaagc tattcagtca tcattcttga cagatttgta 34261 ttttagtgct cacactaaag atatgagggg ttcacacacc agaactacag tgttaaaata 34321 ttctcaattt gtctgtgtat ttacttttgc cagtaagttt tactccttca gatgatttct 34381 tgttgcacat tagcattctt ttctttcaga ctgatgaact ctctttacta tttcttgtaa 34441 gacaggcctg gtattgatga attccctcag cttttgtttg ctgggaaagc ctttatttcg 34501 ccttcatggt tgaagaatat atttgatgac tagaatattc ttggttgaag atggttccct 34561 tcagcacttt aaatacgtct tttcactttc ttctggccta tgagatttct gctgagaaga 34621 ctgctgccaa tctactggag ttcctttata tgttatttgt ttcttttctc ttgctacctt 34681 tcctcttttt catctttgac ctttgagagt ttgattgtta tatgtcttgt gttgaatctg 34741 gttggcattc tttgactttc ttatacctgg atattcatat atatatgaat atgaatcata 34801 tatctatatg aatatatata tgaatatgaa tcatatatat gaatatatat atatttggaa 34861 agtactctat aattatttct ttgaattaac tgtctatccc tacatctttc tctactccct 34921 ttttaaaggt aatcactctt agacttgccc ttttgtggct attttctata tcttttaagc 34981 acagttcatt ctttttcttt tttctcctct gtattttcaa tagcctgtat ttaagctaac 35041 aaattcttct gcttaatcaa ttatgctgtt gttatattct gatgtatttt tcagttcatc 35101 cattgtcttt ttcagcacta ggatttctgt ttgatttttt tttttttttt tggtaattat 35161 ttcaatctct ttgttaaatt tctctgctaa aattctgaat tctttcttca tgttttcttg 35221 acattctttg agcttcttca agacagatat tttgaattcc ctgtctgaaa ggtattccct 35281 atctgataga tctctatcac tccagggctg gtcactgggc agcttattta gtccatttga 35341 tgagatcata ttttcttgaa tgttctcaat acttgtggat atttgtacat gtctgaacac 35401 tgaagagttt cttatttatt ccagtcttca cagtctggcc ttgcttgtac tcattcttct 35461 tcagagggcc ttcaaagaat tcaaagggga ctgagtgctg agttccctaa gctgtggtca 35521 ctgcagccat ttcagcacta gagggtgcca taatcccaag tatgcaataa atcttgcaga 35581 ctcctagata cccagcctag atagatgtgg gaaagagaag acaatgttct gggttcccag 35641 gcaaagtccc ttactctctt ctctctcatt ccccgaagta taagaagtcc ctctatatat 35701 accaggctgc ctgtaattgg gggagacgtg acacagggac tcctgtgtcc atcacagctg 35761 gcatcacagt aatgctagtc taaacccatg gactctatga ccagcacagt actggggatt 35821 gcccaaggtc cacagtcagt actacctgac tgccactgac gtttattcaa ggcttgaggc 35881 cactttagtc agcaggtggt aaaaccagcc agaacttgtg tccaaccaac caggacagca 35941 gattcccttc tggcctgggt tgggtctagc cgcactgtcc aggaagaaag gcctgggatc 36001 ggtggcttca ggattctact tggcgcttta ttttcctatg actgagctgg tactcaagtt 36061 gaaaaacaaa gtcatctgta ctcttccctc tcctttcctc aagcagaagg agtctctttc 36121 tgagccacac tgcctggagt tgggggaggg gtgtggcaca ctcccttggc tgctacggct 36181 ggtgtcatgc tgggtcacat gcacccaaag tccacagcct tagagaccac tgcagcacca 36241 gggcttgccc aaggactgca gtccttgtga cctaactgcc actcaaactt attctgggcc 36301 ccaggccact tcggtcagct gttggtggtg ccagctgaga ctcaggttcc tttcacgggg 36361 gttgaaagga atttgcttct ggccccaggc tggtgtaaat gctgtctcca tgggcacaag 36421 tagaattgtg ccgtgttgtg ttcttctgtg cctgggcagc accaggttcc aatgcagtcc 36481 cacactcact tcgtgctctc ttctacaagc acacagattt ttctctgcat gttggtctgc 36541 caagggatgg gaaaggggtg gtttaagcaa tgcaagacta cctttcctac ccacttcaat 36601 gcctttcttc ttgatattat gttaaaatca ggtactccga tcactcatct agttttctgg 36661 ttcttatgaa tgtgctttct tgcatgagta gttgttcagc tcgtgttcct gtggggagaa 36721 aattgatgaa tgtttctctt tggccttctt gctccactga tttttctttt tctagatttt 36781 tacataaatt gaatgatact acatgtaatc atctgcatct ggcttctttc actgagctta 36841 aggcttttga gatttgtcca tattatgtat aggcatcctt ttatattgct ggatagtgtt 36901 ccatttcata gagatgctac aatcgattgc tgccatggac attattgtgc atacccaaaa 36961 tttatatgtt gaaatcctaa ccccctagta cctcaaaatg tgatgtattt ggaggtagga 37021 ctttaaagaa ttaagttgaa atcagcccat taggttggac cctaatctta ccagtgtcct 37081 tataagaaat taagacacac agagacacca ggggtacaca tgcgtacagt gatggccttg 37141 tgaagagaca gaaagagggc agccatgtgc aagccaagga gagtggcctc agagaaatcc 37201 aacactgcca gcatcttaat cttggacttt agaattgtta gaaaattaat ttctgttgtt 37261 tgagccaccc agtctttggt attgtgttat ggcagcccta gaaaattaat acaattactc 37321 attcacctat ggatggtcat ttgagaattt tttttaagtg ttctagttat tggttagatg 37381 tgaagtttcc ataccttctc cccacaaagt caaggcacat caccctcctg gtacatcaac 37441 atgttcacca accaggaaac tccaccaagc ctctgcgtca agagttctaa gttggtaggg 37501 aggaacgagg agggggtatc tcattttgta gctgtgatta aatcacaacc acatggctga 37561 cttcaatctc tagttccttt cctctcccca aaggtcaatc tggctcaaag tttcaaccct 37621 cttattacat ggttggtctt tctgatgaag agcccccatt ctgaaccttt ctaggagcct 37681 ataatgaatc acttcgttag tataaaaaat acacttctct cactcaaaga attacaagat 37741 ttttgtagta ctaagcctgg gacaaagatc cagatatatt ttttattaca ccacagactg 37801 tatttaatat aataattaat agattttcta ctgttaaacc aactttaaga tcctaacaaa 37861 tctcaaggct gggtgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggca 37921 ggcaaatcat gaggtcaaga gattaaggcc atcctggcca acatgatgaa accctgtgtc 37981 tacaaaaaat acaaaaatta gctgggcatg gtggcgtgca cctgtaatcc cggctactca 38041 ggatgatgag gcaggagaat tgcttgaacc caggaggcgg acgttacagt gagctgagat 38101 cgcgccactg cactccagcc tggcaataga gggagactcc gtctcaaaaa aaaaaaaaat 38161 cctaacaaat ctcactagaa tatgacttat tctttctctc taagagatat atagatatat 38221 gtattaaata tctttagtag aggatttttc agatctcgag tgagcctggt cagttggtat 38281 ctttcaatgg atttttgttt tccaagtttc aacttactga gaaagggttg tttaaaacaa 38341 tcacacatta tccattatat gtttcaaagt tatctagtac tatactatgt ttcattgctc 38401 ctatctgtaa tatgtgattt acgttctttc tgattaaaaa aaggagacac cttgaataaa 38461 ttattaaaat tgatagacct tccactatat cagaaagaaa aaacatgggt tcatagattt 38521 tctcaaaaaa gaaaaaacta aataaggtta ctgtgaaaat taattctgag gccctagtac 38581 aaagatggag agctcagagc ccaaaagtca tcctgtgtta aggagacata gctaagagtc 38641 tcggtttgcc aaggtagctg gagtctacag agcagagtac aggagagaaa aaacttcaga 38701 gatatagagg gccatcccca agtattcagc tgagtataat ctgtgtttgc atatagaaaa 38761 agaccagagg atgggtaaag aaccacctga aaggattata cgggaagcaa ttcccaaggc 38821 ttatacagaa aacatgatag ttctcatttc aaccagcttg agtaaaaaaa actcataact 38881 catgaagcat taggtagaga atataggaag gttgtgtctc agtaatggga ggggaatttg 38941 tcttaaaatg tatgctgcaa taatctcatc gagaaatgtt tagaatcaag actccccaaa 39001 agatcatatt gtttccaagt aacttaacca tgtaacagaa aaaagaaatc aagaatatct 39061 atagaaaatt aaaaaaaaat cagtacctaa aattcagtgt ccattcaaaa ttaaaccatg 39121 ttaacagtat cagacaaggg agattttaga gaaagaatat taccagagat taacaggatc 39181 ataatgataa agtggtcaac acatcaagtg gacataaaat aggaaatatt tattattcac 39241 ctaataaaaa catactaaaa tatgtgaagc aaaatctgac agaactgcag agaagtaaat 39301 ccaatttaat attcagagat gtcaacaatc ctcttaaaat aattgctgga accaaaacca 39361 cagaatgtca ataaagatag aaaaaactta aataacatca accattttta cctaatcaac 39421 acttgtagaa ccccttcatc aaacgagagt acagtataca tttcaagtga acatgtaact 39481 cataccaaga tagcccatat tctgggccac aaagcaaatc tcaataaatg taaataattc 39541 aagtcttaat gtactctgac cacaatggaa ttattgcaga aactaccaga aagataagca 39601 aaatccccaa atatttagga attaaataac atacatttaa aaagagccat gtgtcaagaa 39661 gaattttttt aactggaaaa catcctaaac tcaactaaaa tgaaaccaca tcggctgggc 39721 tcacgcctgt aatcccagca ctttgggagg ctgaggcagg cggatcacaa gatcaggaga 39781 tcgagaccat cctggctaac acggtgaaac ctcgtctcta ctaaaaatac aaaaaattag 39841 ccaggcatgg tggcaggcgc ctgtagtcca agctactcgg gaggctgagg caggagaatg 39901 gtgtgaaccc aggaggcaga gcttgcagtg agccgagatc gcgccactgc actccaacct 39961 ggatgacaga gcaagacact gtctcaaaaa aaaaaatgaa accacatcaa gcttgatggg 40021 aggtaattaa agcagttctc agatagattt ttaacattaa catctcttcc ccatattagg 40081 ggaaaaaaac ccaaaggttt gaaatgaagg acctcagctt ccatgttaaa aagtagcaaa 40141 agaaatgaaa tccaataaaa gcggaaggca ttagtaaaac tcaaaatgga aaagaaaaat 40201 cccaaaatat cagaaaacag aaaattcaat aataccaagt actttaagat caataaaatt 40261 gtaaaattga taaatctcta cccaaaccaa tcaaggttaa aagaaaaata tagacattac 40321 cagtatcagg aatagaagag gttgcaccac tacaaattct acaagtatga aaataatatt 40381 ttgaaaaatt tgagtactta aataaaacag aaaaactcct tgaaagacac aaaccatcaa 40441 agctcactct agagatagat ggatggatgg agtgacaaac agatggacgg acagacagat 40501 gtatatataa ctagtatagc cctccatcta ccatggaaat caaacttgta gttagatgct 40561 gtcctacaaa acacacttca gggccagagt gacagacaga caacacacac acagacagat 40621 gtacatataa cttgtacagc cctctatcta ccacagaaat caaacttata gttagaaact 40681 ttcctacaaa acaaactcca ggcccagaca gcttccttgg tgaattctac caaacattta 40741 aggaagaaat aataccaatt attcacaaac tcttccagag agctgaagag caggaatact 40801 tcccaactta ctctataaag ctggcattat gctgatacca aagccaaacc tgggcaagat 40861 ataaaattac aaagaaaaaa aagcaagcaa gcaagaaaga aagttaggga caaatatcac 40921 tcgtgaacat tggtgcaaaa cttgttacaa aaaaacaaaa caaaacaaaa caaaatagca 40981 aatcaaattc aacaatatac taaatgacta atacatcatg accaaatggg aattattaca 41041 gaaatgcaag gttggcttaa catttaaaaa aatcagtcaa tgtaattcat attaaacaac 41101 taaaacagaa aaagaaaaaa accatatagg attatggtca ccagagtcca agaaaacgcc 41161 ccacaaatcc ctattttcac ctttgtgtag ttctcttcca cttttaacca ggattagcta 41221 gggttactta tgaaatatag cagaaatgac ggctcatcac tttcagttta aggtaatgac 41281 agatactgtg gctttgttct tgtgtgagtt ctctcctgga tc // LOCUS AF001687 2794 bp DNA PRI 01-DEC-1997 DEFINITION Homo sapiens U4/U6 snRNP 60 kDa protein gene, complete cds. ACCESSION AF001687 NID g2653735 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2794) AUTHORS Lauber,J., Plessel,G., Prehm,S., Will,C.L., Groening,K. and Luehrmann,R. TITLE The human U4/U6 snRNP contains 60 and 90kD proteins that are structurally homologous to the yeast splicing factors Prp4p and Prp3p JOURNAL RNA (1997) In press REFERENCE 2 (bases 1 to 2794) AUTHORS Lauber,J., Plessel,G., Prehm,S., Will,C.L., Groening,K. and Luehrmann,R. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) Institut fuer Molekularbiologie und Tumorforschung, Emil Mannkopff Str. 2, Marburg 35037, Germany FEATURES Location/Qualifiers source 1..2794 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 94..1659 /codon_start=1 /product="U4/U6 snRNP 60 kDa protein" /db_xref="PID:g2653736" /translation="MASSRASSTATKTKAPDDLVAPVVKKPHIYYGSLEEKERERLAK GESGILGKDGLKAGIEAGNINITSGEVFEIEEHISERQAELLAEFERRKRARQINVST DDSEVKACLRALGEPITLFGEGPAERRERLKNILSVVDTDALKKTKKDDEKSKKSKEE YQQTWYHEGPNSLKVARLWIANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLR SLNNFCSQIGDDRPISYCHFSPNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVG AIVFHPKSTVSLDPKDVNLASCAADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSG RFLGTTCYDRSWRLWDLEAQEEILHQEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWD LRTGRCIMFLEDHLKEIYGINFSPNGYHIATGSGDNTCKVWDLRQRRCVYTIPPHQNL VTGVKFEPIHGNFLLTGAYDNTAKIWTHPGWSPLKTLAGHEGKVMGLDISSDGQLIAT YSYDRTFKLWMLE" BASE COUNT 744 a 633 c 708 g 709 t ORIGIN 1 gggcggagca cttcccctct gctgggcgcg cggtggacgg tctgaaaggg agtgttcggg 61 tttcgctggg gcctcgcggc tccagagccc agcatggctt cctcgcgagc ctcttccacg 121 gcaaccaaaa ctaaagcacc cgacgactta gttgctccgg tcgtgaaaaa accacacatc 181 tattatggaa gtttggagga gaaggagagg gagcgtctgg ccaaaggaga gtctgggatt 241 ttggggaaag acggacttaa agcagggatc gaagctggaa atattaatat aacctctgga 301 gaagtgttcg agattgaaga gcatatcagc gagcgacagg cagaattatt ggctgagttt 361 gagagaagga agcgagcccg gcagatcaat gtttccacag atgactcaga ggtcaaagct 421 tgccttagag ccttggggga acccatcaca ctttttggag agggtcctgc tgaaagaaga 481 gaaaggttaa aaaatatcct ctcagttgtc gatactgatg ccttgaaaaa gaccaaaaag 541 gatgatgaga agtctaaaaa gtccaaagaa gagtatcagc aaacctggta tcatgaagga 601 ccaaatagct tgaaggtggc aagactatgg attgctaatt attcgttgcc cagggcaatg 661 aaacgcttgg aagaggcccg actccataag gagattcctg agacaacaag gacctcccag 721 atgcaagagc tgcacaagtc tctccggtct ttgaataatt tttgcagtca gattggggat 781 gatcggccta tctcctactg tcactttagt cccaattcca agatgctggc cacagcttgt 841 tggagtgggc tttgcaagct ctggtctgtt cctgattgca acctccttca cactcttcga 901 gggcataaca caaatgtagg agcaattgta ttccatccca aatccactgt ctccttggac 961 ccaaaagatg tcaacctggc ctcttgtgcg gctgatggct ctgtgaagct ttggagtctt 1021 gacagtgatg aaccagtggc agatattgaa ggccatacag tgcgtgtggc gcgggtaatg 1081 tggcatcctt caggacgttt cctgggcacc acctgctatg accgttcatg gcgcttatgg 1141 gatttggagg ctcaagagga gatcctgcat caggaaggcc atagcatggg tgtgtatgac 1201 attgccttcc atcaagatgg ctctttggct ggcactgggg gactggatgc atttggtcga 1261 gtttgggacc tacgcacagg acgttgtatc atgttcttag aagaccacct gaaagaaatc 1321 tatggaataa atttctcccc caatggctat cacattgcaa ccggcagtgg tgacaacacc 1381 tgcaaagtgt gggacctccg acagcggcgt tgcgtctaca ccatccctcc tcatcagaac 1441 ttagtgactg gtgtcaagtt tgagcctatc catgggaact tcttgcttac tggtgcctat 1501 gataacacag ccaagatctg gacgcaccca ggctggtccc cgctgaagac tctggctggc 1561 cacgaaggca aagtgatggg cctagatatt tcttccgatg ggcagctcat agccacttac 1621 tcatatgaca ggaccttcaa gctgtggatg ctggaataga tgacaatggg aaaaggactt 1681 gaacctcaag ctctctctaa ggagctgttt tcctcaaacg agaagaattg aagtgttagt 1741 tctatcatgt tttctgccaa ttaccatgca tagaccctca gtagaattgg atttccatgt 1801 cagcccccac tccaggaagg cagcccaatc cctaggtgat ggggaacccc tctcacggtt 1861 caaaatttat taccttttta cgccctgcca cgaactgtgt agacattgtt tttattaatc 1921 ttttgtttgg ccgggcgtgg tggctcacgc ctgtaatcct agcactttgg gaggccgagg 1981 tgggtagatc gcttgagctc aggagttcaa gatgagcctg ggcaacatgg caaatgccgt 2041 ctctgcaaaa aaatactaaa attagctggt cgcggtggct tctgcctgtg attccggcta 2101 cttgggaggc tgaggtggga gggattgctt aagcctggga ggtagaggtt gcagtgagcc 2161 gagattgcgc cattgcactc tagcctgtgt gacagagcaa gaccctgtct caaaaaaaaa 2221 aaaaaatttg ttcgaatgcc ttatagcctt cctcacagca cccaggattg tgactgactc 2281 tgcattttta attcttgaaa cttggctttc cataacatgg tacatgcttc aggcctacat 2341 atgacccaga gagcaaggtg gctgaactat agtctggaag ccctcaggta aagaggcaca 2401 tctcaccact cattgcttaa acaattgatt catagcgagc acttttcctt tccctggaga 2461 atgggatgtg aagcagtaga ccgcagccac gccgatggtt atacagtgaa gaagacttca 2521 cctcttccta ttgagtttgc ttggaatgct gacagctcag gcactctgaa ctgaacattt 2581 gctttgtcag aaaatatctt tttttttacc tttgaagttt ggcaaccttc atgttacccc 2641 aaagcaaaac cattgtgtca ggagtcaaac aaatgtttag aaagcaaaca tgacgtctct 2701 attgtacaac ctcctttctc ttggctgttt aaaggatgta cttcgtgtat taaagggtac 2761 tttatgttga gtacgaaaaa aaaaaaaaaa aaaa // LOCUS AF005419 1113 bp DNA PRI 09-AUG-1997 DEFINITION Homo sapiens P2Y5-like receptor gene, complete cds. ACCESSION AF005419 NID g2240034 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1113) AUTHORS Janssens,R., Boeynaems,J.M., Godart,M. and Communi,D. TITLE Cloning of a human heptahelical receptor closely related to the P2Y5 receptor JOURNAL Biochem. Biophys. Res. Commun. 236 (1), 106-112 (1997) MEDLINE 97366605 REFERENCE 2 (bases 1 to 1113) AUTHORS Janssens,R., Boeynaems,J.M., Godart,M. and Communi,D. TITLE Direct Submission JOURNAL Submitted (26-MAY-1997) Institute for Interdisciplinary Research, Universite Libre de Bruxelles, Route de Lennik 808, Building C, C5-145, Brussels 1070, Belgium FEATURES Location/Qualifiers source 1..1113 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lDashII" mRNA 1..1113 /product="P2Y5-like receptor" CDS 1..1113 /codon_start=1 /product="P2Y5-like receptor" /db_xref="PID:g2240035" /translation="MGDRRFIDFQFQDSNSSLRPRLGNATANNTCIVDDSFKYNLNGA VYSVVFILGLITNSVSLFVFCFRMKMRSETAIFITNLAVSDLLFVCTLPFKIFYNFNR HWPFGDTLCKISGTAFLTNIYGSMLFLTCISVDRFLAIVYPFRSRTIRTRRNSAIVCA GVWILVLSGGISASLFSTTNVNNATTTCFEGLSKRVWKTYLSKITIFIEVVGFIIPLI LNVSCSSVVLRTLRKPATLSQIGTNKKKVLKMITVHMAVFVVCFVPYNSVLFLYALVR SQAITNCFLERFAKIMYPITLCLATLNCCFDPFIYYFTLESFQKSFYINAHIRMESLF KTETPLTTKPSLPAIQEEVSDQTTNNGGELMLESTF" BASE COUNT 276 a 262 c 201 g 374 t ORIGIN 1 atgggtgaca gaagattcat tgacttccaa ttccaagatt caaattcaag cctcagaccc 61 aggttgggca atgctactgc caataatact tgcattgttg atgattcctt caagtataat 121 ctcaatggtg ctgtctacag tgttgtattc atcttgggtc tgataaccaa cagtgtctct 181 ctgtttgtct tctgtttccg catgaaaatg agaagtgaga ctgctatttt tatcaccaat 241 ctagctgtct ctgatttgct ttttgtctgt acactacctt ttaaaatatt ttacaacttc 301 aaccgccact ggccttttgg tgacaccctc tgcaagatct ctggaactgc attccttacc 361 aacatctatg ggagcatgct ctttctcacc tgtattagtg tggatcgttt cctggccatt 421 gtctatcctt ttcgatctcg tactattagg actaggagga attctgccat tgtgtgtgct 481 ggtgtctgga tcctagtcct cagtggcggt atttcagcct ctttgttttc caccactaat 541 gtcaacaatg caaccaccac ctgctttgaa ggcctctcca aacgtgtctg gaagacttat 601 ttatccaaga tcacaatatt tattgaagtt gttgggttta tcattcctct aatattgaat 661 gtctcttgct cttctgtggt gctgagaact cttcgcaagc ctgctactct gtctcaaatt 721 gggaccaata agaaaaaagt actgaaaatg atcacagtac atatggcagt ctttgtggta 781 tgctttgtac cctacaactc tgtcctcttc ttgtatgccc tggtgcgctc ccaagctatt 841 actaattgct ttttggaaag atttgcaaag atcatgtacc caatcacctt gtgccttgca 901 actctgaact gttgttttga ccctttcatc tattacttca cccttgaatc ctttcagaag 961 tccttctaca tcaatgccca catcagaatg gagtccctgt ttaagactga aacacctttg 1021 accacaaagc cttcccttcc agctattcaa gaggaagtga gtgatcaaac aacaaataat 1081 ggtggtgaat taatgctaga atccaccttt tag // LOCUS AF007189 1601 bp DNA PRI 22-JAN-1998 DEFINITION Homo sapiens rat ventral prostate.1 homolog gene, complete cds. ACCESSION AF007189 NID g2459927 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1601) AUTHORS Peacock,R.E., Keen,T.J. and Inglehearn,C.F. TITLE Analysis of a human gene homologous to rat ventral prostate.1 protein JOURNAL Genomics 46 (3), 443-449 (1997) MEDLINE 98110580 REFERENCE 2 (bases 1 to 1601) AUTHORS Keen,T.J. TITLE Direct Submission JOURNAL Submitted (04-JUN-1997) Department of Molecular Genetics, Institute of Ophthalmology, University College London, Bath St., London EC1V 9EL, UK FEATURES Location/Qualifiers source 1..1601 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q11" mRNA <477..>1139 /note="RVP.1 homolog" /product="rat ventral prostate.1 homolog" CDS 477..1139 /note="RVP.1 homolog; similar to Rattus norvegicus ORF, encoded by GenBank Accession Number M74067" /codon_start=1 /product="rat ventral prostate.1 homolog" /db_xref="PID:g2459928" /translation="MSMGLEITGTALAVLGWLGTIVCCALPMWRVSAFIGSNIITSQN IWEGLWMNCVVQSTGQMQCKVYDSLLALPQDLQAARALIVVAILLAAFGLLVALVGAQ CTNCVQDDTAKAKITIVAGVLFLLAALLTLVPVSWSANTIIRDFYNPVVPEAQKREMG AGLYVGWAAAALQLLGGALLCCSCPPREKKYTATKVVYSAPRSTGPGASLGTGYDRKD YV" polyA_signal 1508..1513 BASE COUNT 251 a 595 c 520 g 235 t ORIGIN 1 ccgagcagag agggacaagg accgaggggc gggggctggg tcaggtcccg cccttccttg 61 gcaacccccc taccccggcg ttgtgggccg cgcagggcaa gtccttccaa gtccttcctc 121 ccaggacgct cggtgaaggt gggaggcagg ggccacgtca ctgtccttag gccctggaga 181 gcgcggctcc gcccctaccg cccccaccgg agcgctgggc acttagctaa gacgcaccgg 241 ccccagccca gggccagccc agtcgccgcc gcccgcccac aaagccacag gcaggtgcag 301 gcgcagccgc ggcgagagcg tatggagccg agccgttagc gcgcgccgtc ggtgagtcag 361 tccgtccgtc cgtccgtccg tcggggcgcc gcagctcccg ccaggcccag cggccccggc 421 ccctcgtctc cccgcacccg gagccacccg gtggagcggg ccttgccgcg gcagccatgt 481 ccatgggcct ggagatcacg ggcaccgcgc tggccgtgct gggctggctg ggcaccatcg 541 tgtgctgcgc gttgcccatg tggcgcgtgt cggccttcat cggcagcaac atcatcacgt 601 cgcagaacat ctgggagggc ctgtggatga actgcgtggt gcagagcacc ggccagatgc 661 agtgcaaggt gtacgactcg ctgctggcac tgccacagga ccttcaggcg gcccgcgccc 721 tcatcgtggt ggccatcctg ctggccgcct tcgggctgct agtggcgctg gtgggcgccc 781 agtgcaccaa ctgcgtgcag gacgacacgg ccaaggccaa gatcaccatc gtggcaggcg 841 tgctgttcct tctcgccgcc ctgctcaccc tcgtgccggt gtcctggtcg gccaacacca 901 ttatccggga cttctacaac cccgtggtgc ccgaggcgca gaagcgcgag atgggcgcgg 961 gcctgtacgt gggctgggcg gccgcggcgc tgcagctgct ggggggcgcg ctgctctgct 1021 gctcgtgtcc cccacgcgag aagaagtaca cggccaccaa ggtcgtctac tccgcgccgc 1081 gctccaccgg cccgggagcc agcctgggca caggctacga ccgcaaggac tacgtctaag 1141 ggacagacgc agggagaccc caccaccacc accaccacca acaccaccac caccaccgcg 1201 agctggagcg cgcaccaggc catccagcgt gcagccttgc ctcggaggcc agcccacccc 1261 cagaagccag gaagcccccg cgctggactg gggcagcttc cccagcagcc acggctttgc 1321 gggccgggca gtcgacttcg gggcccaggg accaacctgc atggactgtg aaacctcacc 1381 cttctggagc acggggcctg ggtgaccgcc aatacttgac caccccgtcg agccccatcg 1441 ggccgctgcc cccatgctcg cgctgggcag ggaccggcag ccctggaagg ggcacttgat 1501 atttttcaat aaaagccttt cgttttgcac tcgctggagc ctccttgttc ccagcgcctc 1561 cacctccctg cagagctccc cctcggggcg gccttggctg g // LOCUS AF008216 5785 bp DNA PRI 16-JAN-1998 DEFINITION Homo sapiens candidate tumor suppressor pp32r1 (PP32R1) gene, complete cds. ACCESSION AF008216 NID g2738512 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5785) AUTHORS Kochevar,G.J., Brody,J.R. and Pasternack,G.R. TITLE The Structure of a Gene Encoding pp32r1, a New Member of the pp32 Family JOURNAL Unpublished REFERENCE 2 (bases 1 to 5785) AUTHORS Kochevar,G.J., Brody,J.R. and Pasternack,G.R. TITLE Direct Submission JOURNAL Submitted (13-JUN-1997) Pathology, Johns Hopkins University School of Medicine, 720 Rutland Avenue, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..5785 /organism="Homo sapiens" /db_xref="taxon:9606" gene 4453..5157 /gene="PP32R1" CDS 4453..5157 /gene="PP32R1" /function="candidate tumor suppressor" /note="in contrast to pp32, pp32r1 augments oncogene-mediated transformation of rat embryo fibroblasts (Mol. Biol. Cell 8(SUPPL.): 137A, 1997.); similar to other members of the human pp32 family including pp32, encoded by GenBank Accession Number U73477, SSP29, encoded by GenBank Accession Number U70439, PHAPI2a, encoded by GenBank Accession Number Y07569, PHAPI2b, encoded by GenBank Accession Number Y07570, and APRIL protein, encoded by GenBank Accession Number Y07969" /codon_start=1 /product="pp32r1" /db_xref="PID:g2738513" /translation="MEMGRRIHSELRNRAPSDVKELALDNSRSNEGKLEALTDEFEEL EFLSKINGGLTSISDLPKLKLRKLELRVSGGLEVLAEKCPNLTHLYLSGNKIKDLSTI EPLKQLENLKSLDLFNCEVTNLNDYGENVFKLLLQLTYLDSCYWDHKEAPYSDIEDHV EGLDDEEEGEHEEEYDEDAQVVEDEEGEEEEEEGEEEDVSGGDEEDEEGYNDGEVDGE EDEEELGEEERGQKRK" BASE COUNT 1755 a 1087 c 1162 g 1781 t ORIGIN 1 aagctttcct gatctctaaa tcaaggtcag ctccctaagc tcttggctcc cgtactgaaa 61 ctttttctta tgtaactctc ataaacacat agcataatgt tttgcatgtt tttcttccct 121 atcagttgca agttccagca gagctgatat attttcattt cattcgctac tatagcccta 181 gagcctgaca tagtttctgg ctgtgaatgc tcaataaata tttgtttaat tgagtagaaa 241 cataaagtat ctatttcatt gaaggaaaga ataattagct acatttttct ttttcttgcc 301 ttaatatttg aggaatttgc ttatatgtca taataaaaaa gttaaagcct tatacattat 361 actaaggaat ttggacatta aattcaagct agcctttcta taaacaaaat actgaatttc 421 tgtccctaaa tttgttcctt ccctattctt ccccattgag atgacaccaa atccctctag 481 ctgctcaaac caagtacccg tatgttattc ttaattatct ctttaccttg cttctcatat 541 gcaatttgtt aacaagtcat cttcagtctg tatccattat tctccctttc cagaccacca 601 acatgtcttg actatactgc tacaatagcc tcccaactct tgtcctactt aaaattcatt 661 gtaaaaaatc agtcttggcc gggcacggtg gctcacacct ataatcccag cactttggga 721 gtcccaggcg ggcgggtcac gaggtcaaga gatggagacc atcatggcca acatggtgaa 781 accctgtctc tactataaat acaaaaaaat tatctgggtg tggtggcaca tgcctgtaat 841 cccaactact agggaggctg aggcaggaga atcgcttgaa cctgggaggc ggaggttgca 901 gtgagccgag atcgcaccat tgcactccag cctggcaaca gagcgagact ccatcccaaa 961 acaaaacaaa acaaaaccat gtaaaacatg tctgtaaaac atgtcagatt tcgtgttcag 1021 aagtcttaca tgtcttttca ttatgctaag ataaaaccca aatgcatttt cttggtttct 1081 aaagccaaga aaataagagt tgctttcagc aaccttgttt cttccgccat gcttttccct 1141 agctcactct ttttaggcaa gtcgacctga ttttctttct gttagtctgt ttctgcctcg 1201 tggtctggct ttctttctgt tagtctgttt ccacctcgtg gtcttggtcc tggctcttca 1261 ttctgcctgg aatgctctcc actccagatc cttactagat cttagctcag tcatcaccct 1321 cgcaggaaga tcttccaacc attcacctgc atacacctat ggctgctccc tagagaacat 1381 cattctgttt tcttcacttc ctagcactta ctgctttctg aaattatcta ctttgattgt 1441 ttatttcttt ctttactctt actaggatac ctgggtcatt aaaggaggga tatttctctc 1501 ttatttactg ttataaactt aatgcttagg ctgtagaagt tatacaatat ttgaagaata 1561 aatcgttaaa tgtataacat ttttgaagaa agataattgt gggatccatt tagtttgcaa 1621 acatttgatc tgtgtgttag acagaaggcc atggtaaagg acaaagacat attttatagg 1681 actgtaccct gaaaaataaa taaacttgaa ccagttatac aagacttatg tgcaggaaac 1741 aggtaccagt tatatttaga aatggtaaat caccttctaa gcataactca gagcacaata 1801 tattagaggg tagagagaga agtgcgtctt agatattggt aatcatatta ggactgacgc 1861 catccttgat ttttcttctg ggaaacagct caaaatgact atttaatgtt tacaatgata 1921 tcttgcatct tgccagtaaa taatataata gacactagga atccaaattg taagatgaac 1981 aagtctttat agagggagag ccaaatacac aataaataac acaaggtggt aaatgcagta 2041 atacaaacat acataccatg cataggagtg cagagaaggt gtgcttctcc gaatgcagtc 2101 acccagaaag tccttctgta gaaagggata tcttaaatgg tgcttaaagg aaaagtaacc 2161 aaaggcaact aaagattgca aggaggtccc aggaaaaagc aaaagaacca aaggtacata 2221 ggcacaaaag tagcctgcct tcctgggaac ttccaatagt ttgctggagc acacagttag 2281 aagtactgtg ccatgggagc aaagactgaa gacatatgca ggttcaaggg cacagagccc 2341 catatatgtc atgataagat attgggaagc cactggggag ctactgaaac tttaagcagg 2401 gaaataaaat tgtcatatct acaccttaga aatttgattt ttttctcttc ttttatcttc 2461 tcttctcctc tcttctctct ctctctctct ctctctctct gtgtgtgtgt gtgtgtgtgt 2521 gtgtgtgtgt gacagagtcc tgctctgtca cccaggctgg agtgtagtgg agtgatctcc 2581 gcttactgca gtctctgcct ctcaagcgat tccctgcctc agcctcccga gtagctggga 2641 ttacaggcgg gctctacaac agctggctaa cttttgtatt ttttggtaac aaccaggttt 2701 taccatgttg gccaggctgg tcttgaactc ctgacctcag gtgatctgcc tgccttggct 2761 ttccaaagtg ctgggattac aggcgtgagc caccctgcct ggtgtagaag tttgattttg 2821 atgtcagtgt ggtagatgaa tttgtgggaa gcaaaacaag atagagttca atgacagtga 2881 aaagtttatt gtataagcta tataaaagaa aatgttgaag gtttgaaatc cattagtggc 2941 agtaagggtg tacagaacga aactatttga gaagtacaca aggcaagtct tactttcaag 3001 gcagtttatg taagctcatt caattgtctc agtgttcttg ctatgtgtgg gttataggat 3061 ttggaacata tgatcaatct gagcacacat cagtaaactg aataggatta ttaaaatcca 3121 caagcatttt actagtggaa tctgtgatat tttctagcta ctcttgcttg ttttatttga 3181 atcttttgct catatcctat agtaaagatt tcaggaaata tatttttatt tgcctagaat 3241 tttagccttt tagttttttg aatctattgc tcatattctt atagtaagag tttcagggaa 3301 tgtatttcta tttgtctgga attttagcct ttcaggtttt tgagcccctc ttttgcttat 3361 gggacatagt atgagacaag atgaaatgat acttctattc ccaattcact gatggggaaa 3421 atgaagcaaa aaatgttatt cactcaaggc ttctgccatg tttcctggtg gaattacggc 3481 tcagacacaa atttcctaat gcctgtgctg ctaacttctc aatagaacac tatattaatt 3541 tatcttcttc ctgagtgttt ttccacaaat cccatagcct gtgaaaagat tgttttaggg 3601 aaatattatt tttaatatag catattttgt caatgtggga cataggacta gtacctgctg 3661 aaaaccatct catgatcctt gtgtaagaac taattcacac tagaaatact attttccttg 3721 ctcattaaaa acataaatgt ctcagaaagt aaaaaattat tcctctctaa ataaacatac 3781 atgccactca aattttattc ctctaccact tgccgtatct aaacctagtt agatactttg 3841 gttttaggta taatctgaca gaacagatac aaccaagatc acattgtgag tcagaagtgg 3901 aaaattcata attcatgatg ataccaataa aagatagatt tagcttttta caggatgttt 3961 ttggcatttt attctttcat ttgaggggag atctcaccaa aatatgtctt tcatggttca 4021 ttgtgttatt taatttctgt gatgcatatt ctcaggttac tttaaaccta gtctatagat 4081 tcaaagatat cccgtgtcag gtctctaaaa gtaaaaagaa aaatgggtac ttgtgaaggc 4141 tgattcacag taagtagtgt agaggggagt gccttgtgta ttcacaaatt atcaacgtga 4201 gcatcagata agattttctt tagtcacaca cacctacctt cttactagga agatccatat 4261 acttgaataa ttgttctgct tgacccaggt tacttatcag tccctttatt ataatatttg 4321 taaatattgg ggctcgagaa ccgagcggag ctggttgagt cttcaaagtc ctaaaacgtg 4381 cggccgtggg ttcgaggttt attgattgaa ttcggctggc acgagagcct ctgcagacag 4441 agagcgcgag agatggagat gggcagacgg attcattcag agctgcggaa cagggcgccc 4501 tctgatgtga aagaacttgc cctggacaac agtcggtcga atgaaggcaa actcgaagcc 4561 ctcacagatg aatttgaaga actggaattc ttaagtaaaa tcaacggagg cctcacctca 4621 atctcagact taccaaagtt aaagttgaga aagcttgaac taagagtctc agggggcctg 4681 gaagtattgg cagaaaagtg tccaaacctc acgcatctat atttaagtgg caacaaaatt 4741 aaagacctca gcacaataga gccactgaaa cagttagaaa acctcaagag cttagacctt 4801 ttcaattgcg aggtaaccaa cctgaacgac tacggagaaa acgtgttcaa gcttctcctg 4861 caactcacat atctcgacag ctgttactgg gaccacaagg aggcccctta ctcagatatt 4921 gaggaccacg tggagggcct ggatgacgag gaggagggtg agcatgagga ggagtatgat 4981 gaagatgctc aggtagtgga agatgaggag ggcgaggagg aggaggagga aggtgaagag 5041 gaggacgtga gtggagggga cgaggaggat gaagaaggtt ataacgatgg agaggtagat 5101 ggcgaggaag atgaagaaga gcttggtgaa gaagaaaggg gtcagaagcg aaaatgagaa 5161 cctgaagatg agggagaaga tgatgactaa gtagaataac ctattttgaa aaattcctat 5221 tgtgatttga ctgtttttac ccatatcccc tcccccctcc aatcctgccc cctgaaactt 5281 acttttttct gattgtaaca ttgctgtggg aatgagacgg gaaaagtgta ctgggggttg 5341 tggagggagg gagggcagga ggcggtggac taaaatacta tttttactgc caaataaaat 5401 aatatttgta aatattaact gggatactag ctttgtagaa tgattactat taattattct 5461 ctctctcttt ttattttttt acacattcta ttcttttaag tatagtcctt ttagtccaag 5521 gaaaaggcac tacaatccac ttattaatgc ttgctactgt gttcaagtaa aataagctcc 5581 aggatttaac aaaaagagga aagaaaatat ttacaatgaa aatgttgcta aaaatttaaa 5641 acaaattaca gtaaatgtat tgttaaagca aattctattt ttaaaattta ttaataagga 5701 aataatttgc taaagcaaat ttttggaaaa ataataatgc actttatact tgattttatt 5761 tattaaaaca atgatttata agctt // LOCUS AF014643 3528 bp DNA PRI 02-JAN-1998 DEFINITION Homo sapiens connexin46.6 (Cx46.6) gene, complete cds. ACCESSION AF014643 NID g2738576 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3528) AUTHORS Bloemker,B.K., Swaroop,A. and Kimberling,W.J. TITLE Cloning and Molecular Characterization of Human Connexin46.6, a New Gap Junction Gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 3528) AUTHORS Bloemker,B.K., Swaroop,A. and Kimberling,W.J. TITLE Direct Submission JOURNAL Submitted (02-JUL-1997) Genetics Dept., Boystown National Research Hospital, 555 N 30th St., Omaha, NE 68131, USA FEATURES Location/Qualifiers source 1..3528 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q41-q42" repeat_region 764..834 /note="9-22-9-22-9 bp repeat" mRNA <1386..3472 /gene="Cx46.6" /product="connexin46.6" gene <1386..3472 /gene="Cx46.6" CDS 1414..2724 /gene="Cx46.6" /note="gap junction protein" /codon_start=1 /product="connexin46.6" /db_xref="PID:g2738577" /translation="MSWSFLTRLLEEIHNHSTFVGKVWLTVLVVFRIVLTAVGGEAIY SDEQAKFTCNTRQPGCDNVCYDAFAPLSHVRFWVFHIVVISTPSVMYLGYAVHRLARA SEQERRRALRRRPGPRRAPRAHLPPPHAGWPEPADLGEEEPMLGLGEEEEEEETGAAE GAGEEAEEAGAEEACTKAVGADGKAAGTPGPTGQHDGRRRIQREGLMRVYVAQLVARA AFEVAFLVGQYLLYGFEVRPFFPCSRQPCPQVVDCFVSRPTEKTVFLLVMYVVSCLCL LLNLCEMAHLGLGSAQDAVRGRRGPPASAPAPAPRPPPCAFPAAAAGLACPPDYSLVV RAAERARAHDQNLANLALQALRDGAAAGDRDRDSSPCVGLPAASRGPPRAGAPASRTG SATSAGTVGEQGRPGTHERPGAKPRAGSEKGSASSRDGKTTVWI" polyA_signal 3451..3456 /gene="Cx46.6" BASE COUNT 551 a 1082 c 1257 g 638 t ORIGIN 1 gcatgcacgt gtacatgtgt atgcatgtgt gtgcatgggt atctgcatat gcatgtgcgt 61 gttcatgcat gtgcacgtgt gtgcatgtgt acacatgtga atctgtgggt atgtatctga 121 gtgtgtgtgc acatgtgaat gtgtgtggct ctgaggggta ctgcacagat gtgaggaagc 181 agggccactt ccccaggaac cctgactgcc ccctcttcct gccccgggtg gggaggcctc 241 agctgcataa agaggccggg agcctcacca gcgactgctg ctggtcccac acctgccgct 301 gctgcctccc aggccaggta ccgggagggg gccagaggcg gagggagcta aggggtctcc 361 tgcctcagcg acccaggagc aggtactggc cctggggcaa ccgccagcag agggtgggca 421 ggggagctgc aggagctctc cttctttgga gcacaggccc tgctgcacag ccctttcctg 481 ggcacttgcc caccttgggc ttggctggtc tgcggcatag ctgtctctga gggtcgcagg 541 tgctgagtgt ggcctcacat cactgggtct ataacctcgc tggacaccgt ccctcctgga 601 cggacgactg gcttcatcct gaccccagct agagatggtc tgggttgaga ccatggagga 661 ccaggaacct agactgggcg ggcggagccc tgggaccctg ggcacctgag aagggcagcg 721 ggaccagccg ggggctggag ggaggatgga ggattttgtg gaggtggagg gaccccgacg 781 cccctgtcca gggtgtggag ggaccccgac gcccctgtcc agggtgtgga gggaggcaca 841 tcatggcctc tgggggccga ggggctatgg ggattgtgga ctggtggcac tttggggtct 901 gggacctgat ggtggtgcac gcacgggggg ctgtgaagga ctgatggctg atggggctgc 961 tcgggcttgg gggctaggga gctggagggg ccacgaggga ttgagtggct acgcaggctg 1021 gggagaccct ttggatgctg aaactctgtg ggaccctccc caacaacctg gatgcggctg 1081 gggctgggtt ggtggccatg ggtgcactgg acactgatac caccagtccc cacacacttg 1141 agtggtgctt gcctcagtgt ccccatctgc ctcatgaagg caactcaccc acctggggcc 1201 ctgcatctgc accaccatgg gccggatcac gtgggggctt ccacttccgt ttaaggcggt 1261 aagctccacg tcattgactg tgtaagcaga gaggggccag ctgccatgca agcctggagc 1321 cccggctctg agcgccgcgg gctcctaagt gcaggcccct ggctgacccc taccccgccc 1381 cacaggaccc gcccgcccgc ccctatgacc aacatgagct ggagcttcct gacgcggctg 1441 ctggaggaga tccacaacca ctccaccttc gtgggcaagg tgtggctcac ggtgctggtg 1501 gtcttccgca tcgtgctgac ggctgtgggc ggcgaggcca tctactcgga cgagcaggcc 1561 aagttcactt gcaacacgcg gcagccaggc tgcgacaacg tctgctatga cgccttcgcg 1621 cccctgtcgc acgtgcgctt ctgggtcttc catattgtgg tcatctccac tccctcggtc 1681 atgtacctgg gctacgccgt gcaccgcctg gcccgtgcgt ctgagcagga gcggcgccgc 1741 gccctccgcc gccgcccggg gccacgccgc gcgccccgag cgcacctgcc gcccccgcac 1801 gccggctggc ctgagcccgc cgacctgggc gaggaggagc ccatgctggg cctgggcgag 1861 gaggaggagg aggaggagac gggggcagcc gagggcgccg gcgaggaagc ggaggaggca 1921 ggcgcggagg aggcgtgcac taaggcggtc ggcgctgacg gcaaggcggc agggaccccg 1981 ggcccgaccg ggcaacacga tgggcggagg cgcatccagc gggagggcct gatgcgcgtg 2041 tacgtggccc agctggtggc cagggcagct ttcgaggtgg ccttcctggt gggccagtac 2101 ctgctgtacg gcttcgaggt gcgaccgttt tttccctgca gccgccagcc ctgcccgcaa 2161 gtggtggact gcttcgtgtc gcgccctact gaaaagacgg ttttcctgct ggttatgtac 2221 gtggtcagct gcctgtgcct gctgctcaac ctctgtgaga tggcccacct gggcttgggc 2281 agcgcgcagg acgcggtgcg cggccgccgc ggccccccgg cctccgcccc cgcccccgcg 2341 ccgcggcccc cgccctgcgc cttccctgcg gcggccgctg gcttggcctg cccgcccgac 2401 tacagcctgg tggtgcgggc ggccgagcgc gctcgggcgc atgaccagaa cctggcaaac 2461 ctggccctgc aggcgctgcg cgacggggca gcggctgggg accgcgaccg ggacagttcg 2521 ccgtgcgtcg gcctccctgc ggcctcccgg gggcccccca gagcaggcgc ccccgcgtcc 2581 cggacgggca gtgctacctc tgcgggcact gtcggggagc agggccggcc cggcacccac 2641 gagcggccag gagccaagcc cagggctggc tccgagaagg gcagtgccag cagcagggac 2701 gggaagacca ccgtgtggat ctgagggcgc tggcttgcga gctgggccag ggaagaggag 2761 ggttgggggg ctccggtgga aacctgcgac cccttctcct cagccttctc cttagccggt 2821 ggcctcaggc agactctgcc cagaggggca gccaggctgc tcagggaagg ggctgaaagc 2881 ggcagaggag tgccctggct tggtcaccac tggggccaag gtggggtgga gagaggccta 2941 cgagccagaa agggccctct gctgtggtct gaaccccagg gggagtgggg cattgactcc 3001 acccctgtcc tgagctggaa taggtcctct gggatgccag ctctcccctt tgtgcttccc 3061 tgcagcaacc catggagggc ccagggtgcc tggtatgggc atcagttggt gggggtgcgg 3121 gggtgcgtgt ccccattccc tgcaacagca aatggggctc cttcttcagc cctccccttc 3181 ccagccccaa actgagacag actgggagct gggagcctgg ggtggacagg accataccct 3241 ctttgagctt ctgcgatgcc ggccttccgt tcctctggga ggcttgaagt tctgcagaga 3301 tgttgatatg ccttgcagct tggacccaat gggtggtggt cagggcctgg gggcttggcc 3361 atgctggggg aatggggctc tgggttcctg cctgtggcct gtctgtcctc ctccctaatt 3421 cagacccagc ctcaagagga aagggagtaa aataaaacta acttgtttat aaccttgtgt 3481 gtgcatgtgt atgcatgtgc acgtgtggct atgtgtgtga ctgcatgc // LOCUS AF014837 1984 bp DNA PRI 02-OCT-1997 DEFINITION Homo sapiens m6A methyltransferase (MT-A70) gene, complete cds. ACCESSION AF014837 NID g2460036 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1984) AUTHORS Bokar,J.A., Shambaugh,M.E., Polayes,D., Matera,A.G. and Rottman,F.M. TITLE Purification and cDNA cloning of the AdoMet-binding subunit of the human mRNA (N6-adenosine-)-methyltransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1984) AUTHORS Bokar,J.A., Shambaugh,M.E. and Rottman,F.M. TITLE Direct Submission JOURNAL Submitted (18-JUL-1997) Molecular Biology, Case Western Res. University, 10900 Euclid Ave., Cleveland, OH 44106, USA FEATURES Location/Qualifiers source 1..1984 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 119..1858 /gene="MT-A70" CDS 119..1858 /gene="MT-A70" /note="AdoMet-binding subunit; (N6-adenosine)-methyltransferase" /codon_start=1 /product="m6A methyltransferase" /db_xref="PID:g2460037" /translation="MSDTWSSIQAHKKQLDSLRERLQRRRKQDSGHLDLRNPEAALSP TFRSDSPVPTAPTSGGPKPSTASAVPELATDPELEKKLLHHLSDLALTLPTDAVSICL AISTPDAPATQDGVESLLQKFAAQELIEVKRGLLQDDAHPTLVTYADHSKLSAMMGAV AEKKGPGEVAGTVTGQKRRAEQDSTTVAAFASSLVSGLNSSASEPAKEPAKKSRKHAA SDVDLEIESLLNQQSTKEQQSKKVSQEILELLITTTAKEQSIVEIRSRGRAQVQEFCD YGTKEECMKASDADRPCRKLHFRRIINKHTDESLGDCSFLNTCFHMDTCKYVHYEIDA CMDSEAPGSKDHTPSQELALTQSVGGDSSADRLFPPQWICCDIRYLVVSILGKFAVVM ADPPWDIHMELPYGTLTDDEMRRLNIPVLQDDGFLFLWVTGRAMELGRECLNLWGYER VDEIIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNPQGFNQGLDCDVIVAEVRS TSHKPDEIYGMIERLSPGTRKIELFGRPHNVQPNWITLGNQLDGIHLLDPDVVARFKQ RYPDGIISKPKNL" BASE COUNT 531 a 477 c 509 g 467 t ORIGIN 1 cattttccgg ttagccttcg gggtgtccgc gtgagaattg gctatatcct ggagcgagtg 61 ctgggaggtg ctagtccgcc gcgccttatt cgagaggtgt cagggctggg agactaggat 121 gtcggacacg tggagctcta tccaggccca caagaagcag ctggactctc tgcgggagag 181 gctgcagcgg aggcggaagc aggactcggg gcacttggat ctacggaatc cagaggcagc 241 attgtctcca accttccgta gtgacagccc agtgcctact gcacccacct ctggtggccc 301 taagcccagc acagcttcag cagttcctga attagctaca gatcctgagt tagagaagaa 361 gttgctacac cacctctctg atctggcctt aacattgccc actgatgctg tgtccatctg 421 tcttgccatc tccacgccag atgctcctgc cactcaagat ggggtagaaa gcctcctgca 481 gaagtttgca gctcaggagt tgattgaggt aaagcgaggt ctcctacaag atgatgcaca 541 tcctactctt gtaacctatg ctgaccattc caagctctct gccatgatgg gtgctgtggc 601 agaaaagaag ggccctgggg aggtagcagg gactgtcaca gggcagaagc ggcgtgcaga 661 acaggactcg actacagtag ctgcctttgc cagttcgtta gtctctggtc tgaactcttc 721 agcatcggaa ccagcaaagg agccagccaa gaaatcaagg aaacatgctg cctcagatgt 781 tgatctggag atagagagcc ttctgaacca acagtccact aaggaacaac agagcaagaa 841 ggtcagtcag gagatcctag agctattaat tactacaaca gccaaggaac aatccattgt 901 tgaaattcgc tctcgaggtc gggcccaagt gcaagaattc tgtgactatg gaaccaagga 961 ggagtgcatg aaagccagtg atgctgatcg accctgtcgc aagctgcact tcagacgaat 1021 tatcaataaa cacactgatg agtctttagg tgactgctct ttccttaata catgtttcca 1081 catggatacc tgcaagtatg ttcactatga aattgatgct tgcatggatt ctgaggcccc 1141 tggcagcaaa gaccacacgc caagccagga gcttgctctt acacagagtg tcggaggtga 1201 ttccagtgca gaccgactct tcccacctca gtggatctgt tgtgatatcc gctacctggt 1261 cgtcagtatc ttgggcaagt ttgcagttgt gatggctgac ccaccctggg atattcacat 1321 ggaactgccc tatgggaccc tgacagatga tgagatgcgc aggctcaaca tacccgtact 1381 acaggatgat ggctttctct tcctctgggt cacaggcagg gccatggagt tggggagaga 1441 atgtctaaac ctctgggggt atgaacgggt agatgaaatt atttgggtga agacaaatca 1501 actgcaacgc atcattcgga caggccgtac aggtcactgg ttgaaccatg ggaaggaaca 1561 ctgcttggtt ggtgtcaaag gaaatcccca aggcttcaac cagggtctgg attgtgatgt 1621 gatcgtagct gaggttcgtt ccaccagtca taaaccagat gaaatctatg gcatgattga 1681 aagactatct cctggcactc gcaagattga gttatttgga cgaccacaca atgtgcaacc 1741 caactggatc acccttggaa accaactgga tgggatccac ctactagacc cagatgtggt 1801 tgcacggttc aagcaaaggt acccagatgg tatcatctct aaacctaaga atttatagaa 1861 gcacttcctt acagagctaa gaatccatag ccatggctct gtaagctaaa cctgaagagt 1921 gatatttgta caatagcttt cttctttatt taaataaaca tttgtattgt aaaaaaaaaa 1981 aaaa // LOCUS AF022953 1242 bp DNA PRI 30-OCT-1997 DEFINITION Homo sapiens beta2-adrenergic receptor (ADRB2) gene, complete cds. ACCESSION AF022953 NID g2570526 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1242) AUTHORS Reihsaus,E., Innis,M., MacIntyre,N. and Liggett,S.B. TITLE Mutations in the gene encoding for the beta 2-adrenergic receptor in normal and asthmatic subjects JOURNAL Am. J. Respir. Cell Mol. Biol. 8 (3), 334-339 (1993) MEDLINE 93192047 REFERENCE 2 (bases 1 to 1242) AUTHORS Green,S.A., Cole,G., Jacinto,M., Innis,M. and Liggett,S.B. TITLE A polymorphism of the human beta 2-adrenergic receptor within the fourth transmembrane domain alters ligand binding and functional properties of the receptor JOURNAL J. Biol. Chem. 268 (31), 23116-23121 (1993) MEDLINE 94043092 REFERENCE 3 (bases 1 to 1242) AUTHORS Green,S.A., Turki,J., Innis,M. and Liggett,S.B. TITLE Amino-terminal polymorphisms of the human beta 2-adrenergic receptor impart distinct agonist-promoted regulatory properties JOURNAL Biochemistry 33 (32), 9414-9419 (1994) MEDLINE 94347707 REMARK Erratum:[[published erratum appears in Biochemistry 1994 Nov 29;33(47):14368]] REFERENCE 4 (bases 1 to 1242) AUTHORS Liggett,S.B. and Green,S.A. TITLE Direct Submission JOURNAL Submitted (04-SEP-1997) Medicine, Univ of Cincinnati, 231 Bethesda Ave ML670564, Cincinnati, OH 45267-0564, USA FEATURES Location/Qualifiers source 1..1242 /organism="Homo sapiens" gene 1..1242 /gene="ADRB2" CDS 1..1242 /gene="ADRB2" /codon_start=1 /product="beta2-adrenergic receptor" /db_xref="PID:g2570527" /translation="MGQPGNGSAFLLAPNGSHAPDHDVTQQRDEVWVVGMGIVMSLIV LAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGLAVVPFGAAHILMKMWTFG NFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQSLLTKNKARVIILMVWIV SGLTSFLPIQMHWYRATHQEAINCYANETCCDFFTNQAYAIASSIVSFYVPLVIMVFV YSRVFQEAKRQLQKIDKSEGRFHVQNLSQVEQDGRTGHGLRRSSKFCLKEHKALKTLG IIMGTFTLCWLPFFIVNIVHVIQDNLIRKEVYILLNWIGYVNSGFNPLIYCRSPDFRI AFQELLCLRRSSLKAYGNGYSSNGNTGEQSGYHVEQEKENKLLCEDLPGTEDFVGHQG TVPSDNIDSQGRNCSTNDSLL" variation 46 /gene="ADRB2" /note="Arg16 to Gly polymorphism" /replace="a" BASE COUNT 275 a 331 c 326 g 310 t ORIGIN 1 atggggcaac ccgggaacgg cagcgccttc ttgctggcac ccaatggaag ccatgcgccg 61 gaccacgacg tcacgcagca aagggacgag gtgtgggtgg tgggcatggg catcgtcatg 121 tctctcatcg tcctggccat cgtgtttggc aatgtgctgg tcatcacagc cattgccaag 181 ttcgagcgtc tgcagacggt caccaactac ttcatcactt cactggcctg tgctgatctg 241 gtcatgggcc tggcagtggt gccctttggg gccgcccata ttcttatgaa aatgtggact 301 tttggcaact tctggtgcga gttttggact tccattgatg tgctgtgcgt cacggccagc 361 attgagaccc tgtgcgtgat cgcagtggat cgctactttg ccattacttc acctttcaag 421 taccagagcc tgctgaccaa gaataaggcc cgggtgatca ttctgatggt gtggattgtg 481 tcaggcctta cctccttctt gcccattcag atgcactggt accgggccac ccaccaggaa 541 gccatcaact gctatgccaa tgagacctgc tgtgacttct tcacgaacca agcctatgcc 601 attgcctctt ccatcgtgtc cttctacgtt cccctggtga tcatggtctt cgtctactcc 661 agggtctttc aggaggccaa aaggcagctc cagaagattg acaaatctga gggccgcttc 721 catgtccaga accttagcca ggtggagcag gatgggcgga cggggcatgg actccgcaga 781 tcttccaagt tctgcttgaa ggagcacaaa gccctcaaga cgttaggcat catcatgggc 841 actttcaccc tctgctggct gcccttcttc atcgttaaca ttgtgcatgt gatccaggat 901 aacctcatcc gtaaggaagt ttacatcctc ctaaattgga taggctatgt caattctggt 961 ttcaatcccc ttatctactg ccggagccca gatttcagga ttgccttcca ggagcttctg 1021 tgcctgcgca ggtcttcttt gaaggcctat gggaatggct actccagcaa cggcaacaca 1081 ggggagcaga gtggatatca cgtggaacag gagaaagaaa ataaactgct gtgtgaagac 1141 ctcccaggca cggaagactt tgtgggccat caaggtactg tgcctagcga taacattgat 1201 tcacaaggga ggaattgtag tacaaatgac tcactgctgt aa // LOCUS AF024687 923 bp DNA PRI 21-NOV-1997 DEFINITION Homo sapiens putative G protein-coupled receptor (GPR40) gene, complete cds. ACCESSION AF024687 NID g2612945 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 923) AUTHORS Sawzdargo,M., George,S.R., Nguyen,T., Xu,S., Kolakowski,L.F. and O'Dowd,B.F. TITLE A cluster of four novel human G protein-coupled receptor genes occurring in close proximity to CD22 gene on chromosome 19q13.1 JOURNAL Biochem. Biophys. Res. Commun. 239 (2), 543-547 (1997) MEDLINE 98008875 REFERENCE 2 (bases 1 to 923) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (15-SEP-1997) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..923 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" gene 11..913 /gene="GPR40" CDS 11..913 /gene="GPR40" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g2612946" /translation="MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTPSLVY ALNLGCSDLLLTVSLPLKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSA GRYLGAAFPLGYQAFRRPCYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGI NTPVNGSPVCLEAWDPASAGPARFSLSLLLFFLPLAITAFCYVGCLRALARSGLTHRR KLRAAWVAGGALLTLLLCVGPYNASNVASFLYPNLGGSWRKLGLITGAWSVVLNPLVT GYLGRGPGLKTVCAARTQGGKSQK" BASE COUNT 109 a 338 c 289 g 187 t ORIGIN 1 cggcggcccc atggacctgc ccccgcagct ctccttcggc ctctatgtgg ccgcctttgc 61 gctgggcttc ccgctcaacg tcctggccat ccgaggcgcg acggcccacg cccggctccg 121 tctcacccct agcctggtct acgccctgaa cctgggctgc tccgacctgc tgctgacagt 181 ctctctgccc ctgaaggcgg tggaggcgct agcctccggg gcctggcctc tgccggcctc 241 gctgtgcccc gtcttcgcgg tggcccactt cttcccactc tatgccggcg ggggcttcct 301 ggccgccctg agtgcaggcc gctacctggg agcagccttc cccttgggct accaagcctt 361 ccggaggccg tgctattcct ggggggtgtg cgcggccatc tgggccctcg tcctgtgtca 421 cctgggtctg gtctttgggt tggaggctcc aggaggctgg ctggaccaca gcaacacctc 481 cctgggcatc aacacaccgg tcaacggctc tccggtctgc ctggaggcct gggacccggc 541 ctctgccggc ccggcccgct tcagcctctc tctcctgctc ttttttctgc ccttggccat 601 cacagccttc tgctacgtgg gctgcctccg ggcactggcc cgctccggcc tgacgcacag 661 gcggaagctg cgggccgcct gggtggccgg cggggccctc ctcacgctgc tgctctgcgt 721 aggaccctac aacgcctcca acgtggccag cttcctgtac cccaatctag gaggctcctg 781 gcggaagctg gggctcatca cgggtgcctg gagtgtggtg cttaatccgc tggtgaccgg 841 ttacttggga aggggtcctg gcctgaagac agtgtgtgcg gcaagaacgc aagggggcaa 901 gtcccagaag taacgccact gct // LOCUS AF024688 1061 bp DNA PRI 21-NOV-1997 DEFINITION Homo sapiens putative G protein-coupled receptor (GPR41) gene, complete cds. ACCESSION AF024688 NID g2612947 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1061) AUTHORS Sawzdargo,M., George,S.R., Nguyen,T., Xu,S., Kolakowski,L.F. and O'Dowd,B.F. TITLE A cluster of four novel human G protein-coupled receptor genes occurring in close proximity to CD22 gene on chromosome 19q13.1 JOURNAL Biochem. Biophys. Res. Commun. 239 (2), 543-547 (1997) MEDLINE 98008875 REFERENCE 2 (bases 1 to 1061) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (15-SEP-1997) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1061 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" gene 11..1051 /gene="GPR41" CDS 11..1051 /gene="GPR41" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g2612948" /translation="MDTGPDQSYFSGNHWFVFSVYLLTFLVGLPLNLLALVVFVGKLQ RRPVAVDVLLLNLTASDLLLLLFLPFRMVEAANGMHWPLPFILCPLSGFIFFTTIYLT ALFLAAVSIERFLSVAHPLWYKTRPRLGQAGLVSVACWLLASAHCSVVYVIEFSGDIS HSQGTNGTCYLEFRKDQLAILLPVRLEMAVVLFVVPLIITSYCYSRLVWILGRGGSHR RQRRVAGLLAATLLNFLVCFGPYNVSHVVGYICGESPAWRIYVTLLSTLNSCVDPFVY YFSSSGFQADFHELLRRLCGLWGQWQQESSMELKEQKGGEEQRADRPAERKTSEHSQG CGTGGQVACAES" BASE COUNT 172 a 334 c 321 g 234 t ORIGIN 1 ggccaccacc atggatacag gccccgacca gtcctacttc tccggcaatc actggttcgt 61 cttctcggtg taccttctca ctttcctggt ggggctcccc ctcaacctgc tggccctggt 121 ggtcttcgtg ggcaagctgc agcgccgccc ggtggccgtg gacgtgctcc tgctcaacct 181 gaccgcctcg gacctgctcc tgctgctgtt cctgcctttc cgcatggtgg aggcagccaa 241 tggcatgcac tggcccctgc ccttcatcct ctgcccactc tctggattca tcttcttcac 301 caccatctat ctcaccgccc tcttcctggc agctgtgagc attgaacgct tcctgagtgt 361 ggcccaccca ctgtggtaca agacccggcc gaggctgggg caggcaggtc tggtgagtgt 421 ggcctgctgg ctgttggcct ctgctcactg cagcgtggtc tacgtcatag aattctcagg 481 ggacatctcc cacagccagg gcaccaatgg gacctgctac ctggagttcc ggaaggacca 541 gctagccatc ctcctgcccg tgcggctgga gatggctgtg gtcctctttg tggtcccgct 601 gatcatcacc agctactgct acagccgcct ggtgtggatc ctcggcagag ggggcagcca 661 ccgccggcag aggagggtgg cggggctgtt ggcggccacg ctgctcaact tccttgtctg 721 ctttgggccc tacaacgtgt cccatgtcgt gggctatatc tgcggtgaaa gcccggcatg 781 gaggatctac gtgacgcttc tcagcaccct gaactcctgt gtcgacccct ttgtctacta 841 cttctcctcc tccgggttcc aagccgactt tcatgagctg ctgaggaggt tgtgtgggct 901 ctggggccag tggcagcagg agagcagcat ggagctgaag gagcagaagg gaggggagga 961 gcagagagcg gaccgaccag ctgaaagaaa gaccagtgaa cactcacagg gctgtggaac 1021 tggtggccag gtggcctgtg ctgaaagcta ggtcctccgg g // LOCUS AF024690 1013 bp DNA PRI 21-NOV-1997 DEFINITION Homo sapiens putative G protein-coupled receptor (GPR43) gene, complete cds. ACCESSION AF024690 NID g2612951 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1013) AUTHORS Sawzdargo,M., George,S.R., Nguyen,T., Xu,S., Kolakowski,L.F. and O'Dowd,B.F. TITLE A cluster of four novel human G protein-coupled receptor genes occurring in close proximity to CD22 gene on chromosome 19q13.1 JOURNAL Biochem. Biophys. Res. Commun. 239 (2), 543-547 (1997) MEDLINE 98008875 REFERENCE 2 (bases 1 to 1013) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (15-SEP-1997) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1013 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" gene 11..1003 /gene="GPR43" CDS 11..1003 /gene="GPR43" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g2612952" /translation="MLPDWKSSLILMAYIIIFLTGLPANLLALRAFVGRIRQPQPAPV HILLLSLTLADLLLLLLLPFKIIEAASNFRWYLPKVVCALTSFGFYSSIYCSTWLLAG ISIERYLGVAFPVQYKLSRRPLYGVIAALVAWVMSFGHCTIVIIVQYLNTTEQVRSGN EITCYENFTDNQLDVVLPVRLELCLVLFFIPMAVTIFCYWRFVWIMLSQPLVGAQRRR RAVGLAVVTLLNFLVCFGPYNVSHLVGYHQRKSPWWRSIAVVFSSLNASLDPLLFYFS SSVVRRAFGRGLQVLRNQGSSLLGRRGKDTAEGTNEDRGVGQGEGMPSSDFTTE" BASE COUNT 173 a 307 c 292 g 241 t ORIGIN 1 cccctccagg atgctgccgg actggaagag ctccttgatc ctcatggctt acatcatcat 61 cttcctcact ggcctccctg ccaacctcct ggccctgcgg gcctttgtgg ggcggatccg 121 ccagccccag cctgcacctg tgcacatcct cctgctgagc ctgacgctgg ccgacctcct 181 cctgctgctg ctgctgccct tcaagatcat cgaggctgcg tcgaacttcc gctggtacct 241 gcccaaggtc gtctgcgccc tcacgagttt tggcttctac agcagcatct actgcagcac 301 gtggctcctg gcgggcatca gcatcgagcg ctacctggga gtggctttcc ccgtgcagta 361 caagctctcc cgccggcctc tgtatggagt gattgcagct ctggtggcct gggttatgtc 421 ctttggtcac tgcaccatcg tgatcatcgt tcaatacttg aacacgactg agcaggtcag 481 aagtggcaat gaaattacct gctacgagaa cttcaccgat aaccagttgg acgtggtgct 541 gcccgtgcgg ctggagctgt gcctggtgct cttcttcatc cccatggcag tcaccatctt 601 ctgctactgg cgttttgtgt ggatcatgct ctcccagccc cttgtggggg cccagaggcg 661 gcgccgagcc gtggggctgg ctgtggtgac gctgctcaat ttcctggtgt gcttcggacc 721 ttacaacgtg tcccacctgg tggggtatca ccagagaaaa agcccctggt ggcggtcaat 781 agccgtggtg ttcagttcac tcaacgccag tctggacccc ctgctcttct atttctcttc 841 ttcagtggtg cgcagggcat ttgggagagg gctgcaggtg ctgcggaatc agggctcctc 901 cctgttggga cgcagaggca aagacacagc agaggggaca aatgaggaca ggggtgtggg 961 tcaaggagaa gggatgccaa gttcggactt cactacagag tagcagtttc cct // LOCUS AF024711 1445 bp DNA PRI 09-DEC-1997 DEFINITION Homo sapiens cone rod homeobox protein (CRX) gene, complete cds. ACCESSION AF024711 NID g2665533 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1445) AUTHORS Freund,C.L., Gregory-Evans,C.Y., Furukawa,T., Papaioannou,M., Looser,J., Ploder,L., Bellingham,J., Ng,D., Herbrick,J.S., Duncan,A., Scherer,S.W., Tsui,L.-C., Loutradis-Anagnostou,A., Jacobson,S.G., Cepko,C.L., Bhattacharya,S.S. and McInnes,R.R. TITLE Cone-rod dystrophy due to mutations in a novel photoreceptor-specific homeobox gene (CRX) essential for maintenance of the photoreceptor JOURNAL Cell 91 (4), 543-553 (1997) MEDLINE 98050929 REFERENCE 2 (bases 1 to 1445) AUTHORS Freund,C.L., Looser,J., Ploder,L., Ng,D. and McInnes,R.R. TITLE Direct Submission JOURNAL Submitted (12-SEP-1997) Genetics, The Hospital for Sick Children, 555 University Ave., Toronto, ON M5G 1X8, Canada FEATURES Location/Qualifiers source 1..1445 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" gene 36..935 /note="homeobox gene; responsible for autosomal dominant cone-rod dystrophy human disease" /gene="CRX" CDS 36..935 /gene="CRX" /note="homeodomain protein" /codon_start=1 /product="cone rod homeobox protein" /db_xref="PID:g2665534" /translation="MMAYMNPGPHYSVNALALSGPSVDLMHQAVPYPSAPRKQRRERT TFTRSQLEELEALFAKTQYPDVYAREEVALKINLPESRVQVWFKNRRAKCRQQRQQQK QQQQPPGGQAKARPAKRKAGTSPRPSTDVCPDPLGISDSYSPPLPGPSGSPTTAVATV SIWSPASESPLPEAQRAGLVASGPSLTSAPYAMTYAPASAFCSSPSAYGSPSSYFSGL DPYLSPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGAYSPVDSLEFKDPTGTW KFTYNPMDPLDYKDQSAWKFQIL" BASE COUNT 300 a 477 c 365 g 303 t ORIGIN 1 gccccctgac ttgggcctca gtgtccccga agatcatgat ggcgtatatg aacccggggc 61 cccactattc tgtcaacgcc ttggccctaa gtggccccag tgtggatctg atgcaccagg 121 ctgtgcccta cccaagcgcc cccaggaagc agcggcggga gcgcaccacc ttcacccgga 181 gccaactgga ggagctggag gcactgtttg ccaagaccca gtacccagac gtctatgccc 241 gtgaggaggt ggctctgaag atcaatctgc ctgagtccag ggttcaggtt tggttcaaga 301 accggagggc taaatgcagg cagcagcgac agcagcagaa acagcagcag cagcccccag 361 ggggccaggc caaggcccgg cctgccaaga ggaaggcggg cacgtcccca agaccctcca 421 cagatgtgtg tccagaccct ctgggcatct cagattccta cagtccccct ctgcccggcc 481 cctcaggctc cccaaccacg gcagtggcca ctgtgtccat ctggagccca gcctcagagt 541 cccctttgcc tgaggcgcag cgggctgggc tggtggcctc agggccgtct ctgacctccg 601 ccccctatgc catgacctac gccccggcct ccgctttctg ctcttccccc tccgcctatg 661 ggtctccgag ctcctatttc agcggcctag acccctacct ttctcccatg gtgccccagc 721 tagggggccc ggctcttagc cccctctctg gcccctccgt gggaccttcc ctggcccagt 781 cccccacctc cctatcaggc cagagctatg gcgcctacag ccccgtggat agcttggaat 841 tcaaggaccc cacgggcacc tggaaattca cctacaatcc catggaccct ctggactaca 901 aggatcagag tgcctggaag tttcagatct tgtagaggac gcagtctcca tctctctcca 961 tcgggcctcg ggaccctttc tcttctgaat ctgcttccct gcagtttaga tcccgggatg 1021 gcattcctga gaaagcaacc cgaaccagct gtccttctga cagctcggtg ttcagcttac 1081 agagaccacc cctttcctcc acagggagag gctcctccct ctcctgggac agctcacagg 1141 tcctagtgat tctctcaacc ctaacaccgt ctggcacgat tgtgaccgct gaagtacacc 1201 acgagctcca ggcttcagaa agtggtgctg agaacttgct ccaagaagaa gtcaaaccaa 1261 acttgcagtt gatttggggt catgtttagg tcagaatcac cgtgcccttg aacaagcagg 1321 taggggggct tgataactta actttccacg tggacagaat tttttttttt gttttgtttt 1381 tgttctgcac tccagcctgg gcaacaagag tgaaactctg tctcaaaaaa aaaaaaaaaa 1441 aaacg // LOCUS AF026564 1581 bp DNA PRI 07-OCT-1997 DEFINITION Homo sapiens RNA binding protein II (RBMII) gene, complete cds. ACCESSION AF026564 NID g2465929 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1581) AUTHORS Chai,N.N., Zhou,H., Hernandez,J., Najmabadi,H., Bhasin,S. and Yen,P.H. TITLE Structure and organization of the RBM genes on the human Y chromosome: Transposition and amplification of an ancestral autosomal hnRNPG gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1581) AUTHORS Chai,N.N., Zhou,H., Hernandez,J., Najmabadi,H., Bhasin,S. and Yen,P.H. TITLE Direct Submission JOURNAL Submitted (25-SEP-1997) Division of Medical Genetics, E-4, Harbor-UCLA Medical Center, 1124 W. Carson St., Torrance, CA 90502, USA FEATURES Location/Qualifiers source 1..1581 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="CEPH YAC library" /clone="786C10" /sub_clone="7S2" /chromosome="Y" /map="Yq11.23" gene 119..1075 /gene="RBMII" CDS 119..1075 /gene="RBMII" /note="candidate for the Azoospermia Factor; germ-cell specific expression" /codon_start=1 /product="RNA binding protein" /db_xref="PID:g2465930" /translation="MVEADCHGKLFIGGLNREANEKVLKEVFAKHGPLLEVLLIKGRT SKSRDFVVIIFENAADAKNAARDMNGKSLDGKEIKVEQAKKPSFPSGGRRRPPPSSRN RSPSGSLRSARGSSGGTRPWLPSHEGHLDDGGYALDLNTSSSRGAIPIKRGPSSRSGG PPPKTSAPSAMARSNSWMGGQGPISRGRENYGGPPCREPISSWRNDRMSPRDDGYAIK ERNHPLSRESRDYAPLSRDYAYHDYGHSSWDEHFSRGYSDCDGCGEVMLEIILNVQVE VLIEMHFRDREPLMVHHLQECLCCLMVEAATMIIAINEIDMA" BASE COUNT 479 a 285 c 380 g 437 t ORIGIN 1 caggccagct gcagcggtct ttcctgcagt tggccctgtg gtgtcccgaa gccggatgca 61 tacgacctga gtgacgggag accctgaggc tgtttgtcct cctgaaaagc acctcacaat 121 ggtagaagca gattgtcatg gcaagctttt cattggtggc ctcaatagag aagccaatga 181 aaaggtgctt aaagaagtat ttgcaaaaca tggtcccctt ttggaagttc ttttgataaa 241 aggtcgaacc agtaagtcca gagattttgt ggtcattatt tttgagaatg ctgcagatgc 301 taagaatgct gccagagata tgaatggaaa gtctttggat ggaaaagaaa taaaagtaga 361 acaagcaaag aaaccatctt ttccaagtgg tggtaggcgg agaccaccac cttcttcaag 421 aaacagaagc ccttcaggaa gtctgagatc tgcaagagga agtagtggag gaacaagacc 481 gtggctgccc tcacatgaag gacacttgga tgatggtgga tacgctcttg atctcaacac 541 gagttcttct aggggagcca ttccaattaa aagaggtcca tcttcacgaa gtggaggtcc 601 tcctcctaaa acatctgctc cttctgctat ggcaagaagc aatagttgga tgggaggcca 661 aggtcccata tcacgtggaa gagagaatta tggaggtcct ccatgcagag agccaatctc 721 ttcctggaga aatgaccgta tgtcaccaag agatgatggt tatgcaatta aggaaagaaa 781 tcatccactt tcccgagaat ctagggatta tgctccactg tctagagact atgcatacca 841 tgattatggt cattctagtt gggatgaaca tttctctaga ggatatagtg attgtgatgg 901 ctgtggtgag gtgatgttag agatcattct gaacgtccaa gtggaagttc ttatagagat 961 gcatttcaga gatagggaac ctctcatggt gcaccatctg caggagtgcc tctgttgtct 1021 tatggtggaa gcagccacca tgattatagc aataaatgag atagatatgg cataagtcgg 1081 gagagttact caaggagctg tggtgatttt tattcccgtg attgtgggca cgttgacaga 1141 aaagaccaaa gcaatctacc ttctctggat agggtacacc ctgctccttg tgaaacatgt 1201 ggtagctcaa gatatttgtc atctacagga gatggtgggg aaggtggatc tgacaaaaga 1261 ggctgaagca gatatgaaag caagtattca aataatagtt attgcatact aaaccttgtt 1321 tgcaaatcga aaattgacct gttatttctg cattgttacc tgcgtcttac taaaagaaac 1381 atgtatgttt tgtggagaga ggtagatact aacttcctcc atgaattttt tgaggtattc 1441 aaaggaattt tatttccaat aaataaaggg aattttattt ccaagtaatt tcatactagc 1501 taatgctatt tgaaaactat ctgtttagat gtaatatcta cattaaaatt ttcagaataa 1561 aattttacat gtaatgcaaa a // LOCUS AF027956 1128 bp DNA PRI 02-JAN-1998 DEFINITION Homo sapiens G protein-coupled receptor (GPR30) gene, complete cds. ACCESSION AF027956 NID g2739106 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1128) AUTHORS O'Dowd,B.F., Nguyen,T., Marchese,A., Cheng,R., Lynch,K.R., Heng,H.H.Q., Kolakowski,J.L.F. Jr. and George,S.R. TITLE Discovery of three novel G protein-coupled receptor genes JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 1128) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (03-OCT-1997) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1128 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7p22" gene 1..1128 /gene="GPR30" CDS 1..1128 /gene="GPR30" /note="orphan G protein-coupled receptor" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g2739107" /translation="MDVTSQARGVGLEMYPGTAHAAAPNTTSPELNLSHPLLGTALAN GTGELSEHQQYVIGLFLSCLYTIFLFPIGFVGNILILVVNISFREKMTIPDLYFINLA VADLILVADSLIEVFNLHERYYDIAVLCTFMSLFLQVNMYSSVFFLTWMSFDRYIALA RAMRCSLFRTKHHARLSCGLIWMASVSATLVPFTAVHLQHTDEACFCFADVREVQWLE VTLGFIVPFAIIGLCYSLIVRVLVRAHRHRGLRPRRQKALRMILAVVLVFFVCWLPEN VFISVHLLQRTQPGAAPCKQSFRHAHPLTGHIVNLAAFSNSCLNPLIYSFLGETFRDK LRLYIEQKTNLPALNRFCHAALKAVIPDSTEQSDVRFSSAV" BASE COUNT 184 a 403 c 306 g 235 t ORIGIN 1 atggatgtga cttcccaagc ccggggcgtg ggcctggaga tgtacccagg caccgcgcac 61 gctgcggccc ccaacaccac ctcccccgag ctcaacctgt cccacccgct cctgggcacc 121 gccctggcca atgggacagg tgagctctcg gagcaccagc agtacgtgat cggcctgttc 181 ctctcgtgcc tctacaccat cttcctcttc cccatcggct ttgtgggcaa catcctgatc 241 ctggtggtga acatcagctt ccgcgagaag atgaccatcc ccgacctgta cttcatcaac 301 ctggcggtgg cggacctcat cctggtggcc gactccctca ttgaggtgtt caacctgcac 361 gagcggtact acgacatcgc cgtcctgtgc accttcatgt cgctcttcct gcaggtcaac 421 atgtacagca gcgtcttctt cctcacctgg atgagcttcg accgctacat cgccctggcc 481 agggccatgc gctgcagcct gttccgcacc aagcaccacg cccggctgag ctgtggcctc 541 atctggatgg catccgtgtc agccacgctg gtgcccttca ccgccgtgca cctgcagcac 601 accgacgagg cctgcttctg tttcgcggat gtccgggagg tgcagtggct cgaggtcacg 661 ctgggcttca tcgtgccctt cgccatcatc ggcctgtgct actccctcat tgtccgggtg 721 ctggtcaggg cgcaccggca ccgtgggctg cggccccggc ggcagaaggc gctccgcatg 781 atcctcgcgg tggtgctggt cttcttcgtc tgctggctgc cggagaacgt cttcatcagc 841 gtgcacctcc tgcagcggac gcagcctggg gccgctccct gcaagcagtc tttccgccat 901 gcccaccccc tcacgggcca cattgtcaac ctcgccgcct tctccaacag ctgcctaaac 961 cccctcatct acagctttct cggggagacc ttcagggaca agctgaggct gtacattgag 1021 cagaaaacaa atttgccggc cctgaaccgc ttctgtcacg ctgccctgaa ggccgtcatt 1081 ccagacagca ccgagcagtc ggatgtgagg ttcagcagtg ccgtgtga // LOCUS AF027957 1299 bp DNA PRI 02-JAN-1998 DEFINITION Homo sapiens G protein-coupled receptor (GPR35) gene, complete cds. ACCESSION AF027957 NID g2739108 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1299) AUTHORS O'Dowd,B.F., Nguyen,T., Marchese,A., Cheng,R., Lynch,K.R., Heng,H.H.Q., Kolakowski,J.L.F. Jr. and George,S.R. TITLE Discovery of three novel G protein-coupled receptor genes JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 1299) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (03-OCT-1997) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1299 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q37.3" gene 214..1143 /gene="GPR35" CDS 214..1143 /gene="GPR35" /note="orphan G protein-coupled receptor" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g2739109" /translation="MNGTYNTCGSSDLTWPPAIKLGFYAYLGVLLVLGLLLNSLALWV FCCRMQQWTETRIYMTNLAVADLCLLCTLPFVLHSLRDTSDTPLCQLSQGIYLTNRYM SISLVTAIAVDRYVAVRHPLRARGLRSPRQAAAVCAVLWVLVIGSLVARWLLGIQEGG FCFRSTRHNFNSMRFPLLGFYLPLAVVVFCSLKVVTALAQRPPTDVGQAEATRKAARM VWANLLVFVVCFLPLHVGLTVRLAVGWNACALLETIRRALYITSKLSDANCCLDAICY YYMAKEFQEASALAVAPRAKAHKSQDSLCVTLA" BASE COUNT 189 a 429 c 412 g 269 t ORIGIN 1 tgggaagagg atctgtccag gggttagacc ttcaagggtg acttggagtt ctttacggca 61 cccatgcttt cttgaggagt tttgtgtttg tgggtgtggg gtcggggctc acctcctccc 121 acatcctgcc cagaggtggg cagagtgggg gcagtgcctt gctccccctg ctcgctctct 181 gctgactccg gctccctgtg ctgccccagg accatgaatg gcacctacaa cacctgtggc 241 tccagcgacc tcacctggcc cccagcgatc aagctgggct tctacgccta cttgggcgtc 301 ctgctggtgc taggcctgct gctcaacagc ctggcgctct gggtgttctg ctgccgcatg 361 cagcagtgga cggagacccg catctacatg accaacctgg cggtggccga cctctgcctg 421 ctgtgcacct tgcccttcgt gctgcactcc ctgcgagaca cctcagacac gccgctgtgc 481 cagctctccc agggcatcta cctgaccaac aggtacatga gcatcagcct ggtcacggcc 541 atcgccgtgg accgctatgt ggccgtgcgg cacccgctgc gtgcccgcgg gctgcggtcc 601 cccaggcagg ctgcggccgt gtgcgcggtc ctctgggtgc tggtcatcgg ctccctggtg 661 gctcgctggc tcctggggat tcaggagggc ggcttctgct tcaggagcac ccggcacaat 721 ttcaactcca tgcggttccc gctgctggga ttctacctgc ccctggccgt ggtggtcttc 781 tgctccctga aggtggtgac tgccctggcc cagaggccac ccaccgacgt ggggcaggca 841 gaggccaccc gcaaggctgc ccgcatggtc tgggccaacc tcctggtgtt cgtggtctgc 901 ttcctgcccc tgcacgtggg gctgacagtg cgcctcgcag tgggctggaa cgcctgtgcc 961 ctcctggaga cgatccgtcg cgccctgtac ataaccagca agctctcaga tgccaactgc 1021 tgcctggacg ccatctgcta ctactacatg gccaaggagt tccaggaggc gtctgcactg 1081 gccgtggctc cccgtgctaa ggcccacaaa agccaggact ctctgtgcgt gaccctcgcc 1141 taagaggcgt gctgtgggcg ctgtgggcca ggtctcgggg gctccgggag gtgctgcctg 1201 ccaggggaag ctggaaccag tagcaaggag cccgggatca gccctgaact cactgtgtat 1261 tctcttggag ccttgggtgg gcagggacgg ccaggtacc // LOCUS AF029081 10034 bp DNA PRI 20-DEC-1997 DEFINITION Homo sapiens 14-3-3 sigma protein promoter and gene, complete cds. ACCESSION AF029081 NID g2702352 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10034) AUTHORS Hermeking,H., Lengauer,C., Polyak,K., He,T.-C., Zhang,L., Thiagalingam,S., Kinzler,K.W. and Vogelstein,B. TITLE 14-3-3sigma is a p53-regulated Inhibitor of G2/M Progression JOURNAL Mol. Cell (1997) In press REFERENCE 2 (bases 1 to 10034) AUTHORS Hermeking,H. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) Johns Hopkins Oncology Center, Johns Hopkins School of Medicine, 424 North Bond Street, Baltimore, MD 21231-1001, USA FEATURES Location/Qualifiers source 1..10034 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p35" promoter 1..8567 protein_bind 6091..6110 /note="BDS-1; not p53-responsive" /bound_moiety="p53" protein_bind 6753..6775 /note="BDS-2; p53-responsive" /bound_moiety="p53" mRNA 8568..9876 /product="14-3-3 sigma protein" CDS 8638..9384 /function="cell cycle inhibition" /codon_start=1 /product="14-3-3 sigma protein" /db_xref="PID:g2702353" /translation="MERASLIQKAKLAEQAERYEDMAAFMKGAVEKGEELSCEERNLL SVAYKNVVGGQRAAWRVLSSIEQKSNEEGSEEKGPEVREYREKVETELQGVCDTVLGL LDSHLIKEAGDAESRVFYLKMKGDYYRYLAEVATGDDKKRIIDSARSAYQEAMDISKK EMPPTNPIRLGLALNFSVFHYEIANSPEEAISLAKTTFDEAMADLHTLSEDSYKDSTL IMQLLRDNLTLWTADNAGEEGGEAPQEPQS" polyA_signal 9855..9860 polyA_site 9876 BASE COUNT 2297 a 2732 c 2724 g 2281 t ORIGIN 1 ggatcccagc ctgcccctcc acttctctcc caagccaggt cccggcatgg gtgggttatg 61 ctcatgctgg caatacttga aacgggttta ttaatgctgg gtattttgca caattttata 121 gacctctttt ctacatagtc ttttttaaat ggaaggagaa aatgtcagcc acattactgt 181 ctgtgtagtg ccaggtgaag ggttatcaga aggctggttg gttttaataa gtttattcca 241 agagaccttc tggctggaat gagtgagagt gtgtgtgcat gtgtgtgtgt gttcatgtgt 301 gccctgtatg aatgtggctg gctcccagat cccctgggct gccccctgcc ccatcccctt 361 tgagtatcag aagcactctg agccaagggg acagggggca cgtgcactgg tcacgagaaa 421 accctgggct cccactgggg ctcagcccag cctcctatct ttccttcttc tatggacttc 481 agacagccag tgtctgggga ctctgccact ctacccccag ccctacccac cagcccccag 541 gtgaggcttc cagctgggac ctgcccagac aggctgagcc tgggcgtggt gggtggggtg 601 atggctctgg ggagcggctg ccatcctaca agccacaccc cctcctctga gctctgaata 661 tgggacccag tgccaggagc tggaagacaa ggtgtttctg ccaaacggga cctccatcca 721 gagaaaagga agaaggtgca gggtgggcca agaggcaagt gaaggttggc ctgagtctgg 781 gccggaaact cagaggatgt ttctcctctg ctgggagctg tagtttctta tcaaaataga 841 tattgttcca ccatccccct ccttggccct tcaagtgggc tgaagccttg gaaagtgaca 901 taggaagtcc ccagatcttg cccttctcac tccagaggct agtggtcaca gacagctggg 961 aatggcagcc acagagggtc cctctggaga aacagcttca ccccagcctc agggccctgg 1021 gcatcactgc agtggccctg ggaggtgagg aagaagctgg ctagaggagg gggctcccac 1081 ctacctttta tttaagccag tattctttgt tcctgcttgt aataaaactt cagtttataa 1141 gagttgcttt gctttggttt ggtttttgtt tgcttttcct ttgctgaggc cccaactggg 1201 agccctctgt tctttcagac aaatttggtt ctttcctggg gagactgtga gaaggcaggc 1261 agcccagtga tctggctaca ttttccctca cctggctgga gctctgtccg ctggaggaag 1321 agcagagagg gctgcggctg agcccccatg ggcacgtgaa aagaggccat cctgtcccct 1381 ctttgtcccc tccaccttcc cctgcctcag gggcttggag accccaaatt cttcttccct 1441 actgcctttc cactccgatc cccaatgagt gcccagctaa gaaaatgttt gagacagtag 1501 attccagttt gagagccgga gcttccctgg ctaccacctc caacctgggc accagggccc 1561 agccagacaa ctcataacac tggcccacct ctctggtatc tccctcagga ggacacctgt 1621 caggattttg ccatctcctg cacagcctga ggggagctaa caggcctctt tgcagagggt 1681 tagctggtaa gaccgtttct tccctgtcgg ccagcactgc ccgctcccct ccacacacca 1741 tctcatcctc atcgcatgcc tcgccaaccc catggagccc gtccatctgt ctggtgtgtg 1801 gtgcggtgtg tgtgctggtg gtggtagggt ctccagggac tccccgctaa gcagaaggat 1861 cgggatatag ggcaaggcta aaagcccagc cccattgtgg actgaggaag tacgttcgcg 1921 cagagcagct ctccagctgg aagaggaggt ggagggtgag gctggggaga ggatggcgaa 1981 cctgccctga ggtgcttggg tctgtgctgg tggggtcctg gtatgcaggg gccaccggtc 2041 actaacactc ttatgtcctg gctttctgtc cccgctgagc tttctctcac ccgcccgttt 2101 tctctcctgc ttcattgcct gctgcctaag ccttggccct tctctcgggc agaggcaggt 2161 gctgtggcag cacctctccc caccaccggg cccctgcagg ccgcctccct cctcccaggc 2221 ctgctaaccc tctctcttct ccttctttgc tgtcctgccg gggatctcca gtgtgtgcgg 2281 gggcttaagg acctcctgag gaccgctgct ctctgcctct ccaggaatgg cctgggggga 2341 gccaggcacc cggcacctcc acctgcctaa cctgtggccc atctgccacc atctgtgcct 2401 acagggtctg ccccccagcc tgcccggcct gtgtgctctc taggacccca tagggggcag 2461 gggctggcct ctttgcccca ttcccgctcc atgccggcca gagtgtagaa agccataacg 2521 cacgcagcca tcagcacaat aatgtgactc tacgctgata tgctccctct ctcctccact 2581 gacttcccct tcccggattt gtgaggtgtc aagactagga atctggcctt agagcctgcc 2641 cctccacccc ctcagatcag gcatagccat agtcaagccc agcaggtttc ctcaggagct 2701 gtctggggtg ttgatggtgg atgacgctgc tgaacaagtt tggtgactgt tctaagcaca 2761 actggcttga tactgttccc acggcctgtc cacctcccac ccccaaccct ccaccagagt 2821 aggtaggatg tagggagggt gcgtgccgcc tttgctctag gcactgaggg accaagctag 2881 ccgtgcacag ccccatacac ttcaggggcg taaaggaaag agctgagcca aggaaaatca 2941 gctgagccca gggctggggg ctgcttgtct gctatcctgt accttttttt tttttaacca 3001 aaataaagat tcccctcttc ttgccatacc attggctgtc tggtggcgcc tttactttgg 3061 ggcccaggga tgggacctgc agtgggcgtg tggaacatat ggctccccct cgctcccagc 3121 tttcttccag ctggccagtg ctgctctgga gatttacaag cacaacgaag ccaggaggga 3181 cacaggaaaa gtggctgaca tccttttcac tctgcccctc cagaactctt ggtctcaatt 3241 ccagacacca cccagcctta gctgacctct ggattctgat aggtcccagt gcaggctgag 3301 acagagggtt taactccagt ttgggactgc catacccatg aactgagccc agcccagggt 3361 aacgatctca tggaaacttc tctctcccca gttgctgcac tacatcaaga tacacacatg 3421 tgcatacact gtactatggg ctaaaaaaat acgtaccgct accgttcagc aagggcttgc 3481 cgagtcccgg gcccattttc tcatcttaac ctgtgaggag gatgatgtca gcctttttac 3541 agatgaggga actgagactc aaggaagaaa caggagctgc ccaaggtcac ccagctggca 3601 aagcagcaaa tcccagatcg gaacctgatc tctgccccga gctctgagcc atctgcacta 3661 cccaaggaat gaatacagcg gtgggaggat gagatcttgg agaaacccta aaattagaga 3721 atgtcatagc cagtagaggg cttagagttg atctgggcca gcctccttgt tttactgatg 3781 gagaaattga agcccagagg caggaaggga cctgcccaag gccttataac agagctggga 3841 tgcagtccca cactctgacc tcattccatt ctctctccat aaattctgca ctgtctctag 3901 actggactgg tttagatgtg ggatactcta aacagcagtg ccttcaagag aaaaagaatc 3961 agaactacga atcacttaaa agtaatgtaa gctactctgg gcacactgcc tatggggtcg 4021 ccctgctcca caaggagcca caaaaataat taaaataatt taatatccct tcccaaaggt 4081 aaccagtaaa gtaagctctt ggctaggtaa ctggactctt gttcacaact agccagtggg 4141 aaaaggtgct agagcttcct ctggccacct gtttaatttg atcattccaa gacagaaaca 4201 tttcttagga agttctttct agaatctacc tggtgtccct cccactgcta tcagagccct 4261 gtcctctgtc ctcagtggag gtagagagca aatggttgct gctttcttca tcacaaccct 4321 tcaaagccta ttattaccag ctaagaagga ttggttgact atgggccaga gcccctgagc 4381 ctgctggtag aatggatgct gtacaggagg gtggggaggt agcaggcaga atgaggaaag 4441 cccctttgag ctgcaacccc agctcctgtc ctgctgactc agacagctga ctgtggagct 4501 ccatgccctg ccagggcctg ctgcctcctg cccgtctgag ctcctgaact tgggaaatgg 4561 aggcccagag gcaaagggag gtacctgaga caggaactga gtcaggatca acaggccaga 4621 gcgggcagga ggtatcaggc agcctggctc ccagatgcac ccctgagctc cagcagggga 4681 ggagtaggaa tgaaggggct tccttgccct tgctcatggc tatgcggagg gcgtgaacca 4741 ccaccaggtc ctctggctta agtggcggga agcaaatggt ccctccctgg actcaggctc 4801 caaagttcct gggcctgcct tccaggttcc cagtgtcctg ggatctccag ctttccccag 4861 gacttgggga agccccggct ggatgactag tacaaatgaa ggcccctgag gttccaggac 4921 ctgctgaggt cacaggaata tcctagatca agcttgtcca acccacggcc cacaggctgc 4981 atgtggccca gaatggcttt gaatgcagcc caacacaaat tagtaaactt tcttaaaaca 5041 ttatgagatt tttttgcaaa tttttttttt ttttttagct catcagttat tggtagtgtt 5101 ggtatatttt atgtgtggcc caagacaatt cttccaatgt ggcccaggga agccaaaaga 5161 ttggacacgc ctgtcctaga tggagaggaa ggaggcagtg ctgagcacat ctggccattc 5221 atccatctgg agagagaagg ctatgggcaa actgcttcct ctcccctgta gacacccagc 5281 tgggaaggtc tggcctttgg taagtcctgg cttggggtcc ttcctcattt cacagaacct 5341 aactctatgt tagtgctttg tgagtatatg ttgatcataa taaagttgac gggatttttt 5401 cacatgataa taatagttgt catctggccg ggcatggtgg cttatgccta taatttcagc 5461 actttggaag gctgaggcag gtggatcact tgaggtcagc tgttcgagac cagcctggcc 5521 aacatggtga aaccacatct ctacttaaaa aaaaaaaaaa tacaaaaatt agctgggtgt 5581 ggtggtgcac ccttgtaatc ccagctactc gggaggctga ggcaggagaa tcacttgaac 5641 ccaggaggtg gaggttgcag tgagctgaga ttgtgccact acactccagc ctgggtgaca 5701 agagcgaaac tccgtctcaa aaaaaaagaa aataataata ataatagttg ccatccattc 5761 tactgtgctt tccattaact cgtgtaatcc tcacaagtcc cattttatag ttacaggaac 5821 tgaggctcac agagcttaaa tcacttggcc aaggccacaa acagctataa gaattacatt 5881 taggcagtct gattccaaag atactagtct attctgtatc tcatagacaa acaatacata 5941 ttcacttttt tgttgttgtt ttgttttgag acggagtctt gctctgtcac ccaggctgga 6001 gtgcagtggc gccatctcgg ctcactgcaa cgtccgcctc ccgggttcaa gcgattctcc 6061 tgcctcagcc tcccgagtag ctgggactac aggcatgtgc caccatgccc ggctaatttt 6121 ttgtattttt agtagagaca gggttttcct gggttagcca gaatggtctc gatctcctga 6181 ccttgtgatc cacccacctc agcctcccaa agtgctgaga tgacaggcgt gagccaccgc 6241 gtccgaccta tattcactat ttataaattg gagagaataa gaaaatcaaa agggccaggt 6301 gtagtgactc acacctgtaa tcccagcact ttgggaagcc aaggcaggag gattgcttga 6361 acccagaagt tcgagaccag cctgggcaac atggtgagac cctgtctcta caaaaaatac 6421 aaaaattagc tgggcgttgt ggtgagcacc ttattcttag gaagctgagg caggaggatc 6481 acctgaggcc aaggaggttg agactgcagt gagctgtgat cataccactg tacttcagcc 6541 tggacatcag agtaagaccc tatctctaaa aaggaaattg agaagaaaga aaatcaaagg 6601 gaagcaaaat cactcactct cactacctca agataccctc tagaagttgg tattttagtg 6661 tggttcctat tgttttctgt gtcagttctc tgatttgagc aaaatctttg ggacgtcaaa 6721 cttaaaatcc cctttacttc cttggaaacc ctgtagcatt agcccagaca tgtccctact 6781 cctccttgtg gcaaagagaa ggatctcgtc tttggtcccc agagttctgg cctaagcctc 6841 cctccaggag ggaagatgag tgttcagaca ctcagagtag ctgggggaga cacaggcctg 6901 tgaaattatc ctggctcaac tattaggtcg gcagaatccc agtgaaggga gccctacctc 6961 tgagccccat ctaagctttg gctatgggtg gggcagataa gcaggaatcc atccctatag 7021 gctcaatgcc aacaccctta ggtgaaactc ttgatgaaac ttgaggccag ggctccggca 7081 agcagggaaa gaacgttggc aacagaggtc tccatctctg aggactctgc caggggtcag 7141 agatggggca atggtcaaaa ggaaggaaca ggccaggcac agtggctcat gcccataatc 7201 ccagcacttt gggaggctga ggcaggagga tcgcttgagc ccaggagttt gagacctgcc 7261 tgggcaatgt agtgagatct gctctctatt taaaaaaaaa aaaaaggaaa gaacaagtaa 7321 acttctgaga aacaggctgg gggaggcatc acgtagctgg aattgctgcc ccataaaaca 7381 gaatggtatg tgtcactgcc acctcccttt ctcagtcctc tctctcccca ggttgctagc 7441 gtccccctgg gggatcaaac tggactgctt cccagcctca gacagagagc agtctgagtc 7501 aggcaggaaa gtgggacagc cggggagctg gaccccaccc tctgtgagcc ccgctggtac 7561 ctgatggcat gtggcttgga gagggcaggt gacctggcgt ggagggccag agggtaaatc 7621 ctcaaacaag tggcaacagg ccaccaactt gaaagggaaa attgtgtagt gatgggaaat 7681 gtgtccaaca aacctactgg gtgactaatt acaaaggctg ggctggagct tcagaggctg 7741 cttgttaaac acttcattaa gcggcactct gaaagctgcc acctgcgcat tctgggagct 7801 cagaggggac cctgaggggg aatgaggcct ggaggatgga accatcttca ggtagactga 7861 gaaggagcct ggatctcact tccaaacaca gtctggagct cataggtcag aggcctcaat 7921 gggagaaaag ctaaaggaag agggtgcaga aaggagtttc agggaattgg tggctatgtg 7981 actttgagca aatctcaccc ctctctgaga cttagtgttc ccatctctat ggtcctgtgt 8041 gtgtcacaga gacatggtgg ggattaaatt cgatcgtgat atgaaagtgc ttgggaaact 8101 ccatggccct acctaaacat gagttatcct cacctgaacc aaggggggaa gttacctggc 8161 aggattagga accccatcct cctgaacctt tatgggctct gtcgaggctg aagcagccag 8221 gggctaaagc cagtccttag cccctggaag ggcactgtga aagtggatct gatttgagaa 8281 gccgtttcct gatgtgggca gccatgtgat gccagccccg aacaagaggg ggcagcctgg 8341 agcctggaaa ggtgccagtg caggtggggc ccacgcccag atttctcctg ctgactgttc 8401 tgatgattca cccccacatc ccagcctttt tacctttact gcagagccgg aaagggtgtg 8461 gggaagagag gagagggagg caggtcttgg gccctggtcc cgccccctgc tcctccccac 8521 ccttctctgg gcctggccac ccagccaaaa ggcaggccaa gagcaggaga gacacagagt 8581 ccggcattgg tcccaggcag cagttagccc gccgcccgcc tgtgtgtccc cagagccatg 8641 gagagagcca gtctgatcca gaaggccaag ctggcagagc aggccgaacg ctatgaggac 8701 atggcagcct tcatgaaagg cgccgtggag aagggcgagg agctctcctg cgaagagcga 8761 aacctgctct cagtagccta taagaacgtg gtgggcggcc agagggctgc ctggagggtg 8821 ctgtccagta ttgagcagaa aagcaacgag gagggctcgg aggagaaggg gcccgaggtg 8881 cgtgagtacc gggagaaggt ggagactgag ctccagggcg tgtgcgacac cgtgctgggc 8941 ctgctggaca gccacctcat caaggaggcc ggggacgccg agagccgggt cttctacctg 9001 aagatgaagg gtgactacta ccgctacctg gccgaggtgg ccaccggtga cgacaagaag 9061 cgcatcattg actcagcccg gtcagcctac caggaggcca tggacatcag caagaaggag 9121 atgccgccca ccaaccccat ccgcctgggc ctggccctga acttttccgt cttccactac 9181 gagatcgcca acagccccga ggaggccatc tctctggcca agaccacttt cgacgaggcc 9241 atggctgatc tgcacaccct cagcgaggac tcctacaaag acagcaccct catcatgcag 9301 ctgctgcgag acaacctgac actgtggacg gccgacaacg ccggggaaga ggggggcgag 9361 gctccccagg agccccagag ctgagtgttg cccgccaccg ccccgccctg ccccctccag 9421 tcccccaccc tgccgagagg actagtatgg ggtgggaggc cccacccttc tcccctaggc 9481 gctgttcttg ctccaaaggg ctccgtggag agggactggc agagctgagg ccacctgggg 9541 ctggggatcc cactcttctt gcagctgttg agcgcaccta accactggtc atgcccccac 9601 ccctgctctc cgcacccgct tcctcccgac cccaggacca ggctacttct cccctcctct 9661 tgcctccctc ctgcccctgc tgcctctgat cgtaggaatt gaggagtgtc ccgccttgtg 9721 gctgagaact ggacagtggc aggggctgga gatgggtgtg tgtgtgtgtg tgtgtgtgtg 9781 tgtgtgcgcg cgcgccagtg caagaccgag actgagggaa agcatgtctg ctgggtgtga 9841 ccatgtttcc tctcaataaa gttcccctgt gacactcctc ctgtctctct tccagttctt 9901 ggcgatgggc tgggagtggg actggaatct gacttagaga ccctgacttt ggacctctga 9961 gttagggccc tgaactccct aggtggctca gtggcccgca cgcaagactt tgagtccagg 10021 tgaggccggg gtcc // LOCUS AF036906 1460 bp DNA PRI 02-FEB-1998 DEFINITION Homo sapiens linker for activation of T cells (LAT) mRNA, alternatively spliced form, complete cds. ACCESSION AF036906 NID g2828025 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1460) AUTHORS Zhang,W., Sloan-Lancaster,J., Kitchen,J., Trible,R.P. and Samelson,L.E. TITLE LAT: the ZAP-70 tyrosine kinase substrate that links T cell receptor to cellular activation JOURNAL Cell (1997) In press REFERENCE 2 (bases 1 to 1460) AUTHORS Zhang,W., Sloan-Lancaster,J., Kitchen,J., Trible,R.P. and Samelson,L.E. TITLE Direct Submission JOURNAL Submitted (05-DEC-1997) Cell Biology and Metabolism Branch, National Institute of Child Health and Development, National Institute of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA COMMENT LAT is a highly tyrosine phosphorylated protein, previously described as p36-38, and it associates with many signaling molecules, such as Grb2, PLC-gamma1, PI-3 kinase, cbl, Vav, and SLP-76, either directly or indirectly upon T cell activation. It is a potential type III transmembrane protein. FEATURES Location/Qualifiers source 1..1460 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat T cells" gene 79..867 /note="linker for activation of T cell" /gene="LAT" CDS 79..867 /gene="LAT" /note="tyrosine kinase substrate; This a alternatively spliced form of LAT" /codon_start=1 /product="LAT" /db_xref="PID:g2828026" /translation="MEEAILVPCVLGLLLLPILAMLMALCVHCHRLPGSYDSTSSDSL YPRGIQFKRPHTVAPWPPAYPPVTSYPPLSQPDLLPIPRSPQPLGGSHRTPSSRRDSD GANSVASYENEGASGIRGAQAGWGVWGPSWTRLTPVSLPPEPACEDADEDEDDYHNPG YLVVLPDSTPATSTAAPSAPALSTPGIRDSAFSMESIDDYVNVPESGESAEASLDGSR EYVNVSQELHPGAAKTEPAALSSQEAEEVEEEGAPDYENLQELN" BASE COUNT 269 a 443 c 432 g 316 t ORIGIN 1 accccatctt catctggcct tgactctgcc cttgaggggc ctaggggtgc agccagcctg 61 ctccgagctc ccctgcagat ggaggaggcc atcctggtcc cctgcgtgct ggggctcctg 121 ctgctgccca tcctggccat gttgatggca ctgtgtgtgc actgccacag actgccaggc 181 tcctacgaca gcacatcctc agatagtttg tatccaaggg gcatccagtt caaacggcct 241 cacacggttg ccccctggcc acctgcctac ccacctgtca cctcctaccc acccctgagc 301 cagccagacc tgctccccat cccaagatcc ccgcagcccc ttgggggctc ccaccggacg 361 ccatcttccc ggcgggattc tgatggtgcc aacagtgtgg cgagctacga gaacgagggt 421 gcgtctggga tccgaggtgc ccaggctggg tggggagtct ggggtccgtc ctggactagg 481 ctgacccctg tgtcgttacc cccagaacca gcctgtgagg atgcagatga ggatgaggac 541 gactatcaca acccaggcta cctggtggtg cttcctgaca gcaccccggc cactagcact 601 gctgccccat cagctcctgc actcagcacc cctggcatcc gagacagtgc cttctccatg 661 gagtccattg atgattacgt gaacgttccg gagagcgggg agagcgcaga agcgtctctg 721 gatggcagcc gggagtatgt gaatgtgtcc caggaactgc atcctggagc ggctaagact 781 gagcctgccg ccctgagttc ccaggaggca gaggaagtgg aggaagaggg ggctccagat 841 tacgagaatc tgcaggagct gaactgaggg cctgtggagg ccgagtctgt cctggaacca 901 ggcttgcctg ggacggctga gctgggcagc tggaagtggc tctggggtcc tcacatggcg 961 tcctgccctt gctccagcct gacaacagcc tgagaaatcc ccccgtaact tattatcact 1021 ttggggttcg gcctgtgtcc cccgaacgct ctgcaccttc tgacgcagcc tgagaatgac 1081 ctgccctggc cccagcccta ctctgtgtaa tagaataaag gcctgcgtgt gtctgtgttg 1141 agcgtgcgtc tgtgtgtgcc tgtgtgcgag tctgagtcag agatttggag atgtctctgt 1201 gtgtttgtgt gtatctgtgg gtctccatcc tccatggggg ctcagccagg tgctgtgaca 1261 ccccccttct gaatgaagcc ttctgacctg ggctggcact gctgggggtg aggacacatt 1321 gccccatgag acagtcccag aacacggcag ctgctggctg tgacaatggt ttcaccatcc 1381 ttagaccaag ggatgggacc tgatgacctg ggaggactct tttagttctt acctcttgtg 1441 gttctcaata aaacagaacg // LOCUS D38752 1020 bp DNA PRI 09-OCT-1997 DEFINITION Homo sapiens gene for fibroblast growth factor-8, complete cds. ACCESSION D38752 NID g2463547 KEYWORDS fibroblast growth factor-8. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tanaka,A., Miyamoto,K., Matsuo,H., Matsumoto,K. and Yoshida,H. TITLE Human androgen-induced growth factor in prostate and breast cancer cells: its molecular cloning and growth properties JOURNAL FEBS Lett. 363 (3), 226-230 (1995) MEDLINE 95255551 REFERENCE 2 (bases 1 to 1020) AUTHORS Tanaka,A. TITLE Direct Submission JOURNAL Submitted (31-OCT-1994) to the DDBJ/EMBL/GenBank databases. Akira Tanaka, Jichi Medical School, Department of Pathology; 3311-1 Yakushiji, Minamikawachi-machi, Kawachi-gun, Tochigi 329-04, Japan (E-mail:atanaka@jichi.ac.jp, Tel:0285-44-2111(ex.3316), Fax:0285-44-8467) FEATURES Location/Qualifiers source 1..1020 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 171..818 /note="FGF-8" /codon_start=1 /product="fibroblast growth factor-8" /db_xref="PID:d1023395" /db_xref="PID:g2463548" /translation="MGSPRSALSCLLLHLLVLCLQAQVTVQSSPNFTQHVREQSLVTD QLSRRLIRTYQLYSRTSGKHVQVLANKRINAMAEDGDPFAKLIVETDTFGSRVRVRGA ETGLYICMNKKGKLIAKSNGKGKDCVFTEIVLENNYTALQNAKYEGWYMAFTRKGRPR KGSKTRQHQREVHFMKRLPRGHHTTEQSLRFEFLNYPPFTRSLRGSQRTWAPEPR" BASE COUNT 193 a 341 c 307 g 179 t ORIGIN 1 gcggcgcggc gagcacgacg ttccacggga cccgcggagc cgcgtcgtga tcgccgccgg 61 cctcccgcac ccgcaccctc tccgctcgcg ccctgctcag cgcgtcctcc cgcggcggcc 121 cgcgggacgg cgtgacccgc cgggctctcg gtgccccggg gccgcgcgcc atgggcagcc 181 cccgctccgc gctgagctgc ctgctgttgc acttgctggt cctctgcctc caagcccagg 241 taactgttca gtcctcacct aattttacac agcatgtgag ggagcagagc ctggtgacgg 301 atcagctcag ccgccgcctc atccggacct accaactcta cagccgcacc agcgggaagc 361 acgtgcaggt cctggccaac aagcgcatca acgccatggc agaggacggc gaccccttcg 421 caaagctcat cgtggagacg gacacctttg gaagcagagt tcgagtccga ggagccgaga 481 cgggcctcta catctgcatg aacaagaagg ggaagctgat cgccaagagc aacggcaaag 541 gcaaggactg cgtcttcacg gagattgtgc tggagaacaa ctacacagcg ctgcagaatg 601 ccaagtacga gggctggtac atggccttca cccgcaaggg ccggccccgc aagggctcca 661 agacgcggca gcaccagcgt gaggtccact tcatgaagcg gctgccccgg ggccaccaca 721 ccaccgagca gagcctgcgc ttcgagttcc tcaactaccc gcccttcacg cgcagcctgc 781 gcggcagcca gaggacttgg gcccccgagc cccgataggt gctgcctggc cctccccaca 841 atgccagacc gcagagaggc tcatcctgta gggcacccaa aactcaagca agatgagctg 901 tgcgctgctc tgcaggctgg ggaggtgctg ggggagccct gggttccggt tgttgatatt 961 gtttgctgtt gggtttttgc tgtttttttt tttttttttt tttttaaaac aaaagaggct // LOCUS D78514 617 bp DNA PRI 18-DEC-1996 DEFINITION Human mRNA for ubiquitin-conjugating enzyme, complete cds. ACCESSION D78514 NID g1741956 KEYWORDS UBE2G; ubiquitin-conjugating enzyme. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Kawai,A., Fujiwara,T., Maekawa,H., Hirai,Y., Nakamura,Y. and Takahashi,E. TITLE Molecular cloning of UBE2G, encoding a human skeletal muscle-specific ubiquitin-conjugating enzyme homologous to UBC7 of C. elegans JOURNAL Cytogenet. Cell Genet. 74 (1-2), 146-148 (1996) MEDLINE 97049093 REFERENCE 2 (sites) AUTHORS Watanabe,T., Okuno,S., Fujiwara,T., Takahashi,E., Nakamura,Y., Hirai,Y. and Maekawa,H. TITLE Molecular cloning of a novel ubiquitin-conjugating enzyme, UBE2G, homologus to UBC7 of C.elegans JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 617) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (28-NOV-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..617 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" gene 19..531 /gene="UBE2G" CDS 19..531 /gene="UBE2G" /codon_start=1 /product="ubiquitin-conjugating enzyme" /db_xref="PID:d1012075" /db_xref="PID:g1741957" /translation="MTELQSALLLRRQLAELNKNPVEGFSAGLIDDNDLYRWEVLIIG PPDTLYEGGVFKAHLTFPKDYPLRPPKMKFITEIWHPNVDKNGDVCISILHEPGEDKY GYEKPEERWLPIHTVETIMISVISMLADPNGDSPANVDAAKEWREDRNGEFKRKVARC VRKSQETAFE" 3'UTR 532..>617 BASE COUNT 181 a 129 c 142 g 165 t ORIGIN 1 gggccctcgg cagggaggat gacggagctg cagtcggcac tgctactgcg aagacagctg 61 gcagaactca acaaaaatcc agtggaaggc ttttctgcag gtttaataga tgacaatgat 121 ctctaccgat gggaagtcct tattattggc cctccagata cactttatga aggtggtgtt 181 tttaaggctc atcttacttt cccaaaagat tatcccctcc gacctcctaa aatgaaattc 241 attacagaaa tctggcaccc aaatgttgat aaaaatggtg atgtgtgcat ttctattctt 301 catgagcctg gggaagataa gtatggttat gaaaagccag aggaacgctg gctccctatc 361 cacactgtgg aaaccatcat gattagtgtc atttctatgc tggcagaccc taatggagac 421 tcacctgcta atgttgatgc tgcgaaagaa tggagggaag atagaaatgg agaatttaaa 481 agaaaagttg cccgctgtgt aagaaaaagc caagagactg cttttgagtg acatttattt 541 agcagctagt aacttcactt atttcagggt ctccaattga gaaacatggc actgtttttc 601 ctgcactcta cccaccg // LOCUS D85815 1086 bp DNA PRI 15-APR-1997 DEFINITION Human DNA for rhoHP1, complete cds. ACCESSION D85815 NID g1944384 KEYWORDS rhoHP1. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1086) AUTHORS Shimizu,F. TITLE Direct Submission JOURNAL Submitted (05-JUN-1996) to the DDBJ/EMBL/GenBank databases. Fumio Shimizu, Otsuka Pharmaceutical Co. Ltd., Otska GEN Research; Kawauchi-cho 463-10, Tokushima, Tokushima 771-01, Japan (E-mail:shimizu@otsuka.genome.ad.jp, Tel:0886-65-2888, Fax:0886-37-1035) REFERENCE 2 (sites) AUTHORS Shimizu,F., Watanabe,T.K., Okuno,S., Fujiwara,T. and Nakamura,Y. TITLE A novel human cDNA homologous to rho genes JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1086) AUTHORS Shimizu,F., Watanabe,T.K., Okuno,S., Omori,Y., Fujiwara,T., Takahashi,E. and Nakamura,Y. TITLE Isolation of a novel human cDNA (rhoHP1) homologous to rho genes JOURNAL Biochim. Biophys. Acta 1351 (1-2), 13-16 (1997) MEDLINE 97236425 FEATURES Location/Qualifiers source 1..1086 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 40..672 /note="Rho-related protein HP1" /codon_start=1 /product="rhoHP1" /db_xref="PID:d1020429" /db_xref="PID:g1944385" /translation="MTAAQAAGEEAPPGVRSVKVVLVGDGGCGKTSLLMVFADGAFPE SYTPTVFERYMVNLQVKGKPVHLHIWDTAGQDDYDRLRPLFYPDASVLLLCFDVTSPN SFDNIFNRWYPEVNHFCKKVPIIVVGCKTDLRKDKSLVNKLRRNGLEPVTYHRGQEMA RSVGAVAYLECSARLHDNVHAVFQEAAEVALSSRGRNFWRRITQGFCVVT" BASE COUNT 199 a 361 c 338 g 188 t ORIGIN 1 cgcagccgcc cgcccgcccg ctcagcgccc ggccccggga tgacggcggc ccaggccgcg 61 ggtgaggagg cgccaccagg cgtgcggtcc gtcaaggtgg tcctggtggg cgacggcggc 121 tgcgggaaga cgtcgctgct gatggtcttc gccgatgggg ccttccccga gagctacacc 181 cccacggtgt ttgagcggta catggtcaac ctgcaagtga aaggcaaacc tgtgcacctc 241 cacatctggg acacagcagg gcaagatgac tatgaccgcc tgcggcccct gttctaccct 301 gacgccagcg tcctgctgct ttgcttcgat gtcaccagcc cgaacagctt tgacaacatc 361 tttaaccggt ggtacccaga agtgaatcat ttctgcaaga aggtacccat catcgtcgtg 421 ggctgcaaga ctgacctgcg caaggacaaa tcactggtga acaagctccg aagaaacgga 481 ttggagcctg tgacctacca caggggccag gagatggcga ggtccgtggg cgcggtggcc 541 tacctcgagt gctcggctcg gctccatgac aacgtccacg ccgtcttcca ggaggccgcc 601 gaggtggccc tcagcagccg cggtcgcaac ttctggcggc ggattaccca gggcttttgc 661 gtggtgacct gagcggctcg gggcgtccca gcgacgcggg aaggggcagg gcgctgacct 721 gctgctgagc tggctgggct ggacccggtc cctaggctgt gaccgccgaa ctccactgca 781 acagacgggc gccaccaaag ccaggccctg aggcctggga gtcctggact gagaaagggg 841 gttcctgggc ccacctgctc tgtgtagggc tcgtcctgcg gtgcccgaga atcactcgct 901 aacccctatg cccggtcccg gaccgacatc ctggagccgc ctgtgcagcc tgatgccccc 961 tcgtggctgc tcccagggct gcacctgcca ggacctaatg ttcttaggtc cctctggcca 1021 gaacccacac ccggcccctt cccacctgtc atactggtaa ctgtaacaag aaaaacgaca 1081 tcactt // LOCUS D87957 1658 bp DNA PRI 03-FEB-1998 DEFINITION Homo sapiens gene for protein involved in sexual development, complete cds. ACCESSION D87957 NID g1620897 KEYWORDS protein involved in sexual development. SOURCE Homo sapiens male foreskin fibroblast DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Okazaki,N., Okazaki,K., Watanabe,Y., Kato-Hayashi,M., Yamamoto,M. and Okayama,H. TITLE Novel factor highly conserved among eukaryotes controls sexual development in fission yeast JOURNAL Mol. Cell. Biol. 18 (2), 887-895 (1998) MEDLINE 98107674 REFERENCE 2 (bases 1 to 1658) AUTHORS Okazaki,N. TITLE Direct Submission JOURNAL Submitted (22-SEP-1996) to the DDBJ/EMBL/GenBank databases. Noriko Okazaki, The Okayama Cell Switching Project, ERATO, JRDC; 103-5 Tanakamonzen-cho, Sakyo-ku, Kyoto, Kyoto 606, Japan (E-mail:okayamap@mbox.kyoto-inet.or.jp, Tel:075-712-5406, Fax:075-712-5492) FEATURES Location/Qualifiers source 1..1658 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /sex="male" /tissue_type="foreskin" CDS 150..1049 /note="protein involved in sexual development" /codon_start=1 /db_xref="PID:d1014201" /db_xref="PID:g1620898" /translation="MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSK KRESVPDLAPMLWHSFGTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVAS HPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINFLLT TEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSHVAMILGKMVL QLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWL AQLVKNLQEGQVTDPRGIPLPPQ" BASE COUNT 410 a 422 c 412 g 414 t ORIGIN 1 tgagaggtca gagggccgcg aagtgggcgg agcgagccgg agtcggatgg cggctacggc 61 ggctcattat tttccgctgc aggggtgctg aaggggggac gcgggtcgga cgcgtccggc 121 tgtggaagag agcggcggcc gctcacaaca tgcacagcct ggcgacggct gcgcctgtgc 181 ctactacact ggcacaagtg gatagagaaa agatctatca gtggatcaat gagctgtcca 241 gtcctgagac tagggaaaat gctttgctgg agctaagtaa gaagcgagaa tctgttcctg 301 accttgcacc catgctgtgg cattcatttg gtactattgc agcactttta caggaaattg 361 taaatattta tccatctatc aacccaccca ccttgacagc acaccagtct aacagagttt 421 gcaatgctct ggcattactg caatgtgtag catcacatcc agaaaccagg tcagcgtttc 481 tcgcagcaca catcccactt tttttgtacc cctttttgca cactgtcagc aaaacacgtc 541 cctttgagta tctccggctc accagccttg gagttattgg ggccctggtg aaaacagatg 601 aacaagaagt aatcaacttt ttattaacaa cagaaattat ccctttatgt ttgcgaatta 661 tggaatctgg aagtgaactt tctaaaacag ttgccacatt catcctccag aagatcttgt 721 tagatgacac tggtttggct tatatatgtc agacgtatga gcgtttctcc catgttgcca 781 tgatcttggg taagatggtc ctgcagctat ccaaagagcc ttctgcccgt ctgctgaagc 841 atgtagtgag atgttacctt cgactttcag ataaccccag ggcacgtgaa gcactcagac 901 agtgcctccc tgaccagctg aaagacacaa ccttcgccca ggtgctaaaa gatgacacca 961 ccacgaaacg ctggcttgca caactggtga agaacctgca agagggccag gtcaccgatc 1021 cccggggtat ccccctgccc cctcagtgat ccttccctgt tccctcccac tactccccca 1081 agttggggaa aggaggggga acctacgaga aaaacagctc aggttttatc accgactggg 1141 aatagacaac ctcaatgctg aaccgcactg gagaaaaggg gcaaggtacc cctgctgagg 1201 tgtatgggct gccatctcag gctgtcttga ggacctgggc tccctctgct actcccagga 1261 aatgggctcc tgacacagca gtctgccacc acagccccag gagggtgtca acaccagcaa 1321 atgctgtatt tgcagcatgt ccaagatgac ccttctcccc tacctctacc tagccactgg 1381 cagggagggg agacagtggt gatagcagca gcactctagg catggtgaac gcctgggacc 1441 aagccatgtg gcgtttttta ttttgccttt ctggaagact caagatatgt ctcttcattc 1501 tctctcagta tttgtttact ttggtttttt tgtttttaat ctcagagaga ggtgtgttta 1561 gtgggcacaa gctgtaatat tcagcaaaac tttgtcgact ggcactgttt acaagtttgt 1621 tagctgcata agctcaataa aaagttggtt tgggcatt // LOCUS HSAC002086 112686 bp DNA PRI 13-MAY-1997 DEFINITION Human PAC clone DJ525N14 from Xq23, complete sequence. ACCESSION AC002086 NID g2085785 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 112686) AUTHORS Tin-Wollam,A, Graves,T and Biewald,T. TITLE The sequence of H. sapiens PAC clone DJ525N14 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 112686) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (13-MAY-1997) COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: This sequence was generated from part of bacterial clone contigs of human chromosome X, constructed by David Bentley's chromosome X mapping group at the Sanger Centre. Further information can be found at http://www.sanger.ac.uk/HGP/ChrX/ SOURCE INFORMATION: This clone was derived from human PAC library RPCI-3 prepared by Pieter de Jong and coworkers at Roswell Park Cancer Institute, using the method described by Ioannou et al., Nature Genetics 6:84-9 (1994). The library is from one male donor. For further details, see http://bacpac.med.buffalo.edu/ The clone is available from Genome Systems, Inc. (http://www.genomesystems.com). VECTOR: pCYPAC2 NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of H_DJ525N14; actual end is at 112686 of H_DJ525N14. The orientation of this clone is unknown. This clone contains STS AFMb331yc9 (NID:g1235110). FEATURES Location/Qualifiers source 1..112686 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone="DJ525N14" /clone_lib="RPCI-3" /map="Xq23" repeat_region 523..710 /rpt_family="ALU" repeat_region 1001..1081 /rpt_family="ALU" repeat_region complement(2902..3185) /rpt_family="ALU" misc_feature 3378..3418 /note="similar to human EST R69179 (NID:g842696) yi39b07.r1" misc_feature complement(3680..4065) /note="match to human EST T77594 (NID:g694797) yd73h12.r1" misc_feature 6471..6793 /note="match to human EST R69179 (NID:g842696) yi39b07.r1" misc_feature complement(6739..7041) /note="match to human EST R69071 (NID:g842588) yi39b07.s1" repeat_region 9255..9302 /rpt_family="ALU" repeat_region complement(12295..12496) /rpt_family="ALU" repeat_region complement(15224..15591) /rpt_family="L1" repeat_region complement(16974..17109) /rpt_family="ALU" repeat_region 17365..17631 /rpt_family="ALU" repeat_region complement(17661..17957) /rpt_family="ALU" repeat_region complement(20930..21221) /rpt_family="ALU" repeat_region 22978..23019 /rpt_family="L1" repeat_region 23615..23989 /rpt_family="ALU" repeat_region 24666..24947 /rpt_family="ALU" repeat_region 25334..25430 /rpt_family="ALU" repeat_region 26796..32933 /rpt_family="L1" repeat_region complement(31523..31940) /rpt_family="L1" repeat_region complement(35528..35802) /rpt_family="ALU" repeat_region complement(36166..36848) /rpt_family="LTR" repeat_region 36166..36849 /rpt_family="LTR" repeat_region 41112..41401 /rpt_family="ALU" repeat_region 41537..41616 /rpt_family="L1" repeat_region 41861..42159 /rpt_family="ALU" misc_feature 44658..44934 /note="match to human EST N23587 (NID:g1137737) yv99a09.s1" misc_feature 44658..44920 /note="match to human EST H83437 (NID:g1062108) yv83e03.s1" misc_feature complement(44956..45131) /note="match to human EST N23586 (NID:g1137736) yv99a09.r1" misc_feature complement(45478..45729) /note="match to human EST H83545 (NID:g1062216) yv83e03.r1" repeat_region 51930..52220 /rpt_family="ALU" misc_feature 52820..55727 /note="Elongation factor 1-alpha pseudogene" repeat_region 53066..53332 /rpt_family="ALU" repeat_region 53994..54029 /rpt_family="L1" repeat_region complement(55080..55370) /rpt_family="ALU" repeat_region 57186..57919 /rpt_family="L1" repeat_region 57998..58409 /rpt_family="L1" repeat_region 58422..58524 /rpt_family="L1" repeat_region 58525..58798 /rpt_family="ALU" repeat_region 58799..60287 /rpt_family="L1" repeat_region 60331..60776 /rpt_family="L1" repeat_region 60778..60903 /rpt_family="ALU" repeat_region 60904..61251 /rpt_family="L1" repeat_region complement(60905..61068) /rpt_family="L1" repeat_region 61272..62268 /rpt_family="L1" repeat_region 63265..63555 /rpt_family="ALU" repeat_region 63885..64052 /rpt_family="ALU" repeat_region 64062..64353 /rpt_family="ALU" repeat_region 64638..64679 /rpt_family="L1" repeat_region 68204..68329 /rpt_family="ALU" repeat_region 68462..68731 /rpt_family="ALU" repeat_region 68819..69221 /rpt_family="MER" repeat_region 69246..69441 /rpt_family="L1" repeat_region 69444..69728 /rpt_family="ALU" repeat_region complement(69847..70141) /rpt_family="ALU" repeat_region complement(70727..71293) /rpt_family="MER" repeat_region complement(71814..71880) /rpt_family="MER" repeat_region 74190..74487 /rpt_family="ALU" repeat_region complement(76351..76650) /rpt_family="ALU" repeat_region complement(78351..78759) /rpt_family="MER" repeat_region 82086..82239 /rpt_family="ALU" repeat_region 83720..83996 /rpt_family="ALU" repeat_region 84463..84753 /rpt_family="ALU" repeat_region 85074..85130 /rpt_family="L1" repeat_region complement(88509..88576) /rpt_family="L1" repeat_region complement(88920..89211) /rpt_family="ALU" repeat_region complement(91388..91427) /rpt_family="ALU" repeat_region 93294..93320 /rpt_family="L1" gene 96380..98398 /gene="WUGSC:H_DJ525N14.1" CDS 96380..98398 /gene="WUGSC:H_DJ525N14.1" /note="similar to zinc finger 5 protein from Gallus gallus, U51640 (PID:g1399185)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2085786" /translation="MESRKLISATDIQYSGSLLNSLNEQRGHGLFCDVTVIVEDRKFR AHKNILSASSTYFHQLFSVAGQVVELSFIRAEIFAEILNYIYSSKIVRVRSDLLDELI KSGQLLGVKFIAELGVPLSQVKSISGTAQDGNTEPLPPDSGDKNLVIQKSKDEAQDNG ATIMPIITESFSLSAEDYEMKKIIVTDSDDDDDDVIFCSEILPTKETLPSNNTVAQVQ SNPGPVAISDVAPSASNNSPPLTNITPTQKLPTPVNQATLSQTQGSEKLLVSSAPTHL TPNIILLNQTPLSTPPNVSSSLPNHMPSSINLLVQNQQTPNSAILTGNKANEEEEEEI IDDDDDTISSSPDSAVSNTSLVPQADTSQNTSFDGSLIQKMQIPTLLQEPLSNSLKIS DIITRNTNDPGVGSKHLMEGQKIITLDTATEIEGLSTGCKVYANIGEDTYDIVIPVKD DPDEGEARLENEIPKTSGSEMANKRMKVKHDDHYELIVDGRVYYICIVCKRSYVCLTS LRRHFNIHSWEKKYPCRYCEKVFPLAEYRTKHEIHHTGERRYQCLACGKSFINYQFMS SHIKSVHSQDPSGDSKLYRLHPCRSLQIRQYAYLSDRSSTIPAMKDDGIGYKVDTGKE PPVGTTTSTQNKPMTWEDIFIQQENDSIFKQNVTDGSTEFEFIIPESY" misc_feature 100489..100812 /note="match to human EST R31842 (NID:g787685) yh69d10.r1" misc_feature complement(100904..101361) /note="match to human EST D80048 (NID:g1177925)" misc_feature complement(100974..101338) /note="match to human EST R31806 (NID:g787649) yh69h10.s1" misc_feature complement(101047..101361) /note="match to human EST R31792 (NID:g787635) yh69d10.s1" misc_feature complement(101258..101362) /note="match to human EST T61211 (NID:g664248) yb84c08.s1" misc_feature 101614..101789 /note="match to human EST R39187 (NID:g796643) yc89b12.s1" misc_feature 101615..101822 /note="match to human EST (NID:g683307)" misc_feature 101615..102038 /note="match to human EST AA131343 (NID:g1692841) zo08h04.s1" misc_feature complement(103124..103580) /note="match to human EST AA131443 (NID:g1692930) zo08h04.r1" misc_feature complement(103788..103995) /note="match to human EST T75299 (NID:g692061) yc89b12.r1" repeat_region 104972..105263 /rpt_family="ALU" repeat_region complement(107849..108138) /rpt_family="ALU" repeat_region 108827..109127 /rpt_family="ALU" misc_feature 109916..110040 /note="similar to human EST AA018543 (NID:g1481798) ze50c10.r1" misc_feature complement(111208..111355) /note="similar to human EST T75299 (NID:g692061) yc89b12.r1" BASE COUNT 33780 a 23505 c 23725 g 31676 t ORIGIN 1 gatcagtcgt tggggtgact ggaacccaca gtacatgtag gggcccacaa actagtttaa 61 tttcttatat aaaataaggt tgatgaaaag ggtcattgtg caagcaaaaa tagaatgagt 121 cactcaaacg atggaggata ggttctatgg ttaaaaccta ctctctatat tcaggacatt 181 aacaagtcct gaagtatcta caaaacactt cttaaaggaa gaatcttaga gcactcaatc 241 tgtcctggct gtgattattt gacaataaat aaaatgtatt tattcctaaa caagagctca 301 gaaagatatc tgtagaaaag ggcaggagta gaggaaaaga gggctatgca gcagcaggaa 361 agaagggaaa tcactgcagg tccacagcac cagctcagca cagaaggtaa gtgcgcattt 421 gtaatgggtc tgtagccaga gacatagcag aagctgcgct atccacagtc acaaccaaag 481 gcaaaggaag gatgccctgc aattttagaa atcttggctg cagctgggaa cggtggctca 541 cgcctgtaat cccggcactt tgggaggccg aggtgggtgg atcacttggg gccagcagtt 601 ccagaccagc ccagcaaaca tggtgaaacc ctgtctctac taaaaataca ataattaccc 661 aggtgtgctg gtgcgcacct gtagttccag gtactcggga ggctgaggca ctcctgagta 721 cctgcacaat acatgcaaac acccacccca tcctactccc cggccccccc gccctactcc 781 ccccctccac cctacgcccc cccgccctac tccctccccc gccctactcc ccccaccccg 841 tcctactgcc cccactccaa cacgtgctct cccgtgccca aggcaaggcc aagcccctga 901 ggcatgcgca cctcagcagg cccaacccac agcaaatagt gggaagagga aaggcaagag 961 aggaggtctc taagtggata cactgttact gaatctaggt acccagaaga tggaggttgt 1021 agtgagccca gatagtgcca ctgcattcca gagacagagc gagactgtct caaaaaaaaa 1081 aaaaaaaaaa aaaaaagaaa gaaagaaaga aaagaaaagc aatctcggat gctctaagca 1141 tcattgtgca tgtttccacc cttgcacttt gatgtggatt cattactaag aattaatagt 1201 ggactttctc aaggagtgtg ctgcattttt acatagtgca tttgcgcagc ccacaagctt 1261 tgcaaatcca ggcccaagct cctgctcctg gagacctagg aagcatgggg ctagtgcaaa 1321 atctccctta tcgccataaa atgggcttgc acaattcagc acagcagctt cccccaagtt 1381 ccacactcac gcatgcctgt ctacaggaca acgcatgcaa gcacgcaccc catcctactc 1441 cccctcccac gccctactcc cccctccccc acaccctact acaccactta cccctccccc 1501 cccacaccac caacgcgtgc tctccctcat ccgggcaagg ccaagcccct gacgcatgcg 1561 cacctcagca ggcccaaccc acagcaaata gcgggaagca gaaaagcaag agaggaggtc 1621 tctaagtgga tacactgttg ctgagtctag acaccagaag aacgttgcag gcggcgactc 1681 acagttctag cactgcctag gagagcgtgg tggccccagc tcagaatctg cagaagtgca 1741 cagctccatc cacaccactc agggtatgga gcctccggac cagtgtagcc agtatatgac 1801 cagcttgctc agccctgcag tcgacgacga gaaagaacta cagggtcagt accttgatgg 1861 gacacatgct tttgcgaact caggaaacca aggttctctg ggagaggcga gtagactgga 1921 tgatacccct gcactaccac tgctctcaga gtgtagcccc tcaccacctc cttccaacac 1981 cctcaggacc aatgccttga tctttctctt ggggtttcta cttttccaga tatgaatgct 2041 atggtgctgt cgcttactga agaggtcaaa gaggaggaag aggatgcaca gcctgagcct 2101 gagcaaggca cagcagcagg agaaaagtta aagtcggcag gagcccaagg cggagaagaa 2161 aaagatggcg gcggagaaga aaaagatggc ggcggcgccg gagttcctgg ccacctatgg 2221 gaaggagacc tcgagggcac cagcggcagc gatggcaacg ttgaggacag cgaccagagc 2281 gagaaggaac ctgggcagca gtattcgcgc ccacagggcg ccgtcggggg gctggagcct 2341 ggcaacgcgc agcagcccaa cgtccacgcc ttcaccccat tgcagctgca ggagctggag 2401 cgcattttcc aacgcgagca gttccccagt gagttcctgc ggtaagccca ttgctctggt 2461 tggcgcgcgg tttgcaggga ggcggcgttt ggctttcccg cagtccctct cctaccctct 2521 cccctcctga accaaaaccc atctggggcc tggtgttgct gccgtcccct ccccgcagac 2581 ccctggcacc tagtgggttc tgtagtgggg ctatgcctat taggcatcat gcagaattta 2641 aatgaaccaa gttgggcaac tttgggctga ggtcagttat gataaataac tcctatccca 2701 ggcgaggcag aaataaagat gaggagatta aggttctgta cagcaagtgc agggtcgcat 2761 tctgacctta tttaaaattc tgagaagtcc gtcgttcgtc tgggtttcct ttggtgttaa 2821 tttttctaag tttcaaatag taagttagaa tgtcatttat attgattaac gatttttttt 2881 gttatggggg ggttattttt ttattttttg gagaaggagt ctcgctggga cacccaggct 2941 ggagtgcaat ggtgcgatct cggctcactg caacctctgc ctcccaggtt caagagatta 3001 tcctacctca gcgtcccaag tagctgggat tacaggcgcc agccaccacg cccggctaat 3061 tattgaattt ttagtagaga cggggtttca ccatattggc caggctggtc tcgaactcct 3121 gacctcaagt gatcctcctg cctcggcctc ccaaagtgct ggcattacag gcgtgagcca 3181 ccgccctgat tttttttttt ttggcatctc ctttatttgt ggcatgagag aaatgttcct 3241 aatgtgaggc tcagctgggt gttacagaac agcctactgg gtgtggggag ttggtagaat 3301 aaaaaaatta aacacaagaa tgaaacgaca cccacaactc caaatgctga acacgtgtgg 3361 tttcttccat agaaggaggc tggcaagaag catgaatgtg actgaactcg cagtgcaggt 3421 cagtaaaccg aaaaagcaat cgggcagggg agccattcta aaacccgctt cagggcttgg 3481 acacactttg acccagacat tgccatcttg gtgtttttgt ggcctttttc atgtataggc 3541 aatgaggtct gaactttggg atctttgtgg ctggaaaatg cagtaaggaa agctcagctt 3601 gtggaaattt cccattacca gagggaacat gacaatccac ggaaaaaaaa atgagtactg 3661 aactgtttcc tttattcctt gtgtataaaa tatattatac acttgaataa attacaaata 3721 tataaatata tgtattccaa attacaaata tatataatat ataacagtat aaattaaata 3781 tagcataata tataagattg aattcctagc actggtatcc tttaggtagc acttccatgt 3841 ggaggcatcc tggggttttc tgacatgggg attattcgaa ctaatgctcc aaagagctcc 3901 attaaactaa actatttaat atatttaaaa ccaagcataa actctttggt taagaattta 3961 taaatgttta ggcattgggg taaaggaata attcccaacc agaacgtatg tttccttaag 4021 ctgagactga tggcggggga tagcataaag ttcaagcgcc tggccctcct actattgttt 4081 ttaagtattt ggtgaagcct tgtagagagt ggcagctcta aattttaatt ttctggaata 4141 ctttgatatg gcaggctgaa aacattgctg acatagttgg ttttgcatta tggtagagat 4201 agcgaaaagg cttgcaacct ttgcaagttt atctagaaac caaaacaaac tacaagatca 4261 atttttcata aatttccacc aacctaagga tggaagtgtt aggcaagaat tgtatacagt 4321 cacactcatg gcttggggaa gctgccccca ttcaactttt gaagctacca aatgcatcag 4381 tgaatatttt cttgatcaat ttctagcatt taagagattt tttgtttctt ccagtttgtt 4441 gtactaacag gaagagatgt gttttgtgga atgtgttgat agtgggggat ggctgtcagg 4501 tcaggctaga ctcttacctg tggtaatttg aaataccttt ttatttattt gctcaggctg 4561 tgataataaa tttgtaggtc agtggatcca aaggagtatt tgcaaaaaag gaagaaaaga 4621 aagaaaaatc gggatgacct taatagcttt tctgcagagc tcatgtgtgt gagtgcagtg 4681 agggaactat gctgcggttc tgtcaatggt ggcatgcaga aaagctgcct gacaaaacgg 4741 ggcactgagt gttggcatcc tgttgccctc cattttctca ggcaggcaat gccaatttgg 4801 gggagatgaa atattgacgt tttggtgaat tgcctttccc ctccctgggt ttacatagca 4861 ttcactcaat gcagttctga aggcaaaagg accactctca ctcagcaaat aaggaaaaag 4921 gcatgatgtt ctcatctatt gccaaacatt tcagcatgaa tatctccaaa aatttttaat 4981 agatgtttcc cccctctgtt ttgtaaagat agtaagggct ggtttatgaa accgatacgg 5041 agactaggga cctaatgttg tcccatagca catttggccc acttttggca ttgcttatgt 5101 ttatgaacct ttgaccacag gtacattgga attgtctcta aacaagcact attacggtag 5161 gcttaaaatt tagactatgt attcatgagg gagaaggact ccaaatagta agatgggaga 5221 aggaaatggg gcatcaaaca gggagaagag tccctttgtc ccataaaatt tgcagtttgg 5281 gaacttccag aaggatcaaa atacaatgga aatgtcatta actagaagct tgggggatag 5341 tgttgtttaa ttgacaggtt ttattttttg gtcttttaac aattggcagg tgtttgggga 5401 gtagtgtttt ttgatacctc ctccacatta ctactgaacg tgttcagagt tgcctatttg 5461 gacactcccg ctttgggaat gctgatgaga ttgggatctt ctgctcgagt agtgaggcta 5521 gatggctgaa ggcgtcccat ggaactgggc aaccaagtct ctgtcccata tcccaggact 5581 atagggagaa caaagtgagc tgaccttgtc cttagcccag gaagatttgg agggagggat 5641 gagttggagc gtgcagaatt agcaatgctt cctttctctc aaactctcaa ctgggcaaag 5701 ctttttgaga atttttgtga atttggacaa catttgtgtc ctccttccac aaagcttcaa 5761 gctgaagtgt gaagagtgtc cagggcttca gataccctca tttcccagca agacaatcca 5821 caaactccct gggcccccta taacgagact tgccttgcaa tataccatct ttttaattct 5881 caaggagttt cagaagatga atgacccctt agaatttctt gaaggaaaga aaacttgttt 5941 taaaatacaa tgtactctta ccgatttttt actttacact gaagatgggc ttttcaaaac 6001 gggggcctca tcttcttggt atgactttaa aaggcacccc cgcctttttt tccaagctca 6061 gcttccccca cgaccagggt gacagtggcc tagtaagtag gagttggtgg gcagctgggt 6121 agctgatggt caagtatctg caagcagaca tgccacacat agcaatacca gcagggatcg 6181 agtcccactc atcagctaca cctgcaccca agagtggaaa aagtaatatt acagataaaa 6241 ggagagagac aaagaaataa gccaagccag agcgtacgta gcacatactt ggagctcact 6301 aggggaaagg tgtggtggtc acctttctaa gatattttag tagtaaggaa gtagcttgtc 6361 ttgatacacc cgaaaaggtt aaggacatag agagttgctt gttgtgtttt tttccatgat 6421 gaaattaaat gctcagtata ctaaacctgt ttcttttttc ctctgattga agatttggtt 6481 tgagaataga agagccaaat ggaggagaca tcagagggca ttaatggcaa gaaacatgct 6541 gcccttcatg gcagtgggcc agcctgtcat ggtaaccgca gctgaggcca taacggcacc 6601 cttgttcatc agcgggatga gagatgatta cttctgggac cacagccatt ccagcagcct 6661 gtgtttcccc atgccaccct ttcctcctcc gtccttgccc cttccactca tgcttcttcc 6721 acctatgcca cccgctggcc aggctgaatt tggcccattc ccttttgtta tcgtgccttc 6781 tttcacattc cccaatgtct aagggatagc ctctgtgcca ctttttgcca gagtgtcttt 6841 gagccagatt catattttgc atagcacccc atcaaaagta gttcatcaaa tgtctattaa 6901 acgttttaaa gaaaagtaca tcattgaccc atttttaggg cacttgtaaa aatgtttcta 6961 taaatatgtg aagggtatgt acatttgttt tgtgtgtcac atggggtcag taagttctca 7021 ataaaaattg ttaagaaatg ccattcaaac cgaatgtcac ggactctcgt ctcatgcagt 7081 aatcttgagt caccatctgg gggctgtgca tgtcacagaa ttttcccatg tgcccatggc 7141 aggcttacac cgacccagac attctcttcc acccccaccc ccaacacccc atactcatct 7201 cctctctgcc cccattgata ccaggtacat gtgcaggatg gggtggtgtg agtccctaat 7261 gagagaatag tgtgtttcaa gttactgcca gggtggaaaa cagcaaaacc aaaatggtat 7321 cgggaaggaa gccatgccca ctcttaaaaa ttcattaagt tttttcttgc tatggaagac 7381 ttctttaaag taattttctt gtaaaagtgt aagtgtaaat aatgccatga aattatacac 7441 ttatttaatg gctaaaatgg caaattttta tatggtttac cacaacaaaa gaaaagaaaa 7501 aatataccaa aaagttttta aaaagtgatc atgaacattg tggtcatcct gtgaagtgat 7561 tgcaatgcct gcagataagg agtgatggtt acaacagtat tttctctgaa aattatttga 7621 tggccagttt cataatgatt atgttttcag cctgaaggaa aattccattt tcttgggtga 7681 gcatgaactt tctgtcaggc tgtctgctgc ttttcattcc ccacttctct cttcacaatt 7741 gtgggtgtca agcttcagcc acgcaattgc atttggtgtg agggttttgc aaggagaagg 7801 aggtttattc atacccctga agccacaagc cttggtggga ataaggaaag tccatgaatt 7861 cactatgcat taatgcatgc ctcttggcca agtggattct tttttcttct ccgattgaga 7921 ttttcctttt tttttttttt tttttttttt gctcttgttc tcattgtatt gtgctttgta 7981 taattattta cagtaagtcg cgcatgtcag tgtacattcc gtctggaaat tgtttccatt 8041 tggtacattt tgtgccagtc ggtctattcc tgctcattat tttgtttttt ctacattcag 8101 actgaaacat ttggtagcct agagatactc agaaataggc aaagaaaggt aaaaggggag 8161 gaggggagat tgaaatcata cttccattat ccctccccgt ggttatcaga ttcataaaat 8221 ttttacacca tcaaaagaca tttttagcca tatatgtttc ctttatgtag aaaatgagat 8281 ctcctatact cagttggcat ttgttttcat gtatctgata agtacctgtt agaattatag 8341 gtcaagatgt tgtctagctc tggtaaccta gatcctgaac ttcaaagcag agttcttgga 8401 ccttaagaaa ttcaaattcg tagaattata gttaggatgt catttactga ccttgaacaa 8461 cacattgttt aggtagcata caatgggact agttcaaggg gctgggtgga gaacaagtta 8521 caattttttc tgatggagaa cattactgga gacagtaccc tttaaaactt tcattccctt 8581 tttattagca gctaacgtct tgtatgcctt tactatgtga caagcaataa aataatgtcc 8641 ttggatgatt ttgtttgatg ctcaaatgaa tcctaatagt gaggttctag catcttcatt 8701 gaaagaagag gaaacttatc catggtgcaa agctaatcag aggaacagga tctgaagcca 8761 aatttgtctg cctccagagc ccatgctttg tcaagtttaa ttcagcggtg gtgcactggg 8821 atgccacagt tagggcaatc tggcttccaa tcccagatgc accacttccc agctgtttcc 8881 ccctgggcat tttacttaag ctctgtagta tcattattat tctctataag atgataatat 8941 tatatcacag aattgatgag agggataaat aacatatgta agtggccttc cacaatgccc 9001 agccagagta agcacacaga aatgttttct actattattc tatattatct cctatcatca 9061 tattattata acctgttgtt ccacctttcc tatgtggtat aacagaaaga gatctgatta 9121 aaaataaggt ggtctgggtt cttgctgcag ctctacgctg tctttgttac cttgaatgag 9181 tcgcttaacc tactatatct ctgctgtctc atgtgttagg ccattcttgt tttgctataa 9241 ggaaatgcct gaggctgagt aatttataaa gaaaagaggt ttagttggct cacagttctg 9301 caggcttaac aggaagtgtg atgctggcat ctgcttctgg tgaggacctc aggaagctta 9361 cagtcatggt ggaagacaga ctgggagcag gcatctcaca tggtgagagc aggagcgaga 9421 gagtgggtgt gggaggtgcc acacactttt aaacaaccag atcttgtgat aactcgctca 9481 ctattgcaag gacagcatca agccatgagg tttccacccc catgacccaa acacgtccca 9541 acaggcctca cctccaacat tggggaaaat aaggattaca tttcaacatg agatttgggc 9601 agagacaaat atcccaacca tatcatctca cctatggaat catgacagta atagcatgac 9661 tcatgtattt ggagtaggtt tagtggtaaa agtttatcag gcatatttca aaaaatcaga 9721 aaaaatagga atgaatgcct tatgaaaaaa atcatttcac gcattgtttt ctagagtgtg 9781 gggagtacca ttcatgttta tatgcatgct cgaccccatt ttcccagatc aacaattact 9841 gtttgtcctt tcttgacaga gaaaaggaat caaatgaagt gaataatatg gagaggaatt 9901 ttttaaagtg agcagagtgg atatttgtaa aggcttttcc atgttttcgc tatggtagtg 9961 ataactgtgg ctgtctgtat aaaaagggtg tatcatttaa acatgccctg aaatggttaa 10021 tgatgaaatt atgtgatgtc tcgatgtgct tccaaataat cccaggttgg gtgcagggat 10081 gttggagaaa tcagcaaggc tactgatgaa acaagactgg atattaataa ttgttgaatg 10141 tgggtggtgg ctatatgagt gttcatctta tgtttgtttt atatattttg gacatttcct 10201 ataattttta aaaaaatagt aagggtcgaa aacctagctg ctaacagggc cgggaaggta 10261 acataaatga agaaaacaag cagagtgggg actggtaaac tgggaaaggc tatgtctccc 10321 ccaaagggaa tggccactac tctgcttcag cctaattttt ctggtcagga ctttggggtc 10381 cagatctggc agattttcca gtttgtaaag attggccaga aataagtgtt tacatgaaac 10441 cttctgatcc ttagatctga gcaatcaatt caaatatgtt ttatttaaaa ttacactggt 10501 catccgaaac agatctgcag accagatata ggatgcagct tccaatttgc aacctgtact 10561 tgagaaggta actttaagca aattcaaaac cgtggagcaa tagaatatga ttgaaagcaa 10621 ctctaaagcc agcaatccta ggatgccatc attccacctt attttttttc tcattataag 10681 tcctgttctc atgtgataaa agaaaaactt cagccaaatt aaagttaaag gagtttaatt 10741 gagcaatgga cgatttgcaa atcgggcagc ccccagaatg acagcagatt cacagagact 10801 ccagtgcagc catgtggtgg gagagttata gacaaaaaag ggaaactaca tacagaaatc 10861 agaagtgaga tagagaatgg ctggattggt tacagctcga cgtttgcctt atttgacatt 10921 ttcattttac caataatttt taaaactctc tttatttccc aaaaattact taagacacat 10981 gaactaaaag gcattacact ttttactttt ctgacaaaat atgttattta agcttttatt 11041 atttttaaac caattagtga aagctctttt atatataaac atcaaacaca taatacatat 11101 aaatccatag acagaagtta aaggactcat tttccaagcc aggaattgaa cgctgaaccc 11161 aggctgccat tgtgaagaga aagcatggcc acatggttac aaggtcaagc tcccaaggac 11221 atacaagaca agagggaaac cttatccagt tttttttttg ttttgttttt tgttttttcc 11281 agggacctgc agcaaagctt ataagtgatc agtttgcttg gctgtcttga acagcgggct 11341 tacaggtgtc ctaagcctgt attctatcct aaggtacccc tcattaatga cagaaaatat 11401 ggaaagacac acaaagcata ccaaatttgg tacagcttaa gactagcctt acaagtgctt 11461 tttctgatta atttaaactt tacaggagag taacagtgat ttttaccatt tattcaacct 11521 gtttgcacag agagagagag agagaggcca ggagtctgac tggtaagaaa ttgttacctg 11581 tttgccagca tgccaggctt ctgtgttccg ttgccctaag tggccctagt gacccacctc 11641 gctgcaccat agacctgggg ggccaagcca caacacaaag gaaaattatc tttttctgtt 11701 tgggccagag taaaatatgt gtgacgaaac atagacatca gctactctgc ttagcaccca 11761 atattaaact ggcaaggctt aaatttgtcc ttagatggct cccgtcatct ttaatccaac 11821 ttctgactag gaatttcaac acgtccctgg gcaagatggt caccctgagt aatagaaaag 11881 ataagaaagg gaaaggagag agagaaaagc attgcctgtg gcagggtggg gaaggtgaag 11941 agctcacaga ggccagagaa ggacccactc attcattgca gtgacactga aaatgaaaag 12001 ttcaggggac cacttgccag tagtgaaggg atcttttcca gcagttccat cagctgtcag 12061 gattcccctt ctggggagga aaaagctccc catctcccac ggtcctgcac atgcctaatc 12121 ctgtcaccca tagccgtcag caaaaagtgt aagaccgatt aatccaaaga gaatagcact 12181 taacattcca tagtgccaaa cccgtcctta gccaaaaggg attttaccaa gagccctcat 12241 ttttaaatgt atttcaatgt gttgttgttc atttggaacg ttccactgta agtacaagcg 12301 attctactgc ctcagcctcc cgagtagctg ggactacagg tgcatgccac cacgcccagc 12361 taattttttg tatttttagt agagatgggg tttcactgtg tttagccagg atggtctcgg 12421 tctcctgacc ttgtgattca cctgcctcga cctcccaaag tactgggatt acaggcttga 12481 gccaccgcac ctggcctctt gttctcttat tttctcccga acatacagaa tcgctctctc 12541 tgttctgagc caactaaagc tgggggtgga gtgacatagg tattaacacc cctgtggtca 12601 ccaccactat gactgtgctg ggtcagacct gaagccagca cagcactggg tctcacccaa 12661 ggcctgctgt aaccactccc tggctactgt ctctatgttt gcttaagacc ctgggctcta 12721 caatgagcag gtggcaaagc cagccaggcc tgtgtccttc cctccaggtc agtgagttcc 12781 cccaggcccc aggtgggtct agaagtatca tccaggagtc agggactaga gtcaaaaacc 12841 ttctaagtct acctggtgtt ctattgtatt gcagctgagc tggcactgaa accataagac 12901 ctagtccttc ctgcttttcc ctcccctttc caaaggcaaa ggagcctcac tccacagcca 12961 ctgccacccc tggccacaag gagcactgac agactaccat caatgttccg ttaaggccca 13021 aggtctctta agttagcttg tggtgaatgc tgcctggtct gggactcatc cttcagggca 13081 gtgggctccc ctctggcctg gggcaggtcc agaaatgctg tcaagtcctg gaatctggaa 13141 cctcaagaac ctgcttggtg ctctagcccc cgtggtggtg ttggtacctg aagccagcaa 13201 gtctcagagg ctcatgatgg ccctcaatgt agtgcctgtg cattgctgtt ggttattcag 13261 ggcccaaggg ctcttcagtt agcgggtgat gaatgctggc aggatggggt cctttccttc 13321 aaggcagtgg gttcccttct ggcccacagt ttgtctagaa atgtcatttg ggaactaggg 13381 ctggaacagg ggcctgttga ctctgactgg agccctatct tgctgtggct gagttggtat 13441 ccaagatgca agacaaagtc ctcccaactc ttccctctcc tctcctcaag cagaaggaag 13501 ggctctcttt tagagccaca agctgtgcat cctggggtta ggggaaaagt gatgccagca 13561 ttcccttggc tgtcctagct agtgtctcag catgttgtgt tccccgcccc gccaccccca 13621 atccactgtg tctgggccta gttcatccct aggacttacc taaaagttgc agtccttatg 13681 gcctaggctg cctttcaagt ttacttagag actgacagca ctttggccct tcatggtgag 13741 gtttgcaggt actcaagttc agactgctgg gatcagtgat tcccctctgg ctagggctgg 13801 tttaaatgct ccctctgtga gtgggcatca actgagtttg atctggtttt cctttctgct 13861 gtaacaggac attgctgagt gcaatgcttc acaattgaat tgtgaagcaa aagaactgtg 13921 ttctctcttc cccagtgccc agaaacactc tccacaccat gccatggctg ccagggtggg 13981 gaaggggtag cattggtgat tcaggattgt tttctatatc tcttcagtgc ctctttcagt 14041 gatacgaaat taaaaccagg tactgtaagt gctcacctga tttttggttt ttaagaaggt 14101 gtttttttct gtgtaggtag ttgttaactt ggtgtccttg cacagagtgg caggaggacg 14161 atcagcggag ctttctattc tgccatcttc gtctgcctcc tttcctagcc tcatttcccc 14221 ctgcttcctc ctctgccccc tgggtcctga gcacatgggt cttgagaaca caccaagctc 14281 cttcccacct tggggccttt gcactggctg ttctctctgc ctggaatgct cttcctacag 14341 gtttttgcat ggctgcctcc tttactgcat tcatgattct gctcaaatgc cctctctaag 14401 cacctgagct aaaatcccta accccgcagt ctctctactt ttttgtttgg ttattggctc 14461 tcttgtttaa tctctgtctt cctaccagag ggcaggattc agaatttaaa cagtgcccta 14521 gcacataatg agagctcagt tatcttctgt taaagataaa aatgaagcca cactttaggt 14581 gtacccgaag gccaaccact cataaccagg taaccaaaat ttaattcttc ccaatttccc 14641 caaaacactg tctctaatca taaatatcaa acataacctt tacatctttg tcagcgtgat 14701 tcagtgaaat taaaccaatc agctataggc aaatcagttt aaacagctct gtttacctta 14761 aaaagaatga taacgtaaaa cagccaacca caaaaaaaaa tcaaaatatt ctcctttatg 14821 ctttataaag tgtgctatga ctgccataag gtgagctttc taccactttg tttgaagtct 14881 cctgggtcat gagctgtact ttctcttact gtataacaat aaactttaaa atgtttccta 14941 acttgatctg attctcattt tgacacttcc aataatcttg agaaatgctc ctgagcactt 15001 tgggtctggt attcttagag agggcaaaca ttctcatttt tcaaagggga agttgaggcc 15061 cagagagaga cagtgactgc ctgaagtcat aaagttccag cactctcttt ccttactccc 15121 ttgttctcct ggtcttgggg acatcagaca tctttaaaca tttggtctca ccatagaaac 15181 tttgagatcc agtgcattag ggttccctag agggacagaa caaataggat atagatatag 15241 atatatagat atatatgtag gagatataga tatatatatg catatgtagg atatatatat 15301 gtatatgtag gatatatata tgtatatata tacagaactt atatatatat atacagaata 15361 tatatatata tacagaacta ataggatata tatatatcct ccatatatat atatcctcca 15421 tatatcctcc atatatatat atcctccata tatatatatc ctccatatat atatatcctc 15481 catatatata tatcctccat atatatatcc tccatatata tatatatcct ccatatatat 15541 atatatatat cctccatata tatatatata tatcctccat gtatatatat atggagttaa 15601 ttaagcatta acttacatga tcacgaggtc ctacagtagg ctgtctgcaa gcttgaggag 15661 caaagagatc cggtctgagt ctcaaaactg aagaacttgg gagtccgatg ttcaagggca 15721 ggaagtgtcc agcatgggag aaagaggtag gctgggaggc taggccagtc tctccttttc 15781 acatttttct gcctgcttca tattcactgg cagctgatta gattgtgccc accagattaa 15841 gggtggattt gccctgtgca gcccactgac tcaaatgtta atctcttttg gcaacaccct 15901 cagagaacac ctaggatcaa tattttgtat ccttctatct aatcaagttg acactcagta 15961 ttaactgtca tacccagagt ggctgttact tcccaaaaat attcagtctg taattgataa 16021 agtggggttt ttatcctggt atgtcagact tgtggctgag actttctgac acttgtgagg 16081 ctggaaggga aagaggcagt ggataaagtg ggccttggct tgttctcata ggttgcaatt 16141 aacatctgaa gcacttgctt tcaacctgaa gaacttcatt atttcttgtg aggtggatct 16201 actaacaaca aatcctccca tttttattta actgggaatg tctttgcctt catttctgaa 16261 aggtagcttt gctggatgta ggattcttgg ttgatagttt tttctttaag cattttgagt 16321 atttaatctt actgcctcct ggccttcatt gtttctgctg agaagtcaac taccaatctt 16381 actggggtaa gtgatgagac atttttctct tgccactttc aagattttct ccttgatctt 16441 agccggtttt actatgatgt aatctgtttg tggattcttt acatatatcc ttcttagaat 16501 tcactgagct tcctgagtgt gtaggttatt gtttttaaaa taaatttggg aaattttctg 16561 ccattatttc tttgactatt ttttctgctc ctctctatct cttcctttca tctcctctct 16621 atctcttcct ttcttctagt agatcccact taggtttgtg tggctgatgg tgtcctacat 16681 ttgtctgaag ctctatttat ttttcttcac tctttcttct ctctagtctt catcttgcat 16741 aatcactatc aatcctgttt caaatctgct aattctttct tctgccagtt taaatgtgct 16801 attgaacctg ctagtgattt tttcatgtca gttattgtac ttttcagatt cagaatttcc 16861 atttggttct tttaaaataa tttctatctc tttaatgata ttttctactg gatgcaacat 16921 tgttatcata cctttatttt ctgaatcatg ctttctttta gttcagtaaa catattattt 16981 tttttctctg tagagatgga ggtcttgcta tgctgaccag gctggtctca aactcctggc 17041 ctcaggcaat cctcccgctt cagctttcca aagtgctggg attacaggta tgagccacca 17101 cacctggcct gcaaacatat ttataatggg tattttgaag tctttctctg ttaaatccat 17161 catgtggttg ctctcacagg cagttggtgt tgtctgcttt ttttctgctg tatggggcat 17221 actttcctgt ttctttgcag gtctgtattt gtctgttctc acattgttat aaggaaatag 17281 ccgagagtgg gtaatttata aaggaaagag gtttaattga ctcacagttc agcatggctg 17341 gggaggcctc aggaaacata caatcatggc agaaggtgaa ggggaagcaa ggcaccttct 17401 tcacaaggca gcaggaagga gaagtgcaag caggagaaat gctagacgct tataaaacca 17461 tcagatcatg tgggactcac tcactatcac gaggacagca tgggggaaac cgcccccatg 17521 atccaattac ctccacctgg tcccatcctt gacacatggg gattatgggg attaagggga 17581 ttacaattca agatgagatt ttgggtggag acatagacaa accatatcac atgcctcata 17641 aactttttgc tggaaattga aatcatttct gagacatggt cccactctgt cacccaggct 17701 ggagtgcagt ggtgcaatca tggctcactg cagcctcgac ctcctaggtt caagtgatcc 17761 ttccacctca tcctcctgag tagctgggac tatcagcgtg caccattcca cctggctaat 17821 tttttttatt acttttgtag agatgggggg tctctctgta ttgcccaggc tgttcttgat 17881 ctcctgggct caagtgattc ttccaccttg gcctctcaaa gtgctgggat tacaggtgtg 17941 aagcattatg cttggccaca aaatttttga taatatgttg tagcaactgc ccccaaccct 18001 ggcctctggg acttgttatt tgctggtata ttttttttag tgattggctg gattatttta 18061 atgaagctta ttccttctcc tcatagtctt aagcctctga tgttgctctt caggaagaca 18121 tgactttggg tgtgtacacc gtcactctag gatggcagtg gtgttagtag ggctctctat 18181 ctttccctta ccatgcccaa ctattaaact ccactaattg cctgctgatc attctattgt 18241 tttcagcaat gctctaggac ataaattgtt ctacaaacta attcaattaa attgtggttt 18301 atttgaagga atagtttttg aggtccacgt ctgatatttg ttctgatacc aggagtgctc 18361 ttaccagcca tcttatttcc tggttttctc ctgaaaacta tccagcttac aggccatgct 18421 ttatcttcat tagatccaca aatctcaact gccttgtatg acaacgtcca ctgttcttga 18481 gagcactctt agctttgaac tttacactct gttgcaaatg aagtcaatcc ctttgggaag 18541 agattaggag ctacctgttt tgtagcctgt tcctcctcca aggcaaaatc tttgagcaag 18601 agctctagag acaaggtggg gacggtggca agcttctgcc tgaatggcaa cccctttcta 18661 tgggctgagg ccttggcaga gtggggtgca gcagcctaag gtgctctcgg cttgcctctt 18721 cagcatgtaa ccaccacctc atgagcgagg caaggaaaac ttgagccgcg gtttcctcag 18781 tgtgttgcat ctaaggtaga gcctccatta aataactggg ggacagagtc ggcagattaa 18841 atgagccacc atcactcagc tgtactcacc tgatacttag cctcagaaac aagtagctgg 18901 tgtcagtatg aatgatgctg aagtgctgct cctcttggga agaaagtcct ctggctggga 18961 gccagagggg agagggagcc ctatgctctt gtctgcagca gtctgaagta gaatctctgc 19021 cttactgatc tgggagggga aaaggaagaa gctgtagtga ttcaaatacc acagacttgc 19081 ttttcttatt gaatttttgt aggttctctt agaaagacgt ttcttcattt gctgttttcc 19141 cttaggacca tttccagggg ctttaagttg ttgttttaaa aataatattt accactttca 19201 cttgggagtg catcagcaga gttcctcaag ctgtcatgct ggaagttgaa ctccgtgttt 19261 ggtacttttt tcatttgcac agtgatcttg ttttttctat gcctagacaa ggttatgaag 19321 tggtattttt tttctcactt cgtcaagatc acagagtgat tttgtctggc agtagtgctc 19381 tgtgagactg tttatgttca acaggaaaac atcaagacct agttgtgcgt accaggccag 19441 tgctggctgt cagggactgc gatatggttt ggatttgtgt ccctacccaa atctgatgtc 19501 gaattggagg aggggcctga tgggaggtga ttatatcatg ggggaggatt tccccattgc 19561 tgttctcatg acagtgagtg agttctcata atatctgatg gcttaaaagt gtgtggcact 19621 ttccccctta attttttgct ctcctgtcac catgaaaaga tatgccttgc ttccccttca 19681 ccttttgcca tgattgtaag tttcctgaag cctcccagtc atgcttcccg ttaagcctgt 19741 ggaactgtga gtgaattaaa cctttttttc tttataaatt acccagtctc aggtagttct 19801 ttatagtatg agaacggact agtacagact gcttatcacc ttctcacagt gtagaacaat 19861 cacttcgtaa ttattctgtc taaggggcaa ggaagcttgg gcatttatcc accaactccc 19921 aataatcatt ggttgagggc tgctcctggg ggcatttatt ccccagcctt ccctgttcag 19981 gcagaggggc ttcagacccc agaggaagca tcaagctgtt gcaaattggg ccaagtacat 20041 atggctgaga tctagtagga cacagacaac atctgctata gatgacctgc caaagatcct 20101 gaaagggcca ggatttgagc tcagtgttat ctgatgacca tgtcttagcc accacttgat 20161 ttcgtgaatg gagaggtttg aaaagaacag cagaactttg ctagaggggt tgtcaagatt 20221 acttgcctga acagtttctc atgtatatat tacacagact gttggagata cctgctctgt 20281 tcccagctcc tggctagacg atggggtcac aaagatgacc caggcatgtt ttctgctctt 20341 atggtacaat ctgtctgttg agggtgggat atctgaaggg acaagcccac acttcgtccc 20401 cagtcccacc tgtgttggct agggagtcct tcctggagga tgtgttgcgt aaacaatgga 20461 gagaatttag ggaacatcga gcctactctt ttcactttac agatgaagga acacaggccc 20521 acagcaaggg agtggtctgc aaaagagaac atatagactg caccctggag aaggccagct 20581 tctaagtggc catgcccttc cttagctccc cagtcaccca gatacaggaa gtgctcacta 20641 tatcagattc agctcctcag cacttaccaa gtccatcctg cccagaatgc tttccttcaa 20701 aggcctgcag gctcaaggct ctgtcttaag cctgagcctt tttacctcta aagaaagtaa 20761 tcaacctcct ccaaaggcat tcagtgtcct ggtatgggag aatatgtgtg tataccaggg 20821 tctgggaatt ttgagaaacc atcttaaggt ttggtgagtt gctttttctc cttatgtgaa 20881 tttatacagt cctctatgga gttctggagg gaaaatgaac caaacaaaat attttttttg 20941 agacagggtc tcactctatc tcttaggctg gggtgcagtg gcacaatctt ggctcactgt 21001 agccttgacc tcacaggctc aggtgatcct tccacctcag cctcctgagt agctcggact 21061 acaggcatgc agcaccacac tcacctagtt tctgtattct ttgtagagat aggattttgc 21121 catgttgccc aggttgatct caaactcctg ggctcaagtg atcctcttgc ctctgcctcc 21181 caaagtgctg ggattacagg tgtgtgctac tgcacccagc cgaagatttt ttttaaagtg 21241 aaaacaaagt atgtttttat ctatttccaa actatagcat gaatttcccc aagaaacttt 21301 tagcagatgc tctcctccct tttgcaaata tactaaaact attttatgaa aacagtattg 21361 ggaatatgga cccaaaacac ggttatcctg atgtagagat ttggccaatg gtacatttga 21421 attccactaa aatgaactct gttatactga cctgcttttg actatatcct ttatacagca 21481 ctgtagaggg ctttctgcag gcccagagca atggagtggc ctacgtaaga gaacacacat 21541 agctcagctt agaagaggcc agccgctgaa tggccatgcc ctcccctggg ccagagatca 21601 ttcaattaag agaaacacac accagcccaa gttcagctct tcagcaccta ccacattatc 21661 tagcccagaa ctatttcact taaagggttg aagattcaag tcccttcccc aatcatggca 21721 ctgggccatt ttacctctag gaatagtaaa caacctcccc taaaggcatt cagtatcttg 21781 gtgtccattc ttacaggccc ttgagggagt gatacaactt gtaccctcct cacaactctc 21841 atttctccca atcttcttgc aaatctgtgc ctgaagctgg aagcacaaca catgaacaat 21901 aactagcttg gtgcttggtg ttgaattaac ttttttaaga aagcaggcta acaggttgga 21961 aattgttctg tcaatatttt agaatctagt aatacaaaag accccttcta aaagcacatt 22021 ctgttgtgtt acacaagcaa ggggacaaag gtgatttcag aggtagcatt caggtgtgag 22081 catcataatt gatacatttg gacccctcta aagcacaggc cacagacaaa actagggcct 22141 ctatgagaac tgcagagttg gattatgatt ttttatttca agccgaagaa tcttggttta 22201 ggcaaaataa tggagggaag gcagagaaca aggaagagtc tctgtccatt ccctttgaac 22261 gagagaaaca aaaaaattaa acaaaggatg ttttaaaatc aaaaggtgaa agagtctgtg 22321 gagacatagg aggaaagaaa tccatggcaa tggctgagcc tggagctgtt gttgaggtgt 22381 ggggagcttg acgataaaca gcggctagca gctaggtact gcctctggaa gatagccagt 22441 ggggcacaag gaggtagctc tggccagctg ggctttacca taaaaaaaaa aaggaaaaat 22501 agaactctgg tccctatata aatcagtccc taacatctcc ttccacagcc ctcatcacga 22561 ggccaagcag taagaccaat ttctctttac ctctgatccc tatcctccct ttaactcatc 22621 tggtcacatt aaggggtttc agtgggattg aaattattta taaggaagaa aggaaagtaa 22681 agactgagtg gaggtttcac aatctgattt tccctggccc aggtccccaa ccccagcacc 22741 tcaaggagca caaaatatga ttctattcat tttaagcaca aagaatttcc aaggatttct 22801 cagaaattcc acaggcctac ccgccacatc aaaaaggtca actccaaaag tgagtccgac 22861 agaaatattg taagatgcta aatcgtctct tctatttatt taacatacag acttagattt 22921 aggagcctct tcttggctca acccgttagg tgaacacatt ctatagtatc aaaattttgc 22981 ataactctcc actaagtaaa tgtttagtcc attcattctg aactggaagg aaggagacta 23041 gctgccagct gtgatcctag tcaacatgca ggatgtgatg aaatgaggaa gtcttcttca 23101 tggtcatcga gttactgtcc aagagtagca accacagcca acgacagaga tcaaatttca 23161 ttcatctcac tctgctcctt ctcaccattc tttcccctat gaaccatagg aaagactcaa 23221 tccataggca aaataaaaat gatactctga ataactcttt taacaccctc ctgatattta 23281 ggcaccatgc ccaaactgtc ttgttccctc tcacaattgg tagcattcta atattgtaca 23341 tcaatatctc agttaaatag aataacacta ggagggaact tgaattaatc atgattgcat 23401 acttattgca ggaaaagata aataaccaag aaagccactt ttcctctgct gaatcttttt 23461 ctttgcttag ctgttttttc catatgtgtt gctttatata actatttaga gaaagtctca 23521 gggatcagca cacattcaat ctaatagcct aagtgttttt cagctaacgc catggcaata 23581 aagtttgctc tttggagtga tggctcttag acactgtatt agtctgttct cacactgcta 23641 tgaagaaatg cccaagcccg ggtaatttat aaaggaagga ggtttaattg actcactgtt 23701 ctgcatggct ggggaggcct caggaaactt acaatcgtgg cagaaggcaa aggagaagca 23761 ggcactttct tcacagggca gcaagatgga ggaagtgcaa gcaggggaaa tgccagatgt 23821 ttataaaacc atcagatctt gcgagactca ttcacaatca tgaggacgga atgggggaaa 23881 ccacccccgt gatccagtta cctccacctg gtcctgccct tgacacattg ggattatggg 23941 gattacaatt caagataaga ttttggatgg ggacacagcc aaaccatatt agacactaca 24001 aactcacagt gaatctctgg aatataccat gctcatttct gactatcttt gcttgtgctg 24061 ttcctatact cctttcctca tctagccaaa tcattcatgt ctgttttcct ctcacatttt 24121 ctgcctcctt tcacccatct tcttcttttt ccttctgttt ctcaacctca cctgcccaat 24181 tgtctcccct tccagacccc ctgtagacct gaagtctcaa tcgctaatta agaggcaaca 24241 gtcatgactt atggcctctc actgaaacat aactgttttg tgggtgtaat tcttatgttc 24301 cctacctgca tggtaagcac ttgcattttc agactgggtc tcctactctt cttgtttcct 24361 tcacggaagc tagcagtggt gctttatgtg taatttttaa aagggcaatg attatcgtaa 24421 cagctaaaat ttttggaagg cttaatgtgt gacaagtact gtgtgtgaat taagcttatt 24481 gctatttcac aaatgaagaa ataaagacag agagagatta aataatctgt ccaaatcaca 24541 gctgacaagt ggctataatg gaattcattg accaaacatc ccattctgtt cagtagactc 24601 aaatgtgagg ggaaaataga catacataat ctcacatttt ttttcaacag aaaataagga 24661 aataagctgt gcacagtggc tcatgcctgt aatcccagca ctttgggagg ctgaggtggg 24721 gggattgctt gaggccaaga gtttgagacc agcctgggca acatagtgag accccatctc 24781 taggaaaaaa atagctgggt gtggtggcat gcacctgtat gtagtcccag ctctttggga 24841 ggctgagata ggaggattgc ttgagcccag gagttcaaag ctgcagtgag ctatgatcag 24901 accgctgcac tccagcctgg gtaacagagc aggaccccat ctctaaatta gttaattaaa 24961 ataaaaagaa aatgaggaaa taaacaatga catgggattc ttgttcaatc tcatttcttt 25021 aaaacagtcc attaaccaga ctaaatctaa cccttgggta ttatgctgaa ttaaggattc 25081 attttaatat ataagtacac aatatttctg gatacaaatt aaacagaata gactgctggg 25141 tatgtactag atatttttct ttcattcttc tatttgctca ttagattccc ttttctgtgc 25201 tgccattttt ttctttataa tcattaaccc atccatttat tgtctgttag gaagtaaggc 25261 aacatatatg caatgaatat aataagaaag gaaaactgaa aaaatgttgt tttagaaaat 25321 aacgttaata gtgtcacttg aacccagaag ttcgaggctg taataagcta tgatcgcgct 25381 cctgcactcc agcatgggca acagagtgag atcctctctc ttaaaaaaaa aaaaaaagtt 25441 aactgtggat taagacatct ttttcaaaag ataaagccat tttcctttgc ccgggtgaat 25501 gctaagcagt ttgtgaactc acagtatcac tgggatttgg aagtctcacg tgttaaaaag 25561 taagacatga tcaaagcttg aatacagcca taggtctgct tgtttgaatc ccatcattcc 25621 aagatataca tattttgttt agtgtttcag tatattgata tggtttatgt aacagtatat 25681 atacaactac tttgtggagt atttcctgca atttcatgta ctcgcattca gaagcaatca 25741 gaagccattc ttcttttttc aaaacaagga aaacaatatt tcttccatca ttctacatat 25801 atatttggca cctgttaatg tcatatatgc caacagctac ttcagacatc tctttttgat 25861 tgactagtat ttcatttttc agtcgtcagt cttaaagatt aatgttttgt ctttctaata 25921 aatatgtagt taagtgaatt ccatacttat ttttagatcg accgtgtgtt ttcttgctta 25981 tctttctgtg ttaattgaaa ttctgttaaa cctagaaaat tacttggagt atgtgttcag 26041 atgacgtggg gcaggcaagc ccccagactg gggcttagcc tgggagagtt cttggctttg 26101 cccaggaaag aattcaaggg tgacttggtg gtattaaaca gcaatctttt acttaactgg 26161 agctgttcct tgtgaagcag ggctaactca ttgacatagt gcccagagcc acatttgtgg 26221 gctgttggca attgtattta tatccactta tggtccccta cccaccttaa atttggagag 26281 cctcatgtaa aatcagaaac aagcagggga ctagattccc ttattgtcct ctttatgccc 26341 atattgtttc ataattataa aatgttagac aggaacatag ggtccttgga tggaaggata 26401 ccacataaat atattcagat agatgaaagg cagaactctt tgttacttac agctccaaat 26461 gagagaaggc tgccaggtag agccacacct gaagttgtaa cgcaggatag agctatacct 26521 gtaggaggca gtccaggtat ggcaagggag gtttcatggg ctccctgtgg atttgttaat 26581 ttgaaactta ggcaaaaggg ctgtccctag ttgtctggta cctatccctg tggtgattag 26641 ggcaggtaca tagttgccag gaatgtgaga gccccataag ggaaatggtt gagatgtgga 26701 tttaatcagc tgctcaagaa gaggaactga ctagcctcta gccagggcct caaaattggg 26761 tcaagatggc actgaagaaa acaaaaccca caactggtgg agccaagatg gccgaatagg 26821 aacagctcca gtctacagct cccagcgtga gcgacgcaga agatgggtga tttctgcatt 26881 cccaactgag gtactggttt catctcactg gagagtgtca gacagtgggt acaggacagt 26941 gggtgcagcg caccgagtat gagccaaagc agggtgaggc atcgtcgcct cacccgggaa 27001 gcgcaagggg tcggggaatt ccctttccta gtcaaagaaa ggggtgacag atggcacctg 27061 gaaaatcagg tcactcccac cctaatactg cgcttttcca atggtcttag caaacagcac 27121 accaggagat tatgtcccac gcctgactcg gagggtccta cgcccacgga gcctcgctca 27181 ttgctagcac agcagtctga gatcaaactg caaggcggca gcgaggctgg gggaggggcg 27241 cctgccattg ctgaggctcg agtaggtaaa caaagcggcc aggaagctcg aactgggtgg 27301 atcccaccgc agctcaagga ggcccacctg cctctgtaga ctccacctct gggggcaggg 27361 catagccaaa cagaaggcag cagaaacctc tgcagactta aatgtcgctg tctgacagct 27421 ttgaagagag tagtggttct cccagcatgc agcttgagat cggagaacgg acagactgcc 27481 tcctcaagtg ggtccctgac ctccgagtag cctaactggg aggcaccccc cagtaggggc 27541 agactgacac ctcacacggc cgggtactcc tctgagatga aacttccaga ggaattatca 27601 ggcagcaaca ttttctgttc accaatatcc gctgttctgg agcctctgct gctgataccc 27661 aggcaaacag ggtctggagt ggacatccag caaactccaa aagacctgca gctgagggtc 27721 ctgactgtta gaaggaaaac caacaaacag aaaggacatc cacaccaaaa ccccatctgt 27781 acgtcaccat catcaaagac caaaggtaga taaaaccaca aagatgggga aaaaacagag 27841 cagaaaaact ggaaactcta aaaatcagag cgcctctcct cctccaaagg aacgcagctc 27901 ctcaccagca atggaacaaa gctggatgga gaatgacttt gatgacttga gagaagaagg 27961 cttcagacaa tcaaactact ctgagctaaa ggaggaagtt cgaacccatg gcaaagaagt 28021 taaaaacctt gaaaaaaaat tagacgaatg gctaactaga ataaccaatg cagagaagtc 28081 cttaaaggac ctgatggagc tgaaaaccac agcccgagaa ctacatgatg aatgcacaag 28141 cctcagtagc tgattccatc aactggaaga aagggtatca gtgacggaag atcaaatgaa 28201 tgaaatgaag cgagaagaga agtttagaga aaaaagaata aaaagaaatg aacaaagcct 28261 ccaagaaata tgggactatg tgaaaagacc aaatctacac ctgattggtg tacctgaaag 28321 tgacagggag aatggaacca agttggaaaa cactctgcag gatattatcc aagagaactt 28381 ccccaatcta gtaaggcagg ccaacattca aattcaggaa atacagagaa cgccacaaag 28441 atactcctcg agaagagcaa ctccaagaca cataattgtc agattcacca aagttgaaat 28501 gaaggaaaaa atgttaaggg cagccagaga gaaaggtcgg gttacccaca aagggaagcc 28561 catcagacta acagctgatc tcttggcaga aactctacaa gccagaagag agtgggggcc 28621 aatattcaac attcttaaag aaaagaattt tcaacccaga atttcatatc cagccaaact 28681 aagcttcata ggtgaaggag aaataaaatc ctttacagac aagcaaaatg ctgagagatt 28741 ttgtcaccac caggcctgcc ctaaaagagc tcctgaagga agcactaaac atggaaagga 28801 aaaaccagta ccagccactg caaaaacatg ccaaattgta aagaccatcg aggctaggaa 28861 gaaactgcat caactaatga gcaaaataac cagctaacat cataatgaca ggatcaaatt 28921 cacacataac aatattaacc ttaaatgtaa atgggctaaa tgctccaatt aaaagacaca 28981 gactggcaaa ttggataaag agcacaccca tcagtgtgct gtattcagga aacccatctc 29041 acgtgcagag acacacatag gctcaaaata aagggatgga ggaagatcta ccaagcaaat 29101 ggaaaacaaa aaaaggcagg ggttgcaacc ctattctctg ataaaacaga ctttaaacca 29161 acaaagatca aaagagacaa agaaggccat tacataatgg taaagggatc aattcaacaa 29221 gagagctaac tatcctaaat atatatgcac ccaatacggg agcacccaga ttcataaagc 29281 aagtccttag agaccgacaa agagacttag actccaagac tttaacaccc cactgtcaac 29341 attagacaga tcaacgagac agaaagttaa caaggatacc caggaattga actcagctct 29401 gcaccaagcg gacctaatag acatctacag aactctccac cccaaatcaa cagaatatac 29461 attcttttca gcaccacacc acacctattc caaaattgac cacatagttg gaagtaaagc 29521 actcctcagc aaatgtaaaa gaacagaaat tataacaaac tgtctctcag accacagcgc 29581 aatcaaacta gaactcagga ttaagaaact cactcgaaac cactcaacta catggaaact 29641 gaacaacctg cttctgaatg actactgggt acataatgaa atgaaggcag aaataaagat 29701 gttctttgaa accaacgaga acaaagacaa aacataccag aatctctggg acacattcaa 29761 aacagtgtgt agagggaaat ttatagcact aaatgcccac aagagaaagc aggaaagatc 29821 taaaattgac accctaacat cacaattaaa aggacttgag aagcaagagc aaacacattc 29881 aaaagctagc agaaggcaag aaataactaa gatcagagca gaactgaagg aaatagagac 29941 acaaaaaacc cttcaaaaaa tcaacgaatc caggagctgg ttttttgaaa agatcaacaa 30001 aattgataga ccgctagcaa gactaataaa gaataaaaga gaggagaatc aaatagacgc 30061 aataaaaaat gataaagggg atatcaccac tgatcccaca gaaatacaaa ctgccatcag 30121 cgaatactat aaacacctct acgcaaataa actggaaaat ctagaagaaa tggataaatt 30181 cctcgacacc tacaccctcc caagactaaa ccaggaagaa gttgaatctc tgaatagacc 30241 aataacaggc tctgaaattg aggcaataat taatagctta ccaaccaaaa aaagtccagg 30301 accagatgga ttcatggccg aattctacca gagggacaag gaggagctgg taccattcct 30361 tctgaaactg ttccaatcaa tagaaaaaga gggaatcctc cctaactcat tttatgaggc 30421 cagcatcatg ctgataccaa agcctggcag agacacaaca aaaaaagaga attttagacc 30481 aatatccctg atgaacatcg atgcaaaaat cctcaataaa atactggcaa accgaatcca 30541 gcagcacatc aaaaagctta tccaccatga tcaagtgggc ttcatccctg ggatgcaagg 30601 ctggttcaac ttacgcaaat cactaaacgt aatccagcat ataaacagaa ccaatgacaa 30661 aaaccacatg attatctcaa tagatgcaga aaaggccttt gacaaaattc aacagccctt 30721 catgctaaaa actctcaata aattaggtat tgatgggatg tatctcaaaa taataagagc 30781 tatctatgac agacccacag ccaatatcat actgaatggg caaaaactgg aagcattccc 30841 tttgaaaact ggcacaagac agggatgccc tctctcacca ctcctattca atatagtgtt 30901 ggaagttctg gccagggcaa tcaggcagga gaaggaaata aagggtattc aattaggaaa 30961 agaggaagtc aaattgtccc tgtttgcaga tgacatgact gtatatttag aaaacgccat 31021 cgtctcagcc ccaaatctcc ttaagctgat aggcaacttc agcaaagtct caggatacaa 31081 aatcaatgtg caaaaatcac aagcattctt atacactaat aacagacaaa cagagagcca 31141 aatcatgagt gaactcccat tcagaattgc ttcaaagaga ataaaatacc taggaatcca 31201 acttacaagg gatgtgaagg acctcttcaa gcagaactac aaaccactgc tcaatgaaat 31261 aaaagaggat acaaacaaat ggaagaacat tccatgctca tgggtaggaa gaatcaatat 31321 cgtgaaaatg gccatactgc ccaaggtaat ttatagattc aatgccatcc ccatcaagct 31381 actaatgact ttcttcacag aattggaaaa aactacttta aagttcatat ggaaccaaaa 31441 aagagcttgc attgccaagt caatcctaag ccaaaagaac aaagctggag gcatcatgct 31501 acctgacttc aaactatact acaaggctac agtaatcaaa acagcatggt actggtacca 31561 aaacagagat atagaccaat ggaacagaac agagtcctca gaaataatgc cgcttatcta 31621 caactatctg atctttgaca aacctgacaa aaacaagaaa tggggaaagg attccctatt 31681 taataaatgg tgctgggaaa actggcttgc catatgtaga aagctgaaac tggatccctt 31741 ccttacgcct tatataaaaa ttaattcaag atggattaaa gacttaaatg ttagacctaa 31801 aaccataaaa accctagaag aaaaccaggc aataccattc aggacatagg catggtcaag 31861 gacttcatgt ctaaaacacc aaaagcaatg gcaacgaaag ccaaaattga caaatgggat 31921 ctaattaaac taaagagctt ctgcacagca aaagaaacta ccatcagagt gaacaggcaa 31981 cctacagaat gggagaaaat ttttgcaatc tactcatctg acaaagggct aatatccaga 32041 atctacaatg aactcaaaca aatttacaag aaaaaaacaa cccatcaaaa agtgggcaaa 32101 ggatatgaac agacacttct tgaaagaaga catttatgca gccaaaagac acatgaaaaa 32161 atgctcatca tccctggcca tcagagaaat gcaaatgaaa accacaacga gataccatct 32221 cacaccagtt agaatggcga tcattaaaaa gtcaggaaac aacaggtgct ggagaggatg 32281 tggagaaata ggaacagttt tacactgttg gtgagactgt aaactagttc aaccagtgtg 32341 gaagacagtg tggcgattcc tcagggatct agaactagaa ataccatttg acccagccat 32401 cccattactg ggtgtataca caaaggattg taagtcatgc tgctataaag atacatgcac 32461 acgtatgttt attgtggcac tattcacaat agcaaagact tggaaccaac ccaaatgtcc 32521 aacaatgata gactggatta agaaaacgtg gcacatatac accatggaat actatgcagc 32581 cataaaaaat gatgagttca cgtcctttgt agggacatgg ataaagctgg aaaccatcat 32641 tctcagcaaa ctatcacaag gccaaaaacc aaacaccgca tattctcact cataggtggg 32701 aattgaacaa tgagaacaca tggacacagg aaggggaaca tcacacaccg gggcctgttg 32761 tggggtgggg ggagggggga gggatagcat taggagatat accgaatatt aaatgacgag 32821 ttaatgggtg cagcacacca acatggcaca tgtatacata tgtaacaaac ctgcatgttg 32881 tgcacatgta ccctagaact taaagtacaa taaaaaaaaa aaaagaaaaa agcccccaca 32941 attgtattac acacacacac acctcttcac attggtggga ggtcccctag gactggggca 33001 gggctttgtc agcatacgtg aaatagcttc catgattccc tttgctttgg gtggttgcac 33061 tgtggtggca gcagtctggg agccaagaca acagagatgt ttccactctc agagagagcc 33121 ccgttctctc aggaaggcaa agtttccagt gtcccactca gtggccatga gggtaggtct 33181 ggagcatggg gttgggatgg gggtgggggg acactcatac gcttgtcctt agttcattga 33241 ctggaatggt cttgggcgtc agctggatgg cttcagctaa gttcctgttg ctgtagctgt 33301 gggttgggga tgggagagag ggtcatctac tgggggaaaa cagtctcctg aaactgccca 33361 ttccagactg gggaaagacc gaccgatggg agaggatatg tctgaaggca cagaacatta 33421 tagaactggc tctccatagc ccttatgcca tgcccattca ctcagagtaa attgtagtaa 33481 actctgcaaa aacaacaaag tgttttgtaa atcagttaat ttcttaagct tgcatgaact 33541 agtcattggc aacttgattg ctccaagaat ggctcacttg ttcacttggt cattctgtga 33601 cttggatttt ggcaaaatgg ctcttgatga attaaccagg agtcaaaatc atgaccgcca 33661 acaccaagca attaaggtga agtctttgtt catgggaaga ccctctttag aactttctcc 33721 atggagccct gtgttccaac gggagaagga aaatactagg attttatcct tacatatctg 33781 tggggacaaa tcattcttga tcctcccaat gaccttttct agctcttttt ttatctgtac 33841 cttcctaatg agatgctagg taggcagaat gggcagaaag agctgggagc atggtcaagt 33901 gtggggtctt ggtgatgaat ggggagaacc actgtaggta cttttttttg tagacagtcc 33961 ccattcctaa caaactctga tacatgaatg atcagatcct tgtttggagt accagcagca 34021 gtagctgaag aagcctcaga agtctttaag cagctcttac ccagggttcc cagaaagaat 34081 gcttatgaat gctgttatag caacaatttc aaccaatggg aagtaaagcc caaatatttc 34141 tgcccgtttc agttcggtca aggaggagga aatgggagag agtgagggaa ggaggaacat 34201 caaatttatc tgacctcaat aacaggacca gtctgtagta tccatactca gctccttccc 34261 agcaagagat gaactcatag ggctgtgcaa ggaggaagag tgacaataag ccttacagct 34321 ggtatgtatt cggaggttct tatgtgccag ttactgtgta aatactcggt atatgttgtc 34381 atttactcct cacaacagat attagtgttt tcatcttaaa tttcaggaaa cagacacaat 34441 gctcattttc ccaaggctca aactcagagg tgtctgtgtt ttacttggct attcctaaat 34501 tctgagatca gggcctccct tgtgggccta aaggttactc ttgtctttta tgaaagagag 34561 agacagtata ttatagtatg ctaactataa tggttatcag cagaggcaga agcaatttac 34621 atattgtttc tttttcttca atttccttgc ctataaaatg gggataacaa gcttactgaa 34681 caggattttt atgaggattg aatcagatat ttcattgaaa gactgagagc agaacctgcc 34741 acattgtaat ctttctttac aaggtttaga tattatttct gttattaaat attttgagag 34801 aggattgtaa ccacctgatg ggttcttcct gcctgctgca caaatgaaga ccatggcatg 34861 gtagtaaata aaagaattta attgatgcaa ggctggccac gccacatggg agatagaatt 34921 gttactcaaa tcaatcttct tgagcattta ggggtagggt tttccaaaga tagttttagg 34981 gagggagtta gggtggctaa gcaatgggtg cttgctgctg attggttggg ggtgcaatca 35041 taagggtgca ggaatggtcc ttctgcccac tgaatatctt ctgggtgggg ccacaggagt 35101 ggctggcagg tccaggtgaa gccattggtg tcagacagac aaacaaatct gaaaagatat 35161 ctcaaaaggc cagtcttagg ttctacaata gtaatgttat ctggaggagt aattggggaa 35221 gtagcatatc ttgtgacctc cagaacaata gctggcaatc gtttatgtct acaccttagc 35281 agaatttagg ctcctctatc cccctagcct ggtggtctct cattagcttt acaaaggcag 35341 ttgaattttg ggaaagggct attatcattt aaagtataaa ctaaatgtct ctccaagtta 35401 gcttggccta acccaggaat aattaggggc agcttgaagg ccaaaggcaa gatgggactt 35461 tggcatgatc agatctcttt cactgctata attttctcag tgttgtaatt tttgcaaaag 35521 ccgtttcagc ctcactctgt cacccaggct ggagtgcagt ggtgcaatca cagctcactg 35581 cagcctcaac ctcccaggct caagtactcc tcctgcctta gcctcccaag tagctgggac 35641 cacaggcaca caccaccaca cccagctaac ttttttatta tgtgtggaga ctagttcttc 35701 ctatgctgcc caggctggtc tcaaactcct gacctcaagt gatcctcctg cctcggcctc 35761 ccaaagtgct gggattacag gtgtgagcaa ccatgttcag tctactatta ttttaatagt 35821 agtgtaaatg gtgttgtttt aaatgtcctt aattagtgaa ataaaaagat aataacctat 35881 gtcaggaccc aattttatta actttattag tccctgacat ctccaagtat cagaacccct 35941 gcctttaacc actatgcttt gtatgtggag tagtttccca ccttaagaca attggtctta 36001 attgcaatac agaaagggaa gtgcttcgtt aaatgtcctc tgtatccagg aagaaatact 36061 aagcttgctg cctggaaaac actacacaaa tctaggaccc atatgtaaac atcatgctgc 36121 tgagatggca tggaggggtc agggcactgg ggtgggagga ggaagtgtaa ctgcccaagg 36181 ggtttacctt gcccactgcc taggcagagc cgatttatga agacagggga attgcaatag 36241 agaaagagta attcatgcag aaccggctgt gtgggagact ggagttttat tatgacttaa 36301 atcagtcttc tcgagcattc agggagcaga gtttttaagg ataacttggt gggtgggggg 36361 aagccagtga gccaggagtg ctgattggcc ggagatgaaa tcatagcgag tcgaagctgt 36421 cttcttgtgc tgagtcagtt cctgggtggc ggccacaaga tcagatgagc cagtttatcc 36481 atttgggtgg tgccagctga tccatcaagt gcagggttta caaaatatct caaccactga 36541 tcttaggagc agtttaggaa gggttagaat cttgtagcct ccagctgcat gactcctaaa 36601 ctataatttc taatcttgtg gccaatgttg gtcctacaaa ggcaatctag ttcccaggca 36661 agaaggaagt ctgctttggg aaaggctatt accatctttg ttcaaactat aaactaagtt 36721 tttctccaag gttagttggg cctacgccca ggaatgaaca aggacagctt ggaggttaga 36781 agcaaaatgg agttggttaa gttaaatctc tttcactgtc ttagtcataa ttttgcaaag 36841 gcggtttcag aagtatggga gtcagtgtgg ctaaatagga ggaacagaga ctctcaatgc 36901 agacagaggg tttcaagtca cagatctgcc acttactgga tttgtgactt taggcaggtt 36961 gttttagctt tctgagcctc atctgtacaa tggaattttc acagtaatta gttcataggg 37021 ctgaaaagaa gatgaaatga agagattcaa gtagaacact cagaacaagg cctgccatgc 37081 tgtaggtgtt taataaacat tatcttccac agtgatgatg attaagggac taaatttcat 37141 gcacgtccat gtgaagagac caccaaacag gctttgtgtg agcaacgggg ctgtttattt 37201 tacctgggtg caggcgggct gagtccaaaa agagagtcag tgaagggaga taggggtggg 37261 tggggctgtt ttataggatt tgggtaggta gtgaaaaatt acaatcaaag tgggtttttc 37321 tcttatgggc aggggcaggg gccacaaggt gctcagtggg ggaggtttgg agccaggtga 37381 aggaatttca caaggttaat tgctcagtta aggtggggca ggaacaaatc acaatggtgg 37441 aatgtcatca gttaaggcag gaaccggcca ttttcacttc ttttgtgatt cttcacttgc 37501 ttcaggccat ctggatgtat acgtgcaggt cacaggggtt acgatggctt agcttgggct 37561 cagaggcctg atactaacca taacagatag aatttaaaaa gcagaggaac caaaaacagc 37621 ccaccactct cctgagctgg gagaatttgc ccagaattcc agggataggt gtgtcctaag 37681 cttccgctag gtagggtcaa tgcccagcag ccaagccaga aggaagattt gggcagacac 37741 tgtcattggt gaacacgtaa gctccagagc tagaatacta ctgatgatgg tgactggctt 37801 tggttcttcc gtctggtctg acacccagaa agccgatcag actcaccgct atgaaccttc 37861 tagtgccaca gcaggccaga aaccacagcc ctgcctcccc ttctcctgag ttcccggtgt 37921 gggtgtggta ggggcacggg tggattcagt atttctgtaa tggaaactag aaacacgtag 37981 ccccacagta gctgcattcc aaaggggcag agatctgact tcctttgttt catgctttta 38041 gagacaatga cagcaagacc tctcacagga agtgctagga ccactggaga gagtgaattc 38101 aactcttggc caaaaaccta tgggtgatct ttcaggcctc gccataggca attacaaaag 38161 ctttacctgg cagggatcat ggggggacgt gccctcccca catcagacct gatgcacagc 38221 cgtgcattgc agtctgtgaa tggagttgca gtcaaggact cagacgtccc gaactgatca 38281 tgctctcaca catgtcggca ttcctagacc agatgcctcc atttggagtc cgccttatca 38341 ggagcctttg gcagaggcct ttctcaagca cctgagctaa gacctcttgt tgccatggtg 38401 ggagtggggg atgcctaaaa atattttttt cttcaagctg actcagtact cagtactttt 38461 ctactcctag cccttccctg tcacctcctc ctgcccctgg gtccataaaa tggcaggagc 38521 ctcttgttca gggctccctt aacagtgaga tgatgaagcc ctgcatccgt cagacccctg 38581 ttggagccac cccatgtgga aaaatggaat aatggaggag tcggtacttt ttctggttca 38641 gtcccttgct tatactatca aaggcagtta ttaaagtctg actgttactt ttattttgac 38701 tcgtcttaat cgactactct gacacccagc agctcagctc tctctagctc agctgagctc 38761 ttgaaaaatt ggaatggcaa gctgggtagt tgattatgca caatgtcgtt tctgaagagc 38821 aaagagcctg aaaatctcgt ggggcaattc actaatgatg ggtgtaagca gaccttgcag 38881 ggcactcctg gaggatgcta atggattatt tgccagagtc agaggatctg accgggcact 38941 ctgccagtga gtcctgggaa gtgctagcag acctcacgtg gcactttacc agccatgcca 39001 gcaggtctca ggaattgcta ttgtaagtca cagggtatac ttggaggatg atagaaaatt 39061 gtttataaaa gcttttggtt ccaaggtaac catcggtcct cctgtttatc cccatcacct 39121 cggctgagat aacctagtgg atcaaataag agatccagca gaggggctct aagaagccac 39181 agaaacaatt ccctggacac cgttctctga tgatggggcc agaagtcgtt ggagaccctg 39241 taaactgacc tgtaccccaa aactattcct agatctaagc agttccctgg ggctgacgga 39301 caacatgcca tgcagcatgt tactgagact catgcttcta ataaaaataa tgtagagctc 39361 tttagaatca ctttgtccag tgatctcact agatagggtc aatgcccagc agccaagcca 39421 gaaggaagac ttgcttaggg gaagaaacca cagagacttg gctagtttgc ttccattacg 39481 aaaaaggtaa tcttgttaaa taaacttcat aaaggaaagt ggctccttga ggggtacatc 39541 actacagacc ccctcctcca tagtgccttg atagtggcca ctttctctga ggaggaggag 39601 caaggtgggg agatgagagt gtatgtatac caacgaccat ctctttcctc agctgggtac 39661 tatttggtct ctgacagatt tggtccacac ataatgagta cctagtcact gaaaaacatg 39721 tctcaaccac cttcgcagag actatggagg tcctgcagcc ctcctggtgg tctcttgggt 39781 ctactacgag ggagtgggaa cggtagatga gaagattaac cagacttaga tttacgggtt 39841 tgttcatgct gctcctgctt gttgtcaagc ccaattaagc cacctcctga ggtcaagtcc 39901 atcaatctgc agtgtgatcc agatcttaac tcagaaattc tcaggcacag aagaaatggc 39961 ttccgataaa atagcaaaat ttgacccctc ccctccaaaa agggagcata aaaatagtac 40021 ttgagtgccc tcttctggac tgtctgggca agggccttcc caaagagatc tctgagtctg 40081 attactcaat aatggaacca cgaaagcaga ttgatgggct ccccactaaa aaattcaggg 40141 ccaagtacca ggccatgggg atttgataag tggtcccgct agagtctgca cagtgatggc 40201 cttccacctc tcctggtgga tttggagaca aatcagagat cctgagcaac ttttgctctg 40261 acactcagct ttcaaatata aacaagatta ggggttacaa acaaaaatcc aaagatttta 40321 ggggttataa acaagatcct tctgtactca ggggacccag aggccctatg ccaaggttag 40381 agttcaactg aacagtggag aggaattgca cttcttaggc cttttagata ccagtacaca 40441 ggtgactgtg atccccacaa acccttgcat caagatgagg agaaaacaga tcgtgttctc 40501 tgactttggg ctcctactaa cctccgttgt catggccccc actactgaat gaaatattgg 40561 cagtgatgtg ttttccatgt gcagccccac ttgcctttgc ctccaagccc tgacaggaat 40621 tgcaatagtt atggccatct tagtgagaaa tatagaagac catggaccca tctgatgacc 40681 aacacctagt actgttgttt cccaaaaaac aatgtcagct gcctagggga gaaaaagaat 40741 aattgccatc attgctgagc ttaaggaagc caaaatgttt catgagactg tttctccatt 40801 caatagcccc atctggcctg tgcacaagac cttggtttct tggagactta ccactgattt 40861 taggcagctc aatgcgtaat accacctttg gcaccagcag tacctgacat tgtgactatt 40921 acagaggccc aaaacagaga gaacttggta tgcagcaatt gatattgcaa aggcatatca 40981 aattgtagtc cccagtgttg gagatggggc ctggtgagag gcgattagat cataggggca 41041 gatttcccct ctagttttgt tctaatgata gtgagtaatc atgacatctg gctgtttaaa 41101 agtgtagcat cggctgggca tggtggctca cacctgtaat cccagcactt tgggaggccg 41161 aagcgggcgg atcatgaggt cagaagttcg agaacagcct gaccaacatg gtgaaaccct 41221 gtctctacta aaaatacaaa aattagccgg gcgtggtggt gtgcgcctgt aatcccagct 41281 actcaggagg ctgaggcagg agaattgctt gaacccggga ggcagaggtt gcagtgagcc 41341 aagatggtgc cactgcactc cagcctggac gacagagtga gactccgtct aaaaaaaaaa 41401 aaaaaaagaa agtgtagcac ctctcctctc ttcctcctgc tctggctgtg tgaagatgta 41461 cctgcttccc cttccaccat aattgtaagt ttcctgaaac atccaagcca tgcttcctct 41521 acagcctgcg tacctgagtc aattaaacct cttttcttta taaattaccc agtctcaggt 41581 atttctttat agcagtgcaa gaaaaaacta atacagagca atttcttaaa actactcctc 41641 cctgaatcct cactctcaca gccagagacc tagttgccat ctgcaatgga cttaattgtg 41701 ccctgcctca aattcctatg ttgaagttgt aatccccaat gtgactgcat ttgtagattc 41761 ggcttttagg tgataatcat taaggttaaa agaggtcata gggtagggtc ctaatccgat 41821 aggattggtt accttataac ctcataagaa gaggaagatt gaccaggcgt ggtggctcat 41881 gcctgtaatc ccagcacttt gggaggctga ggcgggtgga tcacctgagg tcaggagttc 41941 gagaccagcc tggccaacca acatggtgaa accctgtctc tactaaaaat acaaaaacta 42001 gctgggcatg gaggcacacc cctgtaatct cagctactca ggaggctgag gtaggagaat 42061 cacttgaacc tgggaggcag aggttgcatt agtaagccga gatcacgcca ctgaactcca 42121 gcctgggaga cagagcaaga ctccatctca aaaaaaaaaa aaaaaaaaga agaagaagag 42181 gaagaggaag attctctctc tctctcccct tccctctctc cctcccccct ctttctgtct 42241 actggaatgt gaaaggaaaa gccagtgagt acacagagag aaggcagcca actgcaagca 42301 aggaaggtga ctctcaccaa aacctgagct cactatcatc ctgaccttag acttccagcc 42361 tctagaactg tgagaaaata aatttgtgtt gtttaagcca gccagtctat gatattttgt 42421 tgtagcagcc caaacagact aagacaccat ccatgatacc ttcttctttc tcagccccac 42481 atatattatg tcaccatatc ctgtgtgccc ttccttcaaa atatgctctg aatctggtta 42541 cttaccactt ccactgttgc cacccataac caaactgcta tcatcagctc ttccctaaac 42601 acctgcaata gctccttact aggctcccgc actcacgcct gattccctcc catctgttat 42661 cttcatatta aaaataatac agtggctttc cactataccc taggatgtga tttaaaattc 42721 ttagcatggc cttcaaagct ttgcctaatc tatctcctgc ttaactcttc aaactactgg 42781 agcctctctt ctcctcactc ctcaccatct agtcacatgg cccctttggg aatatatgaa 42841 aacttcccag cttggaatca ctgcccatta tctttcacct ggctcattct tgtttatcct 42901 tcaggttcag ctttcatgtc ccttctttgg agtggccttc catgacctac atcctaaatt 42961 agaagtcttt gtattctttc tagttttagc atgttcttta cattgacatg cataccacca 43021 gttttgtatg tattggttta ttgtctggct ccccagttgg gtgaccaatt tcatgagagt 43081 aaggaaagca tctgccttgt tcactgatgt tattaccagc acccagccta gggttcaaca 43141 ggagtaggtg ctcagttaac gtgcgttgag tgagtgaata aatgaataga gtctatgagg 43201 ttggtgtttt tagttcagag tagtttggtg attcacttgc tgccacacag ctagcaagca 43261 gacataaatg ggcaagatcc cagaatcata gttccaaggt gaatgcataa tcaagagact 43321 gccagggggt gaaaaatgga agggggcatg gagtgtgaga caggtgaaaa gattccagac 43381 cattgattct agggagatct tcccccaggg tgtgtttttt tggggggtac atattcaata 43441 ttcacatttg ttgaaaaata gcttctgatt tcataataaa tatatattgc tgaaagacta 43501 ttccggttgg taaggataac agaaaatgca gcgtttcttc gctgagggca gatttttttc 43561 agtttattga aaggaggcaa gttaaaaacg tgactaggaa actgcacgaa agctttctgc 43621 cattcccctg attggagcta aggcctgtgg ggtgcaggag acaccaaggt ttgggtcgtg 43681 ccgggactat gctcatatag tggagagtag gtcttggtga cacagcaacc cacaggctgc 43741 acattgccac ccacccctca ccccagcaaa ctgcacatca ttccgcccct accccacaga 43801 ctccccttca caattcccta atctgtatat tacctctcag gctccacaca tgccccacgc 43861 cagcccccct ctccaagctc cactttaccc cacattacct cctagaatca catcacccca 43921 caaatggtga gaaatggcca tttgtgaaaa gcgagtttgg aaggtgctgg tcaggagagc 43981 cgggtgcgaa gccagaaccc ttgggcagag gcagcatgag ggtcaactcg acagcagaag 44041 gcgcctgaga gtctctccgc aaaggccatg caaggacgcg cgctcctgta agtacagcct 44101 ccatgcccgg gttaattcct agacaggtag tgagccaatc aagccagcca cccactttct 44161 aaaaaacaaa aaatctactc tttctccttg ccctcctctg ctcagcatcg ttcactatgc 44221 ctcgaatatt ttaaatggcc ctatggatat ctgtaaaaca aaggctagta gttatcatta 44281 ttattatcat taatcagata tataggtaaa aggtgaccct aggaacaaag ccaagaatct 44341 agagtaagga tggctaatgg ggattgtgtt tgaatagttt gcttttgttt cttattttgc 44401 ttgggttgtg acgtgtgttg cttcaacata tggcttaaaa aaaaaaaagt ctgttaatgt 44461 attggtcaca aagtgtcttt ctgggaaaag gcacttattt tcccagagtg atttgacaca 44521 tttctggtga aatgcattaa cattaacgtt cctttttttt tccccaagtt ttggacatat 44581 gggtttaatt acctgctaga ttagcagatg gaaatatgtg gtaaatggat gtaaacaagg 44641 tgcataaact taggagagag gcagaaattt ttaaacttta ggtttagaca ttcactagat 44701 tggacaattt ttaacattta aaataaaact ttttgccaca aagaaacaat tgtattaaaa 44761 ctagtttcca gaactgccca aatgactttt taaacatgat ttaaatgtca agctttcttt 44821 ggcaaatcta tccaaaatac taattttcct tctagaactt ggtttataat ttatatatta 44881 gagtgccctt aattgatgtt cattgggata ccttcctctg agccctaatg gtttctatct 44941 tccctttaaa atgattaagt atcttttaat taagttgtca ttcgcaaaga tattctggag 45001 caaaaatcag gaattaaaca tctatgaatt cttcttcact tctgttatag tgtagcaatt 45061 tatcttctaa cccatagtag tctatcaatt gagtaaggct tagtttcttg ttctcttcag 45121 acacagctga ctgcaactca taaagcaaca gctggctaca gctcctccac ttctttatta 45181 acaactgtaa tggagacaga ttctttgatc atacagtttg atgagtttta gcctccaaag 45241 aaggtcttct ttctcctgaa cctgcttcac caatttggct ttttcatcgt ttaatctttg 45301 tttagaaaga ttctcattca caaactcatt ttggagagca ctgcatgacc cagagtcaaa 45361 tgactgagat tcacagacat tgattcttca aagtattttt caaatttata ttttcttcaa 45421 attcactgtc tatgtgttta aaactttctt gaaattccaa acagttttcc tctgtggaag 45481 atgctggttt ctctgaaaag gtctgatcgt tttcttcatt ctctacttta agaagtttca 45541 ccacattata acaggaacta aatgaagatc ttgctttcct taatctttct ctaagtgttg 45601 cactcatagg ttgtgttcgt gaactatttg tatagggaga tgatggattc acagaggtct 45661 gaggagtgct aggtaaaacc acagctgagt ctgatggact ttccatcttg aaaatgaaat 45721 cttggttttc cttcaactcc acagcagact cccgaggaag tccaggtctc aggcatcccc 45781 cttcccctca gcagggtacc tccctccgcc atccccagag agagcaaaaa gccaactttc 45841 ccttcttaaa ggactcacgg gaagggaata atggaaataa taacttcaca tgctgaagaa 45901 aacagataaa gtgtgaagaa accatttatc aacatttaaa cctccacaca ttatttgttc 45961 tttttttaaa agaactttac taagaatcag attcattaag tcaaaggatg ttagaacaca 46021 aaggaacctg caaaattaaa tcgcgttctt attatataaa acaggagacc cagagagggt 46081 aagtgacttg tccaaagtca cagagcactt gaatgcagtc agggctgatt tccagtctgc 46141 aactcaaaag agaagcagtg tttctcagtg aaagaacagt ggtgtgagtt ttaaagattt 46201 tattctattc tttattacat gattattata ggaaaaattg ggacacattt taaattacct 46261 agaatcctac cacttcacaa tgatcattgt tagtattttt gctgtataat tttccagact 46321 ttttaatgta tatgaatact ggtaaatgtg tttactaaaa ttggatcaca ctgtctgtac 46381 atctctgtca tataactcct aacagatgca aaactagtcc actgaataga ttaacttcat 46441 cagtttaacc agtcccttac tgatagatag atagataagc tttttcgaat gttaaaccct 46501 tataaaagat gctgcactaa ccattcctgt tcagacatct tttgctgaag tgatttctac 46561 ctactgatca gtgtgaattt tgtagggatg caatgatgta gactaggttt gaactctggc 46621 tctagaactt actgtgtgtc tttacacaaa tcacaacttt tctgcactct agtttcttca 46681 tcttcaagtg ggaataatca tactgatagc ctagaggtgt taaaaggatt aaatgaaata 46741 atacatgtaa aatagttggt taatacctag catgtagcaa atcctaaata atattagtta 46801 ttattattac cacaatgctg aagaaaactg gctcttcaaa ataatgccca tgattttctg 46861 ttactcttta aaagagacta gccagaattt tacaaagaag gttttagatg caaaacttca 46921 gtagctaaat taatatttcc ttttttctaa cttaatctat cgagaagaaa agtaataaat 46981 ttaaatcatg tgtggtctgt tgagctagtt actatgttct atctatataa cttaatgcca 47041 gtcaaaattt aaaaggcata gcttttcatc aaacattctt gtcttagtct gttttctgtt 47101 gcctataata gaatatctaa aactgcataa tttataaaga aaaggaattt atttcatatg 47161 gttatggagg ctaaaaagcc caaggtcaag ggactgcatc tggtgaggac cctctagctg 47221 gtggggattc tctgcaaact tcctagacag cacagagcat cacagggtga gggggctgag 47281 catcctagct tgggtctctc ttcctcttct tatgaagcca ccagtcccac tcccatgata 47341 acccattaat ctattgaccc attaatccat gatggattag tcgattcatg agggcagagc 47401 ctttatgacc catcacctct taaaagcccc aagaccccac ctctcaatac cgccacattg 47461 gagattaaat ttcaacatga gttttggagg gggcaaatat tcaaaccata ctcatttgta 47521 agtaaaaaat caactgtttt gtgataaagt tgctcacctt tctttcccct ttggaaaatg 47581 ctcaaagcat ggcttacccc ataaaatcag cagaatatct ttacagtcat ctatccagcc 47641 tcccggtgcc ttcaagctat tctccactgc ttttctgttt tatccatggg cccccagcta 47701 tttcatattc ctcagagggg tgggaaagag cacagaagct gaagttttca cctctgagac 47761 tttgctttta aactgaaaag atatccattg tcacactggg attcatttct atctgaagct 47821 ttggaaacac atcacgactt ttaccaaagc ctatacagtg aatctaattt atgtgtcacc 47881 tttatttcac caatagttgc tttaaaaata agggaacact cacccccttt gaaaagctgg 47941 gtggaaggtt cagtgttgta tggcttatct cggaagtcag gggtcagtga ccaaaattca 48001 cggtgaccag aagcattcca aaatgctgcc agtaactcca tgcttactgt ttcttcctct 48061 tctgctctac tgaacattgt ggacatagac actcaggatt gttggggtgg atgatactgg 48121 ttaaatgagt ccttcttgga cacagctttg gttgaatgtg agcaccaaac actgagaagt 48181 ccttgtctat aactgatgtg cagaatgacc catacgtttt ttggtttggg attctttttt 48241 tttttctcta ctgtgtgtta tccttttctc taatagtggc cagtcacatt ctaccagtct 48301 gtatttgtct cacggttgtt gaagtggctt tgccaactta gctaagctag aaacatagtt 48361 tcctggaatt gccttcccaa tgtggttctg agttagagtg gaccaggaga aaaatttggg 48421 cagggcttgg gaagtgggag tgaagcaata gatactatat tctgaagttt ggtaggacag 48481 gcactattac cactgccata cactggcact gatcaactgg cacacctcgt tggtgtgggg 48541 cagcaacagg acctgcagca actccaactc tgacaagatc tccttcttca acttctccaa 48601 gtcctaagtc aggtttgtgt gcacttccac gactatggat gccagctctt tctttaggtt 48661 ggaggcaata agagacagat gcaggttcca gtaagtcttc atgggattta gtttgtccta 48721 gggattcaag ttaatccttg caggttccag tttgtccttg ctttccccca caccacatac 48781 acctcttctt cccaactgtg ggccctgaat atctacagtg acctcaggct gctataagtg 48841 cagagacaat agccttctat agacttcttc ctctgtgtaa ggtacatgga tgttccttgg 48901 tcaaggaata ggccaaggtg gatatccagg cctgcatgac tcagtgagtt tggcatgcag 48961 gcacacacct ccacttgtta tataacctgt ttgtgtaagt ttatacttcg ctctaagcca 49021 ctattgtctg taaaaggtat aactgccctg ctaacgttct acaggggctc ttgggactct 49081 ggttagctca acatggctta acatggtggg cacgctggtg cccagagaaa gaaagagaga 49141 gagccaaagc tgtccgtctt gcagatggac aggagggagc cagcacacag cttggcttgc 49201 tcatgcccag agagaaaaag agttaagccg ctgaccctga aggcaaggaa gagccagatg 49261 cacagctgtt tgtgggagcc actggctcaa gcagctgcga cagggcaaac agtgtgagag 49321 agctagtgtg agaaagctgt taataaaagc tgctgctgaa taaaaccaca ttcacctgcc 49381 ttcagccccg agtgttcttt ctgctcatcc accaactccc tctggacttc agcatgggct 49441 ggacccggac cccgggacct gacaactggt gatgagaatg ggatgaggtg agttggcccc 49501 agtccctgag ggctcccagg ttggctttgt ggccacagca tgggctgtga tacccggtgg 49561 cagctgtgct gcgaagatgg gctccagtgg aaacatggga ggcagtgggt gggtctcctg 49621 tgagtatgga gaaggcactg aagcacctgg aagtgcacag cactgagaag aagcatgcct 49681 ttgctggcag agtcagatga gcgtttgtaa ctgtgctgtg ggaagttcat gcccagtcct 49741 tgtggaactc agtgcagtga ggaggaagaa cctccatggc aggcttgccc agtgatccac 49801 cacaaaatag atcatgagca gctactgggc cccacgagta ggcccagaga ccccctgctg 49861 tggtggagca cccttccttt ggtgcctgtg tccctgctga gttgagggag ttaagcaagt 49921 aatgtcggca gtcatgcaca gacttagtgc aagccatttg ggagaaggac cttgctgcgc 49981 aacccagtcc tgctcgagca ttccagttca aagagtacct gctgcagttg gtggaagtgt 50041 aaagcctctt ctgtttgata agagaactgg ccgagatgcc cagcttggag ggcactggac 50101 aacgggagcc acatttggac ttcgcaatcc actggtccct gaacctggat aagtttccag 50161 gcagagctgc atttattgac aactatgaag actggtcagt gaaagtgaaa cctgtatctc 50221 tgcatcttgg catcgactgc ttggctctct gcttatgtgc tgtgtatgtc tctccatcca 50281 tacctgaaga cattctgggg tagatgtttt gcatggcttg gcagctgtgc catctatcac 50341 agacttaata gaccacttga caatggaact tggcagtgcc actgtgtggt ggacttggct 50401 aatgcattct tgtcaatcga caatgctcca gagagccagg aagagtttgc cttcatggga 50461 cagcgacaat ggactttcac agtgttgctg cagggctact gtatagcccc accatatgtt 50521 gtggttttgt taataatgtt atgttaacct ctgattctct tgcaggttta aaagtggcag 50581 tgtccctctt gcctgggatt ggggtgatga ggctgagaca gcctttctgg gtagtaaacc 50641 agggtgctca tttacactga atgtgcatgt aaccacagat aattttggct agggcctata 50701 gcagtgcatg gagcacttgg aagcaccagt gggctgttag tcccaactgt ggaagggagc 50761 tgagctccag tgcttactaa tagagaagca gttattaata gtgggatggg tgcattcatg 50821 ggtaatgacc ccctggacag aggcagcaca gacatcaact ttagcgaagt ggggtaccga 50881 cttgaaacag tgaagtatgc taagtaagta caaatccctt agcagcaaag ttgcaagagg 50941 tcttgggacc tgtagtccta atgcaagata aggccatggg gcctgaggca ctcctaaacc 51001 ctgagacttc atcattagga agggcatccc ctcattcctg atagggcatg gcacacagct 51061 aggtctagct ggggtgctac tgctgcctgg actgctggtg cggtccagcc tagtactaac 51121 accatatggt ttgaaaccag gtgtgggcaa agcagctaat aagctaaact cagggcagtg 51181 tgaatggtaa tcaccaagga tgtgacaacc tatggtaatc tgcgccaata gctgggcagt 51241 ttatcaaagc ttatgtatgt aacgggcttg agtgcccaaa agcttgtgta tgtattgggc 51301 ctgcatgccc aaagcttgtg tgtcaggctt atgtgtcaag cctgtgtata tatcaggcct 51361 gcgtgcccaa agtttatatg tcaggcctgt gtgcaaaact tgagtatcaa acctgtgtgc 51421 ccaagaccta agtctccctc agcctagggg gtggagtgta aggtacatgg atgtgctttg 51481 gtcaaggaat aagccgaggt ggatatccag gcctgcatga ctcagtgagt ttggcatgca 51541 ggtgcacacc tctgcttgtt atataacctg tttgtgtaag ttcacacttg gctctatact 51601 tggtataaag gtatatttgc cctgctaatg ctgtacaggg ctgttggggc tcagctcggc 51661 tcaacatggg ttaacatggt agtgggtgcg ctggtgccca gagaaagaga gagagccaaa 51721 actgtcagtc ttgcagatgg acaagaggga gccaggacac agcttggctt gctcatgccc 51781 agagagaaaa agagttaagc tgctgaccct gaaggcaagg gagaactggc tgcacagctg 51841 tgtgtgggag ccgctggctc aagcagccaa gacagggtgg ccagtgtgag agagccagtg 51901 tgagaaagct gttcataaaa gctgctgctg gccgggcgca gtggctcacg cctgtaatct 51961 cagcactttg ggaggccgag gccagcggat cacgaggtca ggagatcgag accatcccgg 52021 ctaacaaggt gaaaccccgt ctctactaaa aatacagaaa attagctggg cgaggtggca 52081 tgcgcctgta gtcccagcta ctcgggaggc tgaggcagga gaatggtgtg aacctgggag 52141 gcggagcttg cagtgagccg agatcgcgct actgcactcc agcctgggag acagaccgag 52201 actccgtctc aaaaaaaaaa aataaataaa taaaataaaa taaaataaag ctgctgctga 52261 ataaaaccat attcacctgc ctacagcccc ccgaatgttc tttctgctca tccacccact 52321 cctccggact tcagcatggg ttggacccgg accccaggac ctgacaccct gctcccacaa 52381 ttgcatatgg tctgattcct gtaatcatcc cttattgctt ttcactctgc ttctctgatc 52441 aaactctgca actatgttca agtgattctt attttacatc caaatttgca aatcagacac 52501 aggattatag acattacaaa aaaattcaat tttgaaaaaa aaaggaagag atttgtatat 52561 gacacttaag taacccatct attaccctcg ctttcttttt gatgtcaaac tcctttaaat 52621 ggcaccccaa gtttattgca aaagtatttt tcttcctccc attttctctg cagttccttg 52681 aatatgactc ctgcttttaa aatgctattt taattcatca gaaacttata aaacatcaaa 52741 tccagatctt agtccacttt atttctttgt actcttaaca ctgttgacta ctctcttttt 52801 tccctcatta aatttggttt caatgggtct caaatttctg tgacagattt ttggtcaagt 52861 tgtttccact aaaaagtgct gattttaaaa attaaataac ttaaaactac cagatgccaa 52921 aaaaaaaaaa gttcacaaaa cattctcctt tccttccaaa ggttttacaa tgcattgtta 52981 tcattaacca gtcttttacg actaaactta agtggccagt tgaaacaaac agttctgaga 53041 cccttccacc actgattaag actcaggcca ggcaccgtgg ctcatgcctg tactcccaac 53101 attttggaag gctaaggtgg gtagatcact tgagcccagg agtttgaaac cagcctgggc 53161 aatataatga gacctagcca ggcatggcgg cacatgtctg tagttccagc tacttgggag 53221 tctaaggcag gaggattgct tgaacccagg aggcagaggt cgcagtgagc agtgattgtg 53281 ccactacact ccagccaggg caacagagtg agaacctgtc tcaaaacaaa aaacaaaaaa 53341 caaaaaacaa aaacaccaga caacaacaac aacaacaaaa aagactgggg tggcatgtat 53401 tagggataat attcatttag ctttctgagc tttctggaca gacttggtga ccttgccagc 53461 tctagccgcc ttcttgtcct ctgaacccat ggcaactgtc tgtctcagcg aatttggcat 53521 gcaggtgcac acctccactt gttatataac ctatataacc tgttgtataa gttcatactt 53581 ggttctatcc ttggtataaa ggtatatctg cccgcaaaat gtcccagaag agaatagtca 53641 ggatagtcag aatattcaga gaagctctca acacacatgg gcttgttagg aactatatca 53701 gtcatggcag caccaccaga tttcaagaat ttagggccat gttctagctt cttaccagaa 53761 tggtcacctt ttctttctgc tcagaacact tgtaaggaat ttgagatgtg tgacaatcta 53821 gtgcaggggc atagccagca ctgatttggc ctagatggtt caggataatc acctgagcag 53881 tgaagccaga tgcttccatt ggtgggtcat ttttgctgtc accagcaaca ttgccatgat 53941 gaacatcttt ggcagatgcg ttcttttttt ttttcttaga aagaatgtaa tgcatgtttt 54001 taatcagaac aacagcaata acaaaagcta agtatggata tgccaatgta gtgttgaatc 54061 cagcaatgga cacaaataaa gagcaaatgc ttgacagaga cagctgtaaa taatctatgt 54121 acctcttaca tccccccact tcataaaaaa gacattctct ttggaaatat ggttacaaat 54181 gtaaatcact gatttcttgc aagaccacga tgattctgta ataaatatca aattgttcca 54241 tttcagattt gtaaaaggat tctttcagat atgaagactt gggagcaact gtgaagctgt 54301 ctttattcag atgctttgaa atataaagtg gtgatcttaa accaccaaga actgtaaggc 54361 atgctaactg atgtttataa caacttagcc ttcttttcct ttatatccga ttttattaag 54421 tgggcaaaca ccttgttcac attgaggccc acattgtccc caggaagaga ttcactcaaa 54481 gcttcatggt gcatttcaat agactttact tcaattctaa cattgactgg agcaaagctg 54541 accaccatgc caggtttgag aacaccagtc tccgctcggc ccacaggtac agtaccaata 54601 ccaacaatct tgtagacatc ctggagacac agacacaagg gctcataagt tggatgagtt 54661 aatggtagaa taaagtccag agcctcaagc agcgtggttc cactggcatt gctatcttta 54721 taggtgactt tccattccta gaaccaaggc atgttaccac ttggctccag catgttgtca 54781 ccatgccaac cagaaattgg cacaaatgtt actgtgtcag ggttgtagcc aattttctta 54841 atgtaagtgc tgatttcctt gtatctcatc tggtggtagg gtgactaagt ggaatccatt 54901 ttgttaacac caacaagtag ttgtttcaca cccagtgtgt aagccagaag ggcatgctcc 54961 tgggtctgcc cattcttgga gatgccaact tcaaattcac caacaccagg agcaatgatc 55021 aagagagcac agtcagcttg agatgtgcct gtgctcgttt tttttttgtt tgtttgtttt 55081 gtttttttga ggcggggtca tgctctgtcc cccaggccag agtgcagtgg tacaatcttg 55141 gctcactgca acctccacct cccgggttcc ggtgattctc ctgcctcaac ctcccgagta 55201 gctgggatta cagacgcatg ccaccacgcc cagctaattt ttgtattttc agtaaagatg 55261 gggtttcacc atgttggcca ggctggtctc gaactcttga cctcaagtga tccgcccgcc 55321 tcggcctccc aaagtgctgg gattacaggc gtgagccact gtgctcggcc tgtaatagtg 55381 tttttgatga ggactctgta tcctggggca tcaatgataa tcatgtagta cttgctggtc 55441 tcaaatttcc acagggaaat atcagtggtg ataccattca gctttcagtt tatccaagac 55501 tcaggcatac ttgaaggagc cctttcctat ctcagcagcc ttcttctcaa tttttcagtg 55561 gttcttttgt tgatgccact gcatttgtag atcagatggc cagtagtagt ggacttgcct 55621 gaatctacat gtccaatgac gacaatattg atatgagttt ttcctttccc attttggttt 55681 ttaggggtgg ttttcaagac aacctgtgtt ggcagcaaac ctgttgcaga aaagctactt 55741 tctcttttct gaatttctct cttcccttgg tttctatgac aagcactctt tggaattttc 55801 tcctatctct ccaactgctg ctccctgtct tagtatgctc acttctcttc tacccaatcg 55861 tcaaggtgta ggagttggtt gctcaaagtc cagttcttgg cactcttgaa ttctcactct 55921 accctctcta cctttatgac cccctccact ctaaaggatt caaccaccat cccttcacac 55981 tcatgattta tagatctatg cttttatctc agctttctct cctgaatgcc aaggtcatga 56041 ttcttatagt ccatttagtt gcagatgact ctcaaattca acaagcacaa actgacctat 56101 ttatttacca tgacatgttc catgtcccct aacttggata atggcattac aattcttcag 56161 tcactcgttc tcaaaacttc agagttgggg agtgatgtca ccaaagatgg agtagaagca 56221 atctggattc actccccctg cccactgaaa accaaaaaca actatccggt gccacaatta 56281 tcaccagcaa tatcccagaa ctcaaaatca aagctgtgac aatccctgag gccacagaga 56341 agtaaaaaac tatgagctga aagtgagaga aatggacttc tctatccaca atgcccctcc 56401 cccaaactac tagacaccac atggaaaaat cctccgagac tcatggtttc tacattggaa 56461 aaagtgagat caaggtggaa agccagcttc cccatcatct tgggttccta tgcaagaaag 56521 ctgttcctgc ctcaactcac aggaagcatc acaagtgcct ccacctaagg acaggtggag 56581 acaaaccttg gaggtggagc tgcatggccc agcaccagaa actcgggggg ctgctctcca 56641 ctctagaaaa aggggacacc aaatcagaga ggtggtttag cagcaccaca ctgtaggagg 56701 caccctgcag ggaacctctg ggcatgaacc cgtagccagc ctccccacac agctgaggag 56761 tcccctttgg aatctccccc tgtctgtaat gggcagcact tggagagcta cagaaaacct 56821 gtgtttaggg cgccatctag tgctcaaaga aggcagcaat ctagggctaa gggaatcaac 56881 gggcaaatcg cacagaacct ctaaacacac aaaacaactc agacaggaaa gacttgaaaa 56941 aataaccaat tctttaatgc gaagacatag atctacatac ataagaaata acagcaaaca 57001 gtgaataatg acctccccaa atggacaaag caaggaacca ctgatcaacc ccagtgagag 57061 agcaacatgt gagctctcag atagcaaagt cacaggagtc agatcaagaa ttcaaaatag 57121 cagttttgaa gatactcagc aatctccaag ttaatacaga aacgcaactt agaaatttat 57181 cagagaaatt taatgaagaa atggaaataa gaaaaaatca aacagatacc tggaactgag 57241 aaatacattg gctgaactga acaactcatc acagaggtat caacagcaga actgatcaag 57301 cagaagaaag aaatagtgag ctcaaagata gtctatttga aaatacacag tcagagaaga 57361 aaaaaagaaa aaagaaggaa aaggaatgaa gaatatctac aagaactaga aaataacttc 57421 aaaagagcaa atctgagtca ttgaccctca agaggaaatt gataaagaaa agggggtcat 57481 tagcttattc aaagaaataa cagaaaactt tccaaaccta gagaaagata taaatatcca 57541 gatacaggaa agtcaaagat taccaaacag attaaaccca aataggacta taccaaaata 57601 tgtattaagc tctcagaggt caaggacaaa gacaagattc taaaagcagc aagagaaata 57661 aaagcaaata acacacaaag gactttgatt tgtctggcaa cagactctca gcagaaacga 57721 tacaggacag aagagagtga gacgacatat tcaaagtgct caaggcaaaa actgagaata 57781 atatacccaa caaagctatc cttcaaacat gtaggagaga taaagacttt ctaagacaaa 57841 caaaaggtga atgaattcat catcaccaga cctgtcttgc aagaaaatgc tagagagagt 57901 tcttcaatct gaaagaaaag gatacgaatg tgcaataata aagtcattta ggctacgcat 57961 agaggcccac acctgtaatc ctagtgcttt ggggggcact aggcacagaa agacaaacat 58021 cgtattttct tacttatttg tgggatctaa agatcaaaac agttgaactc atggacatag 58081 tagaaggttg attaccagag gctgggaagg gtagtggggg ataggtaggg gagttgggga 58141 tgattaatgg gtaccaaaaa atagttagaa tgaatgacac ctactatttg atagcacaac 58201 agggtgtcta tagtcaataa taatttaatt gtacatttta aaataactaa aatagtataa 58261 ttggattgtt tgtaatacaa agaatgatca cttgaggggg tggatacccc attctccatg 58321 atgtgattat tatacattgc atgcctgtat ccacacatct catgtaccca taaatatata 58381 tacctactat atacccacaa gaattaaaaa taaaattttt ttaaaaagta gaaagacttc 58441 aaataaataa cccaatgatg cacctccagg aactagaaaa gcaagaacaa atcaaaccct 58501 aaattagtag aaagaatgta acaaggctgg gtgtggtgac acctgtaatc acagcacttt 58561 gggagactga ggcaggatga ttgcccgaca cccacctggg caacaaagtg agactgtgtc 58621 tacaaaaaaa gtttgaaaat tagccaggaa tgatggcatg tgcctatagt cccagctact 58681 taggcagctg aggcaggagg aatgtttaag cccaggtggt tgaggctgca gtgagccatg 58741 atcgcaccac tgcactccag cctggggagc agagcaagat tctgtcagaa agaaaaagaa 58801 aagaaaagag aaaaagaaaa taataaagat cagcacagac ataaatgaaa tcgagattat 58861 aaaaaaatac aaaagatcaa tgaaacaagt agttttccaa agagataaac aagcaaaact 58921 gacaaatctt taggtagacc aagaaaaaag agagaagacc caaagaaata aaatctgaaa 58981 tgaaaaggag acattaaaac taataccaca gaaatatcac agaaatataa aggatcatga 59041 gagaccatta tgaccaacta tataccaaca aactggaaaa cctataagaa atggttaaat 59101 tcttggacac atacaaacta taagaaatag aaagcctaaa cagatcagta acaagtaatg 59161 agatcaaagc agtaataaaa agtctctgat caaagaaaag cccaggagca gatggcttcc 59221 acaatgaatt ctaccaaaca ttttaagaat ataaactcta ttcaaattat tccccaaaaa 59281 attgaacagt agggaatact tccaagttca ttgtatgtgg ccagcattac cctgacagga 59341 aaacaagaca aaagatacaa caacaataac aacaacaaca acaacaaaac tacaggccaa 59401 tattcctgat gaacatagat gtaaaaattc ccaataaaat actagcaaac caagttcaac 59461 atcacattaa aaagatcatt caccatgatc aagtgggatt tgccacaagg atgtaaaatg 59521 gttcaccata cacaaatcaa taaatgtgtt atatcacatt aacaaaacta aggacatttt 59581 tgtccatgat gatttcaata gatgctgaaa aaagcattca gtaaaattca gtatcttttc 59641 atgacaaaaa ctcaacaaac tgggtataga aggaacatat ctcaaaacca taaaggccat 59701 atatgacaaa cccacagcta acatcatgct gaatggggaa aaattggtag tctttctggg 59761 gaaaaattgg cagtctttcc tctaagaaat gaaagaagat aaggatgacc aattttacta 59821 tttttattaa acatagtcct gaaagtccta gccaaagtaa ttgggcaaga gaaagaccta 59881 aagggcatcc aatttggaaa gaatgaaatt aaattatcct tgttcattga tgacatgatc 59941 ttatatctag aaaaacctaa agactctacc aaaaaattat tagaactgat aatcaaattc 60001 agtaaagttg caagatacat aatcaacata caaaaatcag tagcatttgt atatgctaac 60061 agcaaacaat tggaaaaata aatcaaaaaa gcaatcccat ttataattgc tacaaaaaaa 60121 tacctaggaa taaatttaac cacagaagtg aaagatctct tacaaagaaa actatgaaac 60181 actgatgaaa taaattgaag aggatacaaa aaaatagata tctcatgctt atggattgga 60241 agaattaata ctgttaaaat atccatacta tccaaagtga tctacagtgg tctcttcact 60301 ctattgattg tttcccttgc tacccataca atatgataca aaatgatcta cagattgcaa 60361 tccctatcaa agtgccaaca acattcttga tggaaataga aaaaacaatc ctaaaattca 60421 tatgaaacca caaaagaccc caaatagcca cagcaatcct gagaaaaaaa gaacaaagct 60481 ggaggcatca tactacctga cttcaaaata tgctataaag ccatagtaac cccaaacagc 60541 atagtactgg cataaacgca ggcacataga ccaatggaac agaataaaga acacagaaat 60601 aaatccacac atttacagcc aactcatttt tgacaaagat gccaagaaca tatattggga 60661 aagaacagtt tcttcaataa atgatgctgg ggaaactgga taaccatatg cagaagaata 60721 aaactagacc cctgactctc actatataca aaaatcaaat caaaatggat taaagaaagc 60781 caggcatggt ggtgtgtgcc tgtagtccca ctacttgaga ggctgaggca gaaggactga 60841 ttgcttgagc ccagaagttc aagtccagcc tgggcaacat gatgagaccc cctatctctt 60901 aaataaagac ttaaatgtaa gacctgaaac tataaaacta ctggaagaaa atgctgagga 60961 aatgcttcaa gacattggtc ttggcagaga tattttgtgt aagacctcta aagcacaggc 61021 aacaaaagca aaaatagaca aatggcatta catccaggta aaaagcttct gcacagcaag 61081 ggaaacaatc aatagagtga agagacaacc tatggaatga gagaaaatat tttcaaacca 61141 tatattaaat aaagggtgaa tatgcaaaat agaaaaggaa ctcaaataac tcaacagcaa 61201 aaaataataa tctgattaaa aatgggcaag tgatctaaat agacatttct ttttctttta 61261 gatttgccgt taacaatgaa tatatatttc ttaaaagaag acatacaagt agccaacaga 61321 tatatgaaaa atgctcaaca tcactaatca gggaaatgca aatcaaaacc acaatgagat 61381 atcatctcac cccaatttaa atggctatta tattcaacct gagaaaactg ataaaaatgg 61441 ctattataag acaaaaataa caattgctgg taaggacgca gagaaaggag aactctcata 61501 cactgcagat ggaaatttaa attagtacag ccattaatga aaatagtatg aagtttcctc 61561 aaaaaaacaa aaaaactacc atatgatcca acaatcccac tgttgtgtat atattcacaa 61621 gaaaggaaat caatatattg aagagatatc tgtactctca tatttattgc agcactatgc 61681 ataatagcca agatatggat tcaacctaag tgtccagcaa gaaatgaata aagaaaatgt 61741 ggtatatata ccacagtgaa atattattca gccataaaaa tgaaataaat actttcattt 61801 gcagaaacat ggatagaact gaagggcatt gtgttatgtg aaataaacca ggcacagaaa 61861 gacaaatatt gcatgttctc actcatatat gaaaacgaac aaacaaacaa aaaaaggacc 61921 tcatggaggt atagaataaa atagtgatta ccagaggctg tgaagggtag agggagggag 61981 gatgaagaga agttggttaa tgggtacaaa actacagtta gaagcaaaaa gttctagtgt 62041 ttgatagcac agtagagtga ctatagttaa caataattta ttgtatatct caaaatagct 62101 acaagagaag atttggaatg ttttcaacac aagtaaataa taaatatttg aggtgatgga 62161 taccccaatg acccttattt gatcattata cattgtatac atgtatcaaa atatatgtac 62221 ccccagaaat atatgtacaa ccattatata ttaattaaaa acttgagtta ttcaatactt 62281 ttccctctaa tcagactcct aactttagct tccacctctt tttctacccc ctaaagcttg 62341 tgtttccttc ccaaaacatg aattggctta tgtaaattct ttgttcaaaa tatctcctgg 62401 ttcccagtgg catatagaat acatatctgc cagttttgta cagcatagcc acattttgat 62461 ttcttccctt tcaactgtat gtcctactat actccaatgt ccctttcatc acacacacag 62521 tctgtggatc tccctcagca ctcccccatc ttctggatat atctggatgc tcctggccca 62581 ggggcttggc tcagaccttt tctgtttgag ttatgtacta tattgaatgc ctaaccatac 62641 tccctatcca gtttcagttt tcactcctag ctgatgactt ggataagagt ctcttgagtt 62701 agaaatagtt tttatcaata acatgggagt aacaatgcat ataatagcaa tagctcacat 62761 gaatcaagct cttaattact gtgccaggca ctgttaatga caaaaatgat aaatgtcctc 62821 caggataaag ggacctaaga gacagtttga tcaaatgtcc ttacaccctc aaagagggtt 62881 tcctgatata acaggtccag cctctgtcta cttgcccttt tctattccaa cttgtcagga 62941 gaagaggggc cataccctgt taagtccagt gagtagaatc acttcattca tcaaggcggg 63001 acgtggcttg tttatgcatg aggtagttat tccactctgg tttaatttct tttaaaatgg 63061 ggatgatgac aattgtggtt gttgtacaga taaaaactca tgtacagtgt caagcatagg 63121 acccatggtt aagagctatt aatattcaga atattaacac atcacagggt acctgtaaaa 63181 cacttcttaa agaactacag tgccttcaac ctagcctggc tcttatttat ttgacacttc 63241 agtggggaac aagaattcaa ttttggccag gtgtggtggc tcacgcctgt aatcctagca 63301 ctttgggagg ccgaggcagg tggatcactt gaggtcaaga ttcaagacca gcctggccaa 63361 caaggcaaaa ccacgtctct actaaaaata taaaaattag ctgggcatgg tggtgcacat 63421 ctgtagtccc agctattcag gaggcttagg cacaagaatt gctctagccc aggaagtgaa 63481 ggctgcagtg agctgagatc gtgccactgc actccagcca aggtgacaga atgagaccct 63541 gtctcaaaaa aaaaaaaaaa agaattcatt tcaggaagat agttctgaga ggcgaaagaa 63601 gaggagaagc agggaggagt gggaggatga ggtagcagca gtacccacag tgatgcagga 63661 gctcctggat gggaacctgg agggcagtgg tgtttttatt gcctcttcca tgattagaaa 63721 ataagttttg aactttgggg gctttaagga gtggaaatga gaaaagtaaa tacagaggct 63781 gttagccctt cttaagggag aatactactg tcttctgtgt atcttcatgt tgggactcag 63841 tctgagctgg aagtattcaa attaatgcac tgaggagctg gattatacaa aaattagctg 63901 ggcatggtgg catgcacctg taatcccagc tacttgggag gctaaggcag gagaattgct 63961 tgaacatggg agtcagaggt tatagtgagc tgagatcacg ccactgcact ccagcatggg 64021 cgacagagca agactccgtc tctaaataaa taaataaata gggccaggag cggtggctca 64081 tgcctgtaat cccagcactt tgggaggccg agcaggcgga tcacgaggtc aggagataga 64141 gaccatcctg gctaacacgg tgaaaccctg tctctactaa aaatacaaaa taattagctg 64201 ggcgtggtga caagcaccta tagtcccagc tactcgggag ttctgaggca ggagaatggc 64261 gtgaacctgg gaggcagagc ttgcagtgag cggagatcgc gacactgcac tccagcctgg 64321 gcgacagagc aagactccct ctcaaaaata aataaataaa taaataaata gggttcaatt 64381 gtacttaact atttgaagta tttaaaacca agcataacct ctgagctaaa atgagtgatt 64441 ttgtaaatat ttaggcagtg gtgtgaggaa gtaattcacc cccaaaacac actaatgtac 64501 atttagttat aatgagactg aagcagggag atgaaggcca agtccttctg ctgttgttct 64561 taagcctttg gtaaagcaac atagggagtg gcatctttag atttgtctaa aaaaaacaaa 64621 gaacagttac aaataaaatt ccatgcacat gtatcccaga acttaaagta atatatatat 64681 atggcaaaaa accacaaaaa caaataaaat tccagacaga aagacgatgc aaatattaga 64741 gaattataca caaagtatgc cttgcagctg ccacacaaac tcaacattat gaaactatca 64801 aatgcacaaa tgaacactct cttgaggaat ttctaatatc aaaggaattg ttttcttccc 64861 ttttggtggt acaattatga agaggtgggt tttgcagaat ttgtttagtg gtggtagggt 64921 ggctgtcaat tcagggttga ttcaaacctc tgttaatttg aaatactctt ccattttctc 64981 cagtagtgaa gacttatttg taggtcatga gatcaaaagg tgactacctg agaccgtgga 65041 aaacctaaag ttgtgtcaaa gaactattgc cagggaaaaa acatcaagat gagtttattg 65101 ttagttttgc tgcagagaga gagagagaga gtgtgtgtgt gcatgcagta atggaaacat 65161 gtgcagatct gatggtgggt tgtccaaaac aagcagcccc tgagtgctgt catcctaaca 65221 ccataggtag ggctcacagt ccgtcttggc tgactcccac agtcaggttg ggaggatgaa 65281 agaaacctcc actatctcag gaggcatgga aagctcagag gaaacgaaag acaaggtttg 65341 atgagctgct tttctggttt tcatagtggt ctcttaatag aattctgaaa aggaaaagga 65401 gccacaagaa aaaagagaaa gaaaatgttc tcatctattt ccaaattcta aggcatgacc 65461 ttcccccaag aaactcttgg tagatgtttt ctccctttct cttatttgca aatataatag 65521 aactgtttta tgtaatcagt actgggaata gggaactaat attggcccat aaaacagcct 65581 actttgggga tcggctattt agggagattt gataatagat ttgaacatgc ccaaaatgag 65641 ctctattaca ctagccttct ttagactata ttcatgcaga gaaaggatcc taaatcctaa 65701 aataggagag tgaaagtgag gagtcagaca gggagaaaag gcctgtagtg tcacaatttg 65761 cagattggga acttccaaaa ggattaaata caatgttaac tccattaact aggatgctga 65821 gtagacgtaa agcccacctg acatggttgt tttgttctgt tttttagtag ctgccaggca 65881 tttaagaaaa agccttttct gtgacccccc acacccacat tgatgttgac cttgcattaa 65941 aatgactagt tctgttatgg gatctttgga gtgtcaattt tctggccaga aacctctgtg 66001 gctggtgaca tctttgccct agttcttgtc ctgtgtccag gaagaatgag gtatgcgaca 66061 agtgaagagt gaacaagaca aagaggcgcc tgagttcttg tcctgcatcc aggaagaatg 66121 aggtatgcga acaagtgaag ggtggacaag gaggaagagg agctttatta agtgttagaa 66181 cagctcagag gagaaccaca gtgagtagct cctctctgta agcaggtaat cccatctctc 66241 catcctctgc cctgctctgg ctgagcccag aacttctagg gacctcagag gggaggaagt 66301 gtgtgctgat tgttccatgg gcagccatgg gtgggcccag aaaaggcacc acaggtcctc 66361 actccagtcg tcaggactgg ccgcctggcc tgaaggtggg gtcttactga ggacctgccc 66421 tcttccaccc aggagcctgt ctgcctccca cagcagtcca tggcgcccag gctgcttgca 66481 ccaaggggca cttgcaggcc agtgctgagc tgccctcagc cccccttagc ttcctctctc 66541 atgttcgtgg gcactcaaat tctggagggg gccaaggcag cagggcactg gcatgtcagc 66601 actgccctga gcctacacac atcctccggg ctgtgacagc acctccggct cggccccaac 66661 tcctctctga gatgagatca gagcaggtgc ctgagggaaa agagaacagg cagtgggagc 66721 agacacacct gagcctgtgg tggtggggct tcctggcccc caaggatgca ggctgcagag 66781 atgcccgtgt cctgcacctg ggagggtagc tacagatgca cctgaggagc tcctgcccca 66841 ccaacttgga aggggtgggg ctcccacttg tccctggctc ctgcccgctc tgtggagcca 66901 gaggcctggg tctgcagcca cggattgggc agttatagtt gcactcagga ggacagggat 66961 cctgcctgct cctggctccc tcgaagagca cagggaggct cagatctgca gcccagtagg 67021 gtcggggctc ctgcctgatc catggagctg gaggcccagg tctatagctg cagtttaggc 67081 gactgcagtg gcacccggga agctccctcc ccaacctgga aggggtgggg ctcccaccag 67141 ctccatggaa tgtgcagccc cagccatgcc tccctgctgc agccagcatg attgcagcag 67201 ccgctgccat cagttctact ttcctgcctt agaaattggt gatggtatag caggaatttt 67261 aaggaatcag agagactgat ggggttcagg gggatattta ttaattattt aggtgcacca 67321 gcccagttgg attaacatcc aaaggactga gccccgaaca aagagttaag ttacctttta 67381 agcatttcat ggggtcgggg gagatctgtg cagggggaag cgagaaacaa aggcagttat 67441 tcaattgaga catgcattac ttcatttctt actttttaag gaaaaacatg ttttgtgaat 67501 tgagtttatc tgtctagtga ccttgcagct gcacagctag ggaatcagag tcttcacaat 67561 gcctgggaag ggaggagaga taaggctcac tagccacaga aaaataggca gttagttttt 67621 aaaggactct agctctttct ctttctcaga gggaattggg ttttcttacg tacaactgaa 67681 tttctgctta catactcttt aatttctttt aattcctgtt ccaatgggaa tgagatctcc 67741 tagttttgga ctggggctag cagggcctcg ggtgcaccac cacttggcaa catcagcctc 67801 tgtccagaat cctgaacctg taagctagaa cagggttagc tgacagtgtc ctgaacccaa 67861 gaaggttggg agggtggggt tggattggag tatgcaggat tgacaatggt atacctcttc 67921 tccaaactac tggctgggca gaaatcttga gttctaatga atctgaacct agttaggcta 67981 acctttcctt agcacctttc cttagcagta aaggctaaaa tatcaagtca gtatccaggg 68041 cttcagcaac tcttgtgtcc cagtaagaca agccacaaat acctttggcc ccattatcag 68101 gcctccattg ccataaaatc tttttaactc tcaagaggtt acagagaatg aattactcct 68161 taatcatact tgaagcaaac cttctgtcta aaaatccaac gtagccaggc atagtgactg 68221 gagcctgtga tcccagacac ttaggaggct gaggtgaaag aattgcttga gaccaggagt 68281 ttgacaccag cctgagcaac atagtgactc ccatctctaa ataaataaat agacagatag 68341 ataagtaaat acatagacaa attgatagaa tgatagatag ataagtgagt aataaataga 68401 ttaggtagag atagatagcc agctaggtaa ctagccaacc agctgtctca aaaaaaaaaa 68461 tggctggctg tggtggttta tgtctctaat cccagcactt tgggaggctg aggcaagagg 68521 ctctcttgag tccaggagtt caagacgttt gggcagcata gtgacaccct gtctctaaaa 68581 aaaattaaaa attagctgtg tgtggtggtg tgcacctgta gttccagcta ccctggaggc 68641 tgaggtagga gatcaaggct gcagtgagcc atgactgtac cactgcactg cagcctggac 68701 gacagagcaa gatcctgtgt caagaaaaaa aaaaaaagaa acggtttttt ccctctactt 68761 tctcacagtt aatacaatac tcagcacaaa acacttttga catcagattc tccagtgaac 68821 accaactggg tgtcttatga tttaattcaa ttctgacact atctacctgg agatagtgtc 68881 agatcctatg ggttaacggc tcagtcccat gagattgtgg cccccatttc agatgccagt 68941 cacaagcact agattgtcac ttatacttct gactccctgg gtataaattg cagggtccta 69001 caaccccctc ctcaggttca attaatttgt tagagtggct cacaagactc agaaacactt 69061 acttacattt actagtttat tataaaggat acagaataac agcaaaatgg aaaagatgca 69121 tagggcaagc tatcaggagg agggtgcaga gcttctatac tctctctagg tgtataacct 69181 tccaggcacc tctgtgtgtt cagcaatctg gaagctcatc aaatctcatt gttcaagagt 69241 ttttatagtt ctggtgttct atggcacagt agggtgacta tagtaaacaa caaggtgttg 69301 tagatctcaa aatagctaga agagaggatt tatatgttcc tatcacaaag aaataataca 69361 tgtttgaggt gttggatatg ctaattactc tgatcatttg atcattacac aatgtataca 69421 tgtatcaaaa caaaatcaca ttaggctgcg tgcagtggct catacctgta gtcctgacac 69481 tttgtgagac caaggcagga ggatcgcttg aggccaggag tttgagacaa gcctaggcat 69541 catagtgata ccccatctct acaaaaaaat tttaaaaatt agttgggcat ggtggtgtgt 69601 acctgtagtc ctaactactt agaaggctgg ggcaggagaa tcccttgagc tcaggagtcc 69661 aaggctgctg tgagttataa tcaagcgact atactccagt gagggcaaca gagtggagac 69721 cctatctccc aaacacacac acacacacac acacacacac acacacacac acacaccaca 69781 ttatacccca taaatgtgta caaaatcatg tcaatgaaaa atacttcatt tttatgtttt 69841 attattaata tttttagaga cagggtatca ctctgtgacc caggcttcag tgcagtgacc 69901 tgatcatagt tcactgcagc tgtgaactcc tggggtcaag caatcctccc acctcagcct 69961 cctgagtagc taggactacg ggtatgcacc atcatgcatg gataattaaa aaaaattttt 70021 tttgtagaga tggggtcttg ctaggttgac caggctgaac ttgaaatcct gccctcaagc 70081 aatacttcta ccttagcctc ccaaagtgct ggaattacag gcatgagcca ctatgcctag 70141 aaaaaaaatt tattaaaaag agtttttata gaactcatcc cttagccctg ttttcccgcc 70201 tttcctggat gttggtgggt gaggctgaaa gttccaactt gccaatcctc taatcctcta 70261 ttacctgttt tttcaggtga ctggccctat cctgggctac ctgggagtca cactctgagt 70321 tatatcatta gcataaactc agatgtaatc caaagggaca cattataaat aagaagataa 70381 tactatttct caggaaattt caagggtttt aggagctctg tgacgggaac cagggacaaa 70441 gacctaatat atttcttata atatcatagg actatttccc tcaatatgac tttaaaaagg 70501 ttcagttttt atgatgctca gcttccccca tgaccagggt gacactattt gtagtaaaga 70561 gggacagaac agctgacagc tgatagtcag ttgtcaggac actaatattt ctttattctg 70621 ttaccaagca tagcattaac agaaaggaca cagtgcaatt cattagctag gtttgcatcc 70681 aagacttgga cgaaaatgaa agagtattaa aagcgggtag aaactttgaa aggaaaataa 70741 atctcggaac ccccaaatca ctaagccaaa gggaaaaatc aagctggaaa ctgtatcgga 70801 caaacctgcc tctcattcta ttcctaaata agatagttac aaagatttgt ttaaaagcta 70861 catacccctg tcacaatttg cccataagga aaatcctcat ggacaaagga cagacagaac 70921 tcaaagtcat ccctctgctg ctgtgcaaca aatgcatatc tgattgcttc ctttgcccta 70981 ttgtttcact aagccagact aaggcataag tgactattcc cgtaaattgt gtattcagtg 71041 aaaggctaat cagaaactca aaagaatgca accatttgtc tcttatttac ctatgacctg 71101 gatgccccct ccctgcttcg agttgtccca cctttctgga ccaaaccaat gtatatctta 71161 catatattaa tcgatgtctc atatctccct aaaacgtata aaaccaagct gtatcccaac 71221 cactttgacc acatgttgtc aggacatacc gaggctgtgt cacaggcgca tccttaacct 71281 tggtggaagg aacatggctg cactgaagcc aggcagacat aggctgaggt aaacatcctg 71341 catgactcaa caagtttggt gtacaggcac ataaatccac gtgttatata atcacagaca 71401 tgtagccata acatgggaag gctcatcact cagctcagag ccactattgt ctgtaaaagg 71461 tacaactacc ctgttaatgc tgtacaggtg tgcccagaga aagagagagc cagagctgtc 71521 tgtcttgcag atggacagcg ggaagccagg acacagtttg gcttgcttgc acccagaggg 71581 aaagagttaa gctgctgacc ctgaagggag agctggccgt gtggctgcgc gtgggagcag 71641 gtacagcagt tagccaagac agagacagac agtgtaagag agctgctgaa taaaaccatc 71701 tttcacctgc ctacaaccca ccccctcccc tgccctgagt gttctttcag ctatctgcca 71761 cccatccacc cactcccttc tgacctcagc ataggctgga acctgaccct ggacctgaca 71821 cttggcaaaa taaattttct aaattgattg agactcatct cagatacttt ttggtttaca 71881 aattggtgac cacagaggga ctcagtggag gtggccctga cctttggcaa atctcctgtt 71941 ggtgcttggt accagcttca ggtatcttta tagctcaaac caacaggacc atttcctgag 72001 gcctgggagc tcccctccct tcagagaatc cctgatctcc caaaatttgg ttgagatcta 72061 aagtatattt tgctgtacaa ctccttttct ggagttttac ttgcttccaa caaggcaggc 72121 aagttttcct gcttccatga cgatggaagg caggtaactc ctttctggag tttgagcttg 72181 cttccaacaa ggaaggcaag tttgagtttc ttcccgcttc taggatggta gagagcagtc 72241 ttcagtctga gacccatttc taggtaaata actgaattgg gttttttttt gtgtgtcttg 72301 gaaattctcc ttaataacta aaggttaaaa ttgacaacca gctggtctta atttctcctt 72361 accattagaa cactcagcga tcatattgtt ggggtttttt gttattgttt cagtcgttct 72421 cccatcggat ttgaccaact ctacccaatt tggtcaaatc tgaatgagaa ttccaaatca 72481 tggggaaaaa ggcctctgaa ttggctaaaa ttccttgcag ctgaaaacaa aacaaaaaca 72541 tatgtttagt ttctgtgtct gcttcctgtc ttttctttca ccctattcct ccttcccctt 72601 tgccattgct gaccaagaaa aaaatctaga gaaggcttct aatgagtcaa acccctcaaa 72661 gaactcaaaa tgaaggcact attcatgcct ctttggagtg ttctgttttc tttgtggagt 72721 ttcaagagtc atgggcagat tctttgtagg tctaaagctc tgctctcctg tattgcctta 72781 cctgacctct ttggtttttg gcagtactag agattacctc gtactgtaag aggatttgac 72841 tttggcatgt gtaatggcaa atgagagcta caaagttaag agtgggtgag gacagtttac 72901 aggaagtggt ctcagctgag tatttttcct cctaggaaat tgtttaggat cataattcta 72961 gttcagaggt tcattctaaa gggtcttctt tgttgccttt cctcctgaaa ttaatctcaa 73021 ttggcttgtc tgcacatttg catgaggaac tgaactgttt tcatagacaa atgagagact 73081 gagtttcctc agctccgaaa tgaaagggca ttttgctcct cccagctgaa aggcacccct 73141 gggtgatgag tgggagcttt cttttctacc tgctttaagt ctgctgttac ttttctactg 73201 aaataaaatt cactgtttgc atccaaccat ttctttttgt tattgtttgc aaactggtga 73261 gtttgtatta ctatctcatg gccagagttc tgaattaaaa gctataggat ctttgtatga 73321 gtgtgcatat gtgtgtttat gtgtacatgc atgcatttta ttatgtgttt ttggtcacaa 73381 ggtaccaaat tggcttaaag ttaagtagta ctcataaatt aaataagccc aaatgctttt 73441 caagttcatg tgacttaaaa tatttaataa gctagcttta caattattgg taaaataata 73501 ttagaaatgt cttaagaatt gtcagcatac atttttgttt gcatttatga atcaagagat 73561 ttcatactta tccctgccaa atattataag gtgtcaaaat ttggcataat ggttaaacta 73621 taaacccagc ccaaaacaga atgatatttg cttgtgtaat tcttgataaa taagatgtta 73681 atactgtttt aatgtaaaca gctaaatttt ggattattta gtaagataac catatattta 73741 atcttaaggt tcttacttag gtaaacacct gaaattcaca ggctataaaa cgcttcacag 73801 ggaaataatt ttaaatgatg actatcacag ttttcataaa taatctaggt aaacaattaa 73861 attaggtaaa tataatggga tatttataga caaatttgtc ataatttaga atctaaagtt 73921 atattaaact agatatttca ttaaatgggt attttccaat aaaaaatata tataatatag 73981 taggaaaaca ttctttctaa aaaaaggaag tgtgctctta tttaaatgtg aactactttt 74041 gtctaactaa aagcttattt aaaggttatg tgtaaaacaa ggtaaaagaa actaggaaat 74101 aagagagatg taaggaaagc tatagaaata aagaggtatt tttggttaaa aaaaagctta 74161 aagaaaaata attttatata agaatattag gccaggtaca gtagctcatg cctgtaatcc 74221 cagcactttg ggaggctgag gtgggtggat tgcttgagcc caggagttca agaccagcct 74281 aggcaaaagt gcgaaatccc atctccacaa aaaaatacaa aaattaggca ggcatggtgg 74341 tgcatgcctg cagtcccagc tacttgggta ggctgaggca ggaggattgg atcgcctgag 74401 cctgggaggt ggaggctgca atgagccatg atggcgccac tgcacaccag cctgggcaac 74461 agacaagact ctgtctcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 74521 agaatcatat atggtaaatt tttttctcct agaataaaat gactggttgt ttaagaaaga 74581 gagatgttca ggacaaacca gaaagtccaa gcgtgtcatg aatggtatgt ataagtcata 74641 agatttatgg aaaaaaactt ttatatgatc taattggcta taattaaagg gaaatgattt 74701 ataatggtct ttctagagat tgggttttga tattaaaaaa agacatatac taaagaattg 74761 gtttgaacaa tgaaattttc ttaaggtatt gctttactct taaaattaca agacatttta 74821 attctttaat acaaagttca actttgtgtc tcactgtttt cagctttctc tcccctttta 74881 aaaggcctga aataataact ctatcattca actcattttc agctcctgta agtttttttt 74941 tcccttcagg ttctaatttt tgtagcctga tgctaaaaat gttttattgt aaaggtctaa 75001 agaaaacgtt tccttccaac ataatattct gtgctcttgg ctctaaattg ttctatgaac 75061 cagaaaattt gcatttatga cccaggaaac actctttcta tgtctaacta attcaagtac 75121 ccttttcatt agttttgagt tgcaggttat ctaaatggac tccccatagg gaacagcagt 75181 catactgcag atcttttctt ttgcctttgg gtaactggcc taataaacag attttatgct 75241 ttatcaaaat aattcctgtc attactaagt attggtttgc ttggaaataa tactgagatt 75301 aaaaaaaatt taattgaggg tattacatcc atgaaacttt ctgtatgtgc ttttaaagtc 75361 cttgtgctat taagttacgg ggctttgact cctgggtcta aaaagggtct aaaaaggacc 75421 aagtcctgct aaatcttaaa cactaacagc cgttaaagcc tcatcttcgg agctggtaga 75481 agatgtcaat caaaataaac tgcatgcatg agacacaggc cagaaattaa agctattcaa 75541 ctcctcaagg cccagggact atcatgagag aggtggatgt gtgagattat aagggccgat 75601 tttgaccatt aatgtcaaag gcacactgat gtaagaccag catatgggtc cctgtgttag 75661 attaacaagg ttttcttgaa gcattcaccc actccttaat aaaaggttat aaaggttata 75721 aaaaagactt aggaaaatta tatcttatag tcaagatgat taaaatttta taggtttata 75781 aaattttgaa aaacaaattt aattgggcct catgccatct ttattaggac ttattgtttt 75841 ggaaattaag tctcctgtct caaagaataa aagtttttaa cttttttaaa aaatcgagtt 75901 attactttgg ctaaatgaat gacttatttt acaatgacct gtgatcctat tttgtgatat 75961 catgtgtttt aaactcttta tgtttgacaa acttttcaaa atcaaatttt aacttcagac 76021 ctcattaatt ttttgatatt agtctcctga agtccaaaag agacatcttt ggcttatttg 76081 atataataaa accatacaga agtattgtta aatatgaaag tgtttaactt tctttggatt 76141 tatataaatg tgttattagt atgtgttcca gaattatatg aaattcctgt gattctgatg 76201 tgtcttatag catgttatcg gtggtaattg tgattattat gttaaattgt tgtatgccat 76261 agaagtaacc aaatttcctt gtcaattgtc tctttaacta tgactgttct aagatttttt 76321 catccacagt tattttacct tcatcttttt tttttttttt gagagggagt ctcactctgt 76381 tgcccaggct ggagtgcagt ggcacgatct cggctcactg caagctccac ctcccgggtt 76441 cacgccattc tcctgcctca gcctcctgag tagctgggac tacaggcgcc caccaccacg 76501 cccggctaat tttttttttt ttttgtattt ttagtagaga cagggtttca ccatgttagc 76561 caggatggtc ttgatttcct gaccttgtga tccacctgcc tcggcctccc aaagtgctgg 76621 gattacagac gtgagccacc gcgcctggcc actttcatcc ttttcaaaag gtggttttat 76681 aatcagcata ggactctgac aggtgctctt gaatgcacac ttttgataac tttggacatt 76741 gtgacactag aatagaggaa aaacctccaa ggctcccatg gagagctgaa atgtttatga 76801 ttatcaagca gaacaggagt taactacata gactgaacta atagaagact gaaataatta 76861 tgacttttgc tcaaaatgtt gctcatcctt tgtttttcag agccaagaaa acttttcttt 76921 tgagctattt acagctttta acacttaagt atactcctat aaacaaaatt tagtgcatat 76981 ttctctctac ctgatctctc caaaatttgg aaactagttg catgtatact taacttatag 77041 caacatagtt agttgcataa gtgcaataag aatctgtttt cttttgtaac aggatacaaa 77101 tggaaaaaac tggttatttt accaaggtat tgacaggaat aacatacttt cagatatagt 77161 ctcctttaag aaatcaaagt tgacttacag ggccaaaaaa agccccttgt aaaagctatc 77221 ctcatacctt atctacacag tccctgtaca gtctcctaac acgtggtaag taaagaatgc 77281 cactttctga caggcccagg agacccaagt tttctgggga ccttgaggtg aggaattcac 77341 ccaattaata caagtatttg caggcacagg ctgggcttaa ggcattaaag ttgaatctga 77401 gattccttat agaataaagt tccagcaaag ccaattttaa aaaaagagaa gactatatgg 77461 caaataatta tttttgctga ctttatgcaa atactgcagc cataagacta aaacttattt 77521 tgccaatgaa tttgtcctat gatttgtctt tagtgaaaac gggactggag agagaaaaaa 77581 atatgtttcc aaataaacta tagcatacct gttagattct agtttgccta gtgtttttca 77641 atttttatta ttttctatag tttagactga attctaattt tttcctggct acaagtctcc 77701 aaaataatgt tttcaaattg tccttctttc ctttcccttc ttccccattt ttcctcattt 77761 aaaatcacta aaaattaagc tgtgctttct tcaagccctg caaactgaag ctagacaact 77821 tcagaagaaa ataacagcaa cctatttaca tacatcaacc actttcataa ctgcctactc 77881 atgcatggac ttcagagtaa tatggcctat atagattttc caggattgtt cttgtttgtt 77941 gttgcctttc tcccttcctc cccgttttct cttcatagga catgaaactt cacaacctgc 78001 taaacatgag ctttcctaat aacatgggac ctaatcatgt aggaataaag catcctagcc 78061 atgagagatc agacaaacct aagaccaaag gactcatttt cttctaaaat gctttctctg 78121 aaggattttt aaaagggggg aaatgtaaaa aaaaaaaaaa aaaaaaaatc tcgcgacccc 78181 aaaatcacta agccaaaggg aaaagtcaag ctgggaaaag tcaagctggg caggacaaac 78241 ctgcctcctg ttctattcct aaataagata actacaaaga ttttttacaa agctatatac 78301 ctccctcaca atttgctcac aaggaaattc cttgtgaaca gacagaattt aaagtcatcc 78361 ctctcctcac gtgagacaaa tgcctatctg attgctttct ttgccctatt gtttgactaa 78421 gccagactaa gggataagtg aatattcctg taaatcgtgt attcagtgaa agactaatca 78481 gaaactcaaa agaatgcaac catttgtctc tacctatgac ctggaggccc cctccccact 78541 tcgagttgtc ccgccttttc ggaccaaacc aatgtgtatc ttacatatat tgatcggtct 78601 catgtctccc taaaaggtat aaaaccaagc tgtgccccaa ccaccttggg cacatgtcgt 78661 gaggacctcc tgaggctgcg tcacaggtgc atacttaacc ttggcaagat aaattttcta 78721 aattgattga gacctgtctc agatactttt tggtttacaa aacaaagcca gaggcctagt 78781 ccagagcttg gcaaatagtc cacgctcact aagggtgagc tgaagggaag gcttgcagtg 78841 ttaactttcc taagatattt tagtaataac ataatagtct ctcccactgt ggtttaaaat 78901 ttcaatagca catagatttg catgttatat ttttctatta agatatgaac tattaaatat 78961 actgaaccaa ttctccttct ccccctgaag gtttggttta agattaggaa gatcaagtag 79021 agaagatatc agaaggcttt agattttaga tatatgtcac ccattgccat cgatctgtca 79081 tcataatctt ggatgggctc tataatgcca tccttgttct ggagcccagt tggagttggg 79141 ttcttgtagt gccactgcag cctagaatat ccaggcttcc catgctgtcc ttgcctcctc 79201 tgtctatgcc ttgctttgcc ctgactttgt ttggcctggt ataggcccct gtcatcagca 79261 ttcatttgta gtctccattt tcttaggtct gttttttaca aatttttcca aaatgttttt 79321 ctgccagatt caaattctgc atggcacatc ctcaagtgca gttcaccaaa tgtctgttaa 79381 atgaactaaa taaacaaaaa agtcagtgca ccatttttta taaggcacat tttttattta 79441 aaaaaagttg tgaattcctt gtaaacattg gttcccagga cccctgcaga taccaaaatc 79501 catgcatact caagccccac acttgaacct gcagaatcca agtatacaaa aagttggcct 79561 tacctatctg caggttttgc atcccacgaa tactgtattt ttgatctgca tttggttgca 79621 ggtgtgaaac ctgtggatat ggggggccaa ttctatttat tgaaaaaaaa tccacatata 79681 agtggacata tgcagctcta acctgtgttg ttcaagggtc aactgtatat ctgttttgtg 79741 tgccaaataa ggtatggata agaatttcaa aggaaaagtc atttaaacag aattacaagg 79801 aagcctcatt ttatgagctg acattcctca atcaccaata cgccctgtac ggctcacata 79861 catacatatg caaaatgggg gtggggaaag tccctagtaa aagaacagtg tgtttcaaga 79921 cagggcccat ctccatttat gctgaagagc ttagacacaa ccctgtagtc taaaggaatc 79981 ttccgaagaa gctactcata gggctttatg ggaaggtggc agccaaacat ctatctagta 80041 tattgcaaat tctgttcact atcctccctt tctgtggagg acatagggaa ataggaatca 80101 ttctggccca aagttcctct gcatcaaatg aaacacaaga ggagactcta ccaaagccag 80161 aatgaggcag ttctcccttt gctcacccat ccccagccca gaaggcatat tgttgcatat 80221 tccccagctc agaagctgac aatcatgcat tcaaatgtat tttgtctcat tcaacctcct 80281 tgaaaggacc ctggttcctg atgataaaaa cacaaaaagg gaaccttttg tttagtgctg 80341 ctagaagcaa gagggaggag gtagttccct gattaatatt ctagattaat tgatcatcaa 80401 aaagaagcag tcagaccttg gctttcttta tgctgtcttc ctcatgtgaa actgctaggc 80461 tagaaaacaa caaaaattta aaaatggagt gtgatgagag ttgaggagtt attgttaatt 80521 tttttaggca tgataatggt attgtagtta tgtttttaga aaaagaacgt gtatctgata 80581 tggtttggct ctgtccccac ccaaatctca tcttgtatta taatccccac atgtggaggg 80641 aggggtctgg tgggaggtaa ttggatggat gggtttcccc catgctgttg tcatgatact 80701 gagtgagttt tcatgagaac tggttatctg taaagtgtct ggtgcttccc ccttctcact 80761 ctctcttctg ctcccatgtc agatgtgcct tgcttcccct ttggcttccg ccatgattgt 80821 aaatttcctg aggcctccca agccatacag aactatgaat caatacctct tttctttata 80881 aattactcag tctcaagtat gtctttatgg tgtgtgaaaa cagactaata aagaaaacag 80941 tcttttagag aaacatgctg aattgtttat aaatgacatt atacctggga ttggtttcaa 81001 attaattagg gggcaatggc aatttcccac aagttgttaa ttgttcaagc tgggtgctac 81061 gtgcctgagg gatcattata ccattctatg tatgtttata cacatttgac cttttccata 81121 ataaaaatta aaggaaaaaa tctggccatc tgcatttctt ttcctttttc ttagctgtct 81181 ttgctctcat ttttgtgtct tcatctgcaa attaaaacac attataattt ttaaaaagtt 81241 acctctaaaa cctcaatact tttgagtaaa taaatatgct tttgctgttt ctcatttgtt 81301 gttacagaga aatttattca ttcaaacatt tattactatt tgtcaagaac tgtgctaggc 81361 actggggata tatatattat taaagtcaca ttaggtatta tatgatataa gaaattaatg 81421 ttaatttttt cataagttaa tggtattatg gttaagtagg aaaatatcat tttaaagaga 81481 tacacagaaa tattttgtag tgaaatgtga tgacatttgt aatttacttt gaatacctta 81541 gtgaaaaatc catatttgaa gcaaacatgt caaaatgcta acaaatgtta aacctgagta 81601 ctaaggtata tagtattaat tatactattc cctttacttt tccgaacatt tgaagatttc 81661 catgataaaa agggaaagaa aactatgtgt gtgttggagt ggggaaggag aatgctttct 81721 aggagctcag tctaatggag gagacaaata ggtaaagaga ttgatacaat ccagagtgac 81781 ataaggaagc ttgaggtgct ctggacacac agaggagtct caaaatcaga actgatttga 81841 ggaggggggt tgaggaatgg ttgaccaaga aaggtggctc tgagagtagt tatttgatta 81901 caaaatccat tgttgcaatg agtaagccaa gcttgttgga agaagagact ggtattgtag 81961 gcagagggcc tctctaccaa gcaatagagt caagagaaag aacatggaat actgaggaga 82021 cagaaagcag ttctagccta ggcaacacag catgacctca tctctacaat tttttttttt 82081 tttaatagct gggcatggtt gcgcgtgcct gtggtcacag ctactcggaa ggctgaggca 82141 ggagaatcac tggaaccctg gaggtggagg ttgcagtgag cagagatcga gccactgccc 82201 ttcagcctgg gtgacagagg gaggccctga aaaaaaaaag aaagaaaagg aaaaagaaag 82261 aaagaagaag gcaggcaggc aggcaggcac gaaagaaaag aaaagaaaga gaagagaaga 82321 gaagaaagaa agaaggaaag aaagaaagga agaaagaaaa aaagttttca ggccacagag 82381 tatggtgcaa caaaaagtga cagggtatat gtgattagag aaggatcagt cagggtctga 82441 actttaccct gagggctatg gagaactact gaaggatttt gagtatggca ctgacatgct 82501 tagatgtgtg ctttaagaaa gtttggtttt ccagtggcaa ggaagtaaag tgacattggg 82561 gaccaggaaa ctggggcaat atcagaggaa gcagagcctt tgtggaatat gccaaccaag 82621 agaaagcacc ctctggtcaa tcattagaca ggagagggcg taaaaaaggt ggctttttgt 82681 gatgcactac cccagatggg gtgccttcca taaattggca gccagtctaa agggagtgcc 82741 tttttgaaaa ctgtacagca gtgcccactg ggcacgagaa ttacacaaat gagcccttcc 82801 actgggctga tgcagccctg taaggcttac tttggtacag cccaatcccc tgacttggtc 82861 tggtgccatg tgctttgtac aactatgcac aaccaatctg gagtcagatt gcttaggttt 82921 gaatcccaac tgtatgacct tggggacagt acttataatc tgctctgtct ccatttcctc 82981 atctataaga tgtacccaac agttatttat ggattagatg gtaataatga atataaagca 83041 cttatgggga atgcctggca caaggttagc aagcgttcaa aaactcttca gtattattag 83101 tgttttcagg aaccttcttc taaaataagt tttaagaaat aaaactagtt ttgaagcaag 83161 gcttgcagaa tgctgttggc tttcttacca ctggagtctg tttcagaatc agagtcactg 83221 tcgctatctt cagaatactt cttgtgtttt ctttttgata aatttttctt tcttcctttc 83281 ttggaccttt cacttgaatg actagactac tatttttctg taaaaatata aataaaaaag 83341 taacttctaa aaaagtattt ctgattatca attttattgc atttgaaata ctgaaaagga 83401 atagatgata ggcaatttga ataaaagtaa agtctaatgt ttaactattt ataaactttt 83461 tctaacttct caagaaaggt ctgtatctta aaaatacttt ctgatgtaga aactgaagta 83521 gtgcatttct ttggctcttc attctttact tcaatatgtt cagcagagct gaatttaaag 83581 aaaatggtta taagaattag aaaagtgatg aaatatggac catagagtat aaaaacacag 83641 acttaaaact tttcagcaac aatgacaatt gtccaagaac cattaaaaac aagcattgga 83701 cttctttcat aagaatacag ttcatgtctg taatcccggc actttgggaa gccaaggtgg 83761 gaggatcact tgagccccag agttcaagac cagcctaggc aaaatagcaa gaccccatct 83821 atacaaaaaa tttaaaacat tagccaggta tggtggtgct ttcttgtagt cccagttact 83881 caggaggctg agatgggagg atcacttgag cctgggatgt caaggctgca gtgagccatg 83941 atcatgccat tgcactccag cctgggtgac agagcaagac tgcctaaaaa taaaataaaa 84001 taaaaaataa ataaatacag ttcaagggtc tgccaccttc ataataataa aataacttac 84061 atttgcacaa tgctttacag attacaaagc actactcaaa ctgaaagcag catggaggat 84121 ttatattttc agaagaaatc acaggctaaa aatagagaaa acagaatcag aaggaaagat 84181 gaaagaaact gtatataaaa agtctgaaga agatatcata attttattct aagcatgtaa 84241 aaaacaaatg gtagagcagc aactgttctg tctcctcaga gcaagaagat cctaagaagc 84301 tacaatctaa agagatttgg gggacacatg ttctcaggac ctgaggctat gtcacggtaa 84361 aaaattgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtat ttgttttgtt 84421 gcagagaatt cttaaaataa gtattattaa agttctacag taggcttggc gtggtggctc 84481 atacttataa tctcagcact ttgggaggcc aaagcacaag aacagcttga gcccaggaat 84541 tcaagaccag cctgggcaac atagtgagac ccttctgcac aaaaaattta aaaattagct 84601 gggtgtagtg gtgtgcacct gtattcccag ctactcgaga ggctggggca ggaggatcac 84661 ttgagctggg aaggtcaagg ctgcagtgag ccatgatcgt gccactgcac tctagcctgg 84721 gtgacagagt gagaccccgc atcagaaaga aagagaaaga aagcaaataa atcaataaat 84781 aaagttctac aatactttag aaagggactt tctatattat ataatctttc cctttgaaga 84841 tcacatctgc aataatttag gtattgccta gaaaattatt tttagggttc cttctgactc 84901 taattctatg agttcaggtc ctttaagcat ttaccaaggg tatctataag aatacacttc 84961 agataataat gttttatcac cttgtcacaa attacaggtt tacctgtgaa acacattgcc 85021 tcaactcaac ctaaacacag aaatagagga gagggaagaa gaaacttatg agcaacacca 85081 aagagaaaag aaataagcaa aaaaaaaaaa gtactaatat ttcacatagt attggataca 85141 ggaaaattaa ctaaaccgaa ctacactggc atctgcattt tttaaaaatc gctgtgtaat 85201 actattgtct ggaatgcaat ctgctcctgg gttgaagaaa aaaatcaaaa tataaatgta 85261 ttatctgttt taacactgaa ctcctctatt aataggaaat tagcacccca tgaatacaat 85321 ttaaagaaaa caattagcat gctaaaatgc actagcaaca tgttttaatt cttaggtatg 85381 ttagactccc tttagttcct ttttaaaaaa gagtacctta cctatctgtt ctttggagga 85441 cactaatttt ttattgtatt ataatgtttg tcatttattg ttttaatttt tgctgttact 85501 cataagctga aaaaaataaa ctaagagagg ttcaatttcc tcaaagttta gctgcttacg 85561 tctataaaaa tccttataaa ataaatgttt gatttcattg tttgaataaa tattttttac 85621 ttctttattc agggaaaagg tattataagt atacagagtt agtctcagag ggtttcccat 85681 ttcaagagca ggtaatcaaa aagtaaacat acgtctaaca gttcttggaa tatgagtata 85741 gggaccaagg atgcctgcca gactataaag acacacagtc tttgtggtgt gaggcacaaa 85801 attaaggccc aatattgtgt actaccttga caattgggga aaccaggagg gctcccaatg 85861 gccttactgc gaattctcct ctccattctg ctcctgcaaa taaggtcccc tagccaaatg 85921 accctccttc tcaaagagac cagacgcagt tccagcttat ctctgggtag cagattttag 85981 tacattgtca gcttgtggaa ttattcaaac aagccaatca tatccttcca agggaaagca 86041 ggtattacct caatttcttg ataccacaaa gtctacctcc cacagcccct gattattcac 86101 tgtcttacca acagcaacct cggtgtggcc cttcgtggca cgctgtgttc tctcccattc 86161 cagctgtgaa catatgtgat taataaactg ctgttgatct catctgtcct gtgttcagct 86221 actccataac acaaagggtg ggaatccttc cctcatcaac atggtgaata gcagacaatt 86281 ataacactcc ctgctgctac ctggttttat ctgtgcctta agataaatta tttatcttct 86341 atgtgcacaa atgcaaattt tacccaagtt gttcttatat gttttggtta ggattctaaa 86401 ataagaagtt ttttgattga cacaaactgg gaggaccttt ccatcccaaa ggcaggagcg 86461 ctcctcagag gacaggtgca gtagtttcac tcacctctcc cacatgggca aaaaatgtgt 86521 atctgtaacc taactcagga ctgtttttac ttcctctgct cccaggcatg aaactctaat 86581 aaccaacacc tgtgaaatgc ttattacgtt caaggcacta catcattaca ataaccctaa 86641 aaacgtcagt actatttgta tccccctttt acaggtgaaa aaataatgaa gcttggagag 86701 gttaccaagg tcacataggt attaagtagc aaagcaaggc ttcaaactca gaaagtctca 86761 atccagaggc tgtccttctc atcactatgc tttattacat cccactagtg gaagaaaaag 86821 actcatcaag acatctccca cctgttggaa tttcaaccta cccctcaata cctgtaatgt 86881 tttaaaattt ctcatttttt aaccgtagtg catgtaaggc actcagtata gtacctgaca 86941 cattcaataa acatttgata tgagtactta cacagcaata attataatta ttttccttct 87001 caaaccatct acttcccact ccaatattca ctctcattag aggaaaactc atctcactct 87061 cctctcattc ttcaaggtca agattgagac cacttaagac cttcctgaag gtagccacat 87121 ctgcagttta gcacatcact ttgcctacat tcatccagtt atcttgtctc agaggaaggt 87181 atttctcctt tccaatttaa acccttaatc tcaacccttt ccaactccct acaaggaatt 87241 tccacttagt tagcctgtta ggatctcaaa ccgatcattc actcaacaca caacactttc 87301 agtagattga gcccctatta agtgccaggc actgcaaatt ctgtaatgaa taaaataggg 87361 gtcctccttt ccacgagctc agtctagtgg agaataaaaa tatgttcacg aacgataaca 87421 caaaacaaat aaatgctata cgtaggtctg tgacccttag ccgtgtctgc tccgagatca 87481 gaggcgaaag agggtggagg gagaccgggg caccctacaa aggggcccaa ggatgagtga 87541 gaatgtgaga gaggagaggg gcgaacaagg agctgtcgtg gatagattgc agaaaaggat 87601 gaaaatgggc gcggtgaagg tagaaggagc gaaatgcaaa tggtcagaag cgggctgaaa 87661 gatgaagcgg actcgctgaa ggcaagtggg ctccttagag cgaaggtgca gccatgatct 87721 cactcacctc tgccacggat tcacctccct ctcctcgtcc cgtaggagcg tgagtagttg 87781 ccgcaatagg cgcaagagca ggcagaagcg aaagtgatgg tccacggcgc agtcagccgc 87841 tcttgagaat gcgaccacaa ctgggagcgg cagggctggt ttcgggagcc ttggctgagc 87901 ccattccatt ccccagtcag agaacaagag ttccagtgag agttgcgccc cagcggggaa 87961 cgggcggctt tgctgggctt ggggctcttg gacaaactgc aacacattcc cccgcccgga 88021 gcccgaggcc tccctctcgg cactgcagaa ccggacaccg gagctactgc agcctcgaaa 88081 acgggacatt ctcaggagcc tcagcagtat ccgttgcctt tgctccaaaa acccagcccg 88141 cgcaaccgct gggagcaaag gaaaccatgg gaactattgc gagtacgttg gcaaaacatt 88201 tccccaggtc tgtgtggcct ccggccaagg tgaagaggtg tgtgtcttgg tgtaaataaa 88261 ccgctactgc aaccgatttc gctcatcctt tggttccgca ttgaggctcg ccagggtcta 88321 aatccgcaca gagactaggt cttccttagg cttctacttg aaataactga agggcaggac 88381 gagagtccac tggagacatt tcacagaatg gccatatcaa caacatagac aattctgttc 88441 gtgtacgggt gctttccgtg aatgtgtgta ttacatgttc tataatattg ggcgcactct 88501 gtgtcgtgtg tgtgtgtata aaatatgtac atttggttat gtgtgtatat atgtacatat 88561 ataataaata ggactggctc aaacagattc acatattttt tcggagtcta tttgaattaa 88621 aagaaaaatc tgagatggga agaaaaggga caaagtattc aaagtatttc gtacacaact 88681 gaaagtgttt atttcagtgt tgtcagattt attaatagca aaaagaaata tacagtaggc 88741 cacttatctt tgaatttcaa ataaagaata atgttttagt ataagtatgt tccatacaat 88801 aatacagtag aaagtgacta ctttctatta aagatgaggt gatagccacg gttgtacttc 88861 ttcctctcct actacttcta aaatttgttt cttatattct ttttttgttt gcttgttttg 88921 ttttgttttg agatggagtc tcactctgtc acccaggctg gaatgcaatg gcgcgatctc 88981 agctcactgc aacctccacc tcctgggttc aagtgattct cctgcctcag cctcccaagt 89041 agctgggatt ataggtgccc gccaccacac tcggctaatt tgtgtatttt tagtagagac 89101 ggggtttggt cacgttggcc aggctggtct cgaattcctg acctcaagtg atcctcccac 89161 ctcggcctcc caaagtgctg ggattacagg catgagccac cgtgcccggc cttgttcctt 89221 atattctaag gcttttcgct tgctgtttga gaaagggaag tgggagcagc ccctgatatc 89281 cacaaactga cctcgcactc acaattaggc atgttcttct cttgttgaac ataaactcac 89341 agaacaccaa cccaagacaa gatcactggg agcctgatca agtgagacaa aacaagacca 89401 cttcataagc ttgtctaagc acagacaaaa acaagctcac tgtgacaccc agaaaacacc 89461 aaacacccct tcttggccaa gataggtcac tgctactttg ccagttaatg tacagcttta 89521 tcctccctct agtctgccct ccctataaat aagatttatt gaaatgtgca atcatacaat 89581 tgcccctgct ttctgatgta tctgagacgg aactcctgat tcctagacct tcccccaaat 89641 tgtccaaatc ctataatagg ttcattctaa caacttctta ttgaaacaca tcatagttac 89701 ctatagtgtg tgttgtccct ccctggagta agagtaataa actcaatctg tcaactatgg 89761 gtatgtttct ggtagtcttt ggttgaaggg caatgacaca ttccactgcc taaaatgttc 89821 ttccccagat atttatatgg ccctttccct tagttcattc cattctttgg ccatacatca 89881 tcttttcaga gaggctgtgc ttgaccacac aatttacaat aacaccctca tcattctcta 89941 tccctttgct ctatttttct tcatagcact tactactttc tgtttttgta ttatatattt 90001 atattatata tatacaaatg ttgattgtat gtctcttcca ttataatgta agttccatga 90061 agaccaatat ctcaacttga tgataattat atccacagtg cctacaacac tgcctacaac 90121 actgcctggc acagagtaaa tactaaatga tatatgttta atatgtattg aaaaaatgaa 90181 tgaagtttac agtgtcgggc tttataagtt tcattgtcac atagtcataa tcaatctgtt 90241 gtttggactt agtcttccac ttaaatagat tggcttactg caaaatctcc attcttaatt 90301 ctttatttgg actcatttct tatttgattt gacttcatat attgccaagt aattttcaca 90361 aatgactcat aggtagtata ttccctcaat tttgttcaag tttggaataa gctgtctgtt 90421 gctttttttg gtagattgta ttattgttag caattatcga cgactccttt tccagtagga 90481 gagttatata tccctatcca attgactttg gaaagtttta agaaaaaagg gcaataggca 90541 gtattagaca ggtcacttaa agagagattt gttgggatta agcagaaaag gggatagtgg 90601 tgagaaagca acatatattt ggcacttact gggttccaga cactcattcg ttcagcatta 90661 agcaaccatc tctatctaca atttgttggg ccaagggatg cttcacaaat aagttagtac 90721 ttgagttggg tttttatgga aaaccaggca cttgccagga actaaccaga caacaggtta 90781 agcgaggtgt ggctatgaag ggaatgacta ttccaagctc agggctccag gctcagactt 90841 gaaggcagta aattgcatgg catattctgg aaaccgtgag cagtttgaca ctatgcatag 90901 ttgattcatt taatcctcaa agtgactctg tgggaaggta ttattatccc tattttacag 90961 atgaagaatc aggcataaga tcaaagagct aggaagtggc agtatcagaa ttccaaccca 91021 ctcacatctt ctgcttaaga ctttgtttct tccactctac cttactaggg tttctgcaga 91081 tcccatggag tgaccttacc aaatctgtaa ccataagcat aacatatagg aagtgctcta 91141 agactgaacg agtgtcactt tatcttttgt catcaattaa tatatattaa gcacccacaa 91201 gattgttggc attttaacac aggagaggac agttaccgcc ctcatggaat tcacaatcaa 91261 ggtcaaaagg caagcaaacg ctgacaatac ggctataaaa cctaactcat cccattttca 91321 gccaataaaa cctttctaga atgtgaaaca attgatacct ccaaatccat ttgatatagg 91381 ttgttctctg acctcaggtg atcctcccac ctcggcctcc caaagtgagt tgttgttgtt 91441 tgaaatagag ttcaaactac tgggttgttt gaagtagggt taagccctcc agattaaaaa 91501 caaaagtaaa agtgaaatga ctagttgaga aacagaaaat taattaaact ttccagtgat 91561 cctgtcccaa agacaattat taaggccact caggcttctt ggtctgttct gctttcctac 91621 aaataaaatt cctttcctga ccattggacc ccctttcatt ataggtagct tttaaacttg 91681 tttagttttt gcttggttag ggggtgagag atacagcaag aagggaagag aaggtttact 91741 gtgctttgaa tgaacaggtt tcatcatatg ttagttggta ctccaaactt ttaaaatact 91801 ttcaggaaag tgacaatggc atggcaattc tgttgccagc ctcagtgatg taaatgctgg 91861 tacaaaataa attgcattgt gccttccagt ttactgagag ctgaccaaca aacatgatat 91921 ctagagtggc atctattgcc actgtgatct gtatttcaaa gatatgtctc attttagttt 91981 tcatcaatta agccactgaa agataaatag atcctaaact ttcaagatat actttgttct 92041 aattactaag attttttttt ttactactaa gttttgggga ggataactca gaaaaatggt 92101 tattaagtac tcagttgcca ggaaaaatac taatctgcct tagtttattt tacaacagca 92161 tacttctatt tgccatgaaa tatctttata gtaaacaatg aaacttttca taatgctaag 92221 tcctaatcaa aaatcaataa tgcaataatt cccacatagt ttgttttccc acaatacaaa 92281 atcttggaga agagcttaag gaattgcata tttccatgtg aaatacagct atattatacc 92341 actttgatgt tttattatac catgtggctt tactttattg aggaggatga agaatagatt 92401 agggataggt agatgagatg atgacatcca tgtctgattt cctgtcagag aaaattagat 92461 atagttacat ctgttttcag tctaatatcc tgatggacat tttcagcaag attttttagg 92521 aaagaagcac aggaattcca ctgcaaggta ttaataggta caaatttaag attttggaaa 92581 tgaatgcagt attatcctat atatatcttt ttaaaaagat aatgaaggta gtgaatggcc 92641 ttaattatgg cagaaaaaaa gttactattt ggttccttag ttgaatacga tcaataacca 92701 gcctaaaagg gaaacatttt gctctccaaa cacccatcct gttgcttcat ttattatggt 92761 actcagttgg aagggtatta ccagaggatt ttaaaaaatt atgtaactca caatgtattt 92821 tttcggtcat taaacaatca ctgaacacat actatgtgtc agacactaaa gccctctgaa 92881 gatgaaatgt tatggaaata gtaataataa aaaattaaaa tagtaacaat aaaaaattaa 92941 aaatgcaagt gttaatatta ttaatctttc aagatacagg tcaaatgaca gcattcaatt 93001 tgctattctc ttctctctgc ttctacacct tgtacaaata tttttgtcca tgtttgtgta 93061 cctacctact gagctcccta aagaggtaga gacctttggg ttattcattt ttgtattccc 93121 gagagcctaa catagggttt ttgctacaga tcaagcaggg ggtcagtaaa tgctagttgg 93181 aatgagggaa ttaacatagt actgcaccat ttagtattca attaagaaga tagttgtcca 93241 aacactaaat ttctttctta aaagtagtta ctgtaatgta gttgctctct tgttatccca 93301 gaacttgaat taccataaga agttgtgaat ttctatttga ttgtatgtat tttgattata 93361 tttcaagatg gtttatcatc tagcaaacag tctaatagag cagattcaaa cttgcaaaat 93421 aaatgaaacc tgtatagtct ctaaaagttt gtatgcccta gaagttcaca ttgaatttcg 93481 atgaaaacga ctcgcttccc ttccatccaa ttgcttgcga ctgtcttacc cgatctttcc 93541 ctttcctttt ttcccacccg ttagccctcc ggaaagagcc gaacacacaa gagcttccca 93601 gtcttcctcc gcccccttgc ggaaagaacc gaaggcagag cacggcgccg aagtggagcc 93661 gcctcaagct cgggcccttc cggaccaccc cggcctgcgc tcggaagagg agggcgcctt 93721 tggcttcagc gcttcgcccc cgcgctgtgc cctctgtcgg cggcgtgggg cagctgtagc 93781 agcgttggcg gcaggaggcg gcggccgcgt cgacgtcgac ccagactgga gcgacgttta 93841 aagaaggggc agaatcgctg gggagtgcgg cttcttcttg ttgggggact cccagccttc 93901 cgcgcgtccg gaggaggaga agcggcggcg ccgggaagca ggtgaggacc cggccccaat 93961 cagggagggg gcgaaggagc gcgcttgcct tctccgtccc tgggccgcgc ctgcgtttgc 94021 attcgcctca cttgaaccag gaagcggcag agtcggaggc tcagctcctc cggctctttc 94081 tttgtgtggg ccggggagcg aaggagggga cgagccgcga gggccgcggc gcggtcccct 94141 tcgccgttag gggccgaggg gctgccggtg ccctgggtag gccccggggc ttggggaatc 94201 catcacagag acccacttgg cttttccctc gcccctctcg ctgctttttt gtgcctcttt 94261 tactccccta gcccctactt tgatttagag cttttctgcc aggcccatcc tccccttccg 94321 tccccttccc ggggcacaac aatgccgcct gcttcgcttc atcccccccc caacacccca 94381 ccctgccgtt cgccaccttc tactttccct ggtaccccat attcctcccg ccctcactct 94441 gctgtttgta cctccgtctg tatttgcaag aagcttgctt tgcacgtgaa ttggggttaa 94501 aaacctcggt tgcagctggc agggtccata tggggaagag ggaggtggag gaagggagat 94561 gaggttcagc tgccgcaggg agccgcgtgt cggtttgttt cactccctcg cggatggctt 94621 ttatctcttt ccaccgtcac cagctcctcc gaactggccc tcggtcaagg gtttcacagt 94681 gatgtggaat ggcttttcca cagtcgaatc gcatatcttc ggagtgggtt cagaccgact 94741 tgtgctgtct ctggggccac cctgagtggg aggaggggga cacataaacg aaacgaaacc 94801 gagtcccagt gtcacctgga cacgtacatt tgacgcattc ccatattgga agaacccggg 94861 agcaaatgac aaaaattgcg tgccgttttc agaaaacctg tctttccact taatatcgac 94921 ttctgtcgaa atgcccgtat ttcggaagcc ctggtttttg gcaaactgtc caaaggtcac 94981 cctttctgat gatgaatgcc tttatggtaa cctggaattc cttggtactg ttaacatgca 95041 acaccttcta ataatgtttt aaatcttgat ttttagactt tttgcctgct gtttgttctt 95101 aaggtatcat tttgaaaaat ttagaaggta tttgagacag aactgccctc cacctgtata 95161 taacttaagc attgtgacat cgacaccctt ttggtaactt caaagggaga gaatattaat 95221 aaaaatattt cttcactaaa agtacttgtc aactaattgt taatatataa tatccatata 95281 ttaaaatata tccatatata atatgtatgg atattatata tttttaatat atgggttttt 95341 ttaaactctg gcctctttgt tactggaaat atcttctgca aagtgcaaaa aatgatgttt 95401 tgctagtgct acacaacaat gcattctact gacatcatcc cctttttcta gagtagtcta 95461 ttccaataag ttaatgtttt cattttcact aactcatttg gtgtggaaaa aattctcata 95521 ccaatgcata tgattcttta ccaacaaaaa taaaatgttt cgtgttttct tggaagtggg 95581 catttagaaa tggtaaactt tccttttcct ttgtcatatt ttttcatgat agtgtctgtt 95641 gacttcactt ggcactgttt aacattccac ctctggaaat tgtaaagatg gagagatgag 95701 gggaagggac aaggggatga agaaagagga aagggtaaga aagatgttta atagtttatc 95761 aagatgaggc accccaattt aaataacaga aaaataatgg gacgttggga ggtaataatt 95821 gtcagtgttt ataaatgtca ttggctccag ggtgtagatt ctgcatatat tcttaaagat 95881 tagttaaaat taagagtatg tatttcagac acagtattgc acgcgctgag cttttctgac 95941 atttcaatca tttacaggcg aaataacttc tgtgtgttca tgatgatttt ccttaatttt 96001 gacagttaaa agtcaactct tacctcttct tggagagaaa acccaaaatt gggactaaga 96061 aaattaattc taatctttaa aaatcactta gtggaggtgc atgtatctaa tattcaaaaa 96121 aaggtgacta tggagattac tgttcaaatc tctgtaacaa tgtaaaactt gctcaactca 96181 ggagcaagcc atgaaattgg acacttgttc caaaagccaa cctgtatgaa caatttctgt 96241 aaaagccaaa aaattatgct gaactttggt taaaacttga ataaactatt taatgatgct 96301 actgcttaaa ttctaaataa gtacttttgt tttttctctc taatcctctc ccatcccctc 96361 ctctctttct cttaaaggca tggagagtag aaaactgatt tctgctacag acattcagta 96421 ctctggcagt ctgctgaact ccttgaatga gcaacgtggc catggactct tctgtgatgt 96481 taccgttatt gtggaagacc gaaaattccg ggctcacaag aatattcttt cagcttctag 96541 tacctacttc catcagctct tctctgttgc tgggcaagtt gttgaactga gctttataag 96601 agcagagatc tttgcagaaa ttctcaatta tatctatagt tctaaaattg ttcgtgttag 96661 atcagatttg cttgatgagt taattaaatc agggcagtta ttaggagtga aatttatagc 96721 agagcttggt gtcccattgt cacaggttaa aagcatctca ggtacagcgc aggatggtaa 96781 tactgagcct ttacctcctg attctggtga caagaacctt gtaatacaga aatcaaaaga 96841 tgaagcccaa gataatgggg ctactataat gcctattata acagagtctt tttcattatc 96901 tgccgaagat tatgaaatga aaaagatcat tgttaccgat tctgatgatg atgatgatga 96961 tgtcattttt tgctccgaga ttctgcccac aaaggagact ttgccgagta ataacacagt 97021 ggcacaggtc caatctaacc caggccctgt tgctatttca gatgttgcac ctagtgctag 97081 caataactcg ccccctttaa caaatatcac acctactcag aaacttccta ctcctgtgaa 97141 tcaggcaact ttgagccaaa cacaaggaag tgaaaaattg ttggtatctt cagctccaac 97201 acatctgact cccaatatta ttttgttaaa tcagacacca ctttctacac caccaaatgt 97261 cagttcttca cttccaaatc atatgccctc ttcaatcaat ttacttgtgc agaatcagca 97321 gacaccaaac agtgctattt taacaggaaa caaggccaat gaagaggagg aggaggaaat 97381 aatagatgat gatgatgaca ctattagctc cagtcctgac tcggccgtca gtaatacatc 97441 tttggtccca caggctgata cctcccaaaa taccagtttt gatggatcat taatacagaa 97501 gatgcagatt cctacacttc ttcaagaacc actttccaat tccttaaaaa tttcagatat 97561 aattactaga aatactaatg atccaggcgt aggatcaaaa catctaatgg agggtcagaa 97621 gatcattact ttagatacag ctactgaaat tgaaggctta tcgactggtt gcaaggttta 97681 tgcaaatatc ggtgaagata cttatgatat agtgatccct gtcaaagatg accctgatga 97741 aggggaggcc agacttgaga atgaaatacc aaaaacgtct ggcagcgaga tggcaaacaa 97801 acgtatgaaa gtaaaacatg atgatcacta tgagttaata gtagatggaa gggtctatta 97861 tatctgtatt gtatgcaaaa ggtcatatgt ctgtctgaca agcttgcgga gacattttaa 97921 cattcattct tgggagaaga agtatccgtg ccgttactgt gagaaggtat ttcctcttgc 97981 agaatatcgc acaaagcatg aaattcatca cacaggggag cgaaggtatc agtgtttggc 98041 ctgtggcaaa tctttcatca actatcagtt tatgtcttca catataaagt cagttcatag 98101 tcaagatcct tctggggact caaagcttta tcgtttacat ccatgcaggt ctttacaaat 98161 cagacaatat gcatatcttt ccgatagatc aagcactatt cctgcaatga aggatgatgg 98221 tattgggtat aaggttgaca ctggaaaaga acctccagta gggaccacta catctactca 98281 gaacaagcca atgacctggg aagatatttt tattcagcag gaaaatgatt caatttttaa 98341 acaaaatgta acagatggca gtactgagtt tgaatttata ataccagagt cttactaaac 98401 tcctttgaaa tactagaaag ttttgttttg gatgatgggg caggggtttc agaagatctg 98461 taaaacaaat taaggtgcga acaagttaat ttgatctgcc acattatctg aaggaagtgt 98521 agtgggattt ttgttgataa tttttagaag caaattttcc tgaaagtttt gagtagaggt 98581 gagaccccct ccccaagtat ctgtttatat agttagtttt cagctcattt aaaagaggca 98641 aaaattaaaa gcttggagag atagtttcct gaatagaatt tgaagcagtc tgaatgttct 98701 ttgaaaataa ctggagttat tagcataccc tagtacatct tacagctttc cccttccatg 98761 ttagcacttt actgctgaat tctcaatttt cttaacattg agacaataaa tgtgtgtttt 98821 gtcttgtata tggcataaag agtaaataag ttttagagtt gttctggaaa atgtcagaat 98881 aagtcagtac ttgggttgtg taatctgcta gtccaagcga acagcaacct cctgctaccc 98941 tccctctatg aaaatagcca tgcagacaag tctctcatct gaagaacaaa ttagatttag 99001 ctaattagaa ttaatcctgg ctttcattgc catagtctgt aaaagacttt ggtggctaga 99061 ccactttata ccttcgcagt gtggtctctg ggggcaaaaa actaatgaaa acaatctctg 99121 taatggcaga taggaggaga tgaaaagttc tgttgcatgg atttttaatt ctctggctac 99181 cacatagtag agaatggaat gaagatttcc ttttggcttc ttaaggttaa aaatattccc 99241 atgaacatga aaattttcaa attttgaatc tgaaagccac caaatgtatc tttatgtata 99301 aatccttgta aatgatagat tccatgggtg agactttaca tattttgggt gggaggctac 99361 tggcatatat ttttaaatgt tcatattgcg tagaatctcc actaggaagt ctttatttga 99421 aatagttgaa tcagtgatct agtattttcc tttcggcaag atttgttagg tttttacccc 99481 ttctaaaata agttttattc catctgcaaa ttgctgcaat attatagtaa tcagaaacta 99541 cataaggaat gttatatagg cttgtcagtt cccatttttc ttgacaacaa taaataccac 99601 ttttaaaaat gacacatatt taaacactta gaaaataaag ttaacactta ctgaagtgct 99661 agtactaaac tgtgctagta ctaaaagaaa acaggttgga acatacatat agcctagcat 99721 ttataacaga attgttgaac gtctgtaaat gatttttttt ttttttgcaa aggaaaaaat 99781 tgatactgga aaagattgtt gtgcatagtt attagtcatt tgtaaccttg cttaagtatt 99841 tcttagtcca acatagatat tttctttctc ctgaccatgt attttaaaat atagtctatt 99901 tcttgacttt gaacttaaag ctttaatcat aatttctcat gtatacatcg ttcttctgat 99961 ggtaagctgg atttgaaggt agtggtttca gtgtttctta agttggtagc tgagggtatc 100021 aggcatcagt tcatgcaata atacaagaaa aaaaatcctt tgcttgccaa gaggtagagt 100081 gatgtgcatt tatctgtttt ctgttctgta agtctagacc ttcaaaccat ttgtaaacta 100141 acccctggga aatttgaaat tacctgataa cttaagactc tgtgatctct ggaatcacca 100201 tatgtttctt ttttgtgtag atattaataa cattactctt tgactatagt gtgcactctg 100261 aaatgtactc agtgaaaatt tgttttgagt ttcattaatg ctatttcacc agttagacat 100321 aattacttct accgatgtga atgatacgga tgccggcaga gcttccagat ctttcagact 100381 caactgctag gtcaattagt ttgtcataat aaaacttggc agattctaca agtctattat 100441 gacaaaccag gaactaattc tataatggaa aactatccat tctgaataat aggtatgtaa 100501 ttatttgctg ctgctgctgt gctctgtaaa ttctgaatat gacatttaaa ctctgtgcct 100561 actaaaggta tcttctggag tttttgggag gagagaaact ggaaaattaa attgtatttt 100621 tgccagaaga ctcttacttg catgtgtctc agggtcttca gtttttctat aagtttccat 100681 atccaaagtt cagaattcat gtgaaatact tctttggggc aaaagtcctt cattcctggt 100741 atttattgga ttggaaatct gtagcaagat gctgtttaaa attaccatat tgttttttta 100801 tcttatactt agctctctgg ctattgaact tccttttctt gtttgaagtt agcttcaaat 100861 ttgctcctat gctaaattac ctgtaaatat tctggatagg aactacttga aatagtaatt 100921 tgttaaaaga tatgacaaaa tgaaaatgct taaactacag aaatttaaaa atgccataac 100981 aatcttgcga gactaacttt aaaatatact ttaaatgatt attatgattt tggtggtaac 101041 gatcccccac acacaaccac tatgaagaaa taatgccgca tttttccccc attgtaccaa 101101 aaagataaaa aaatggtaaa cactgatcaa ggtattttgt attgtcaagg catgcatatt 101161 ctaaagaatt aaatgctaac ttaacagcac tggctttctg gctggtcaac tatatgaaac 101221 cttgttcatt cctccgagta ctgtaatgtt cacacttgta caatcttccc tgtcatgact 101281 ttaagttcta cttttcatta accatggcct gatattagtt cttagagctt cttgtggcaa 101341 aaataaaatg atttaattct gatgtttgag tgcgtgtttt acaagattgt ctttcagaaa 101401 ttatatgggt ttttatattg tttttcaatt tttatagcag gagactgggg ctgtatttct 101461 gatgacagca cggcaaaatt tcccactagg ttttatatgt tggttaaaaa tgtccccttt 101521 tatcttgaac cctaatagtc aaaagtgagt cagctgctgt cagttgcttt agagcgtttg 101581 ctgtactttt atagctacct tgacacaagt ggataacgag gggaggatag ttttattcat 101641 ttgaaacatc acaaaagcag tctgagtttt caacatggca gtgatacaga ttttaagcaa 101701 catccagttt atacagtttc tggattaata ttttaatgtt tcatgcccag ttcagtactt 101761 taaagcagaa ttaggaataa agcagtaaat attacaaaat gcagtcaaga tacatattgg 101821 aaatacaaat ccattcatta cagcaaatgt tttgcaaaag agtaaagcag caaatattaa 101881 tttttctaag gtaacatgca ctttgtataa ttcaatgtaa taaaaaagct gctcaaatta 101941 agtgttacaa aatctatact gtttcaggtt attttgtaaa gtaagaaaaa tacatagaaa 102001 tgagccatag aatcttaaca ctttaagaaa tgtgataaat gtaggataaa ctgtgtaatg 102061 gtgccttaaa aaattaaata tggacattca ctaaagccaa cggtcaaatg taaatggcaa 102121 aaaaattcat tgagtattac aagttgatat ttgtttagtt agtcagttgg gtattagtct 102181 ctttttgaaa tccagtagaa ttttaatatg ctcagaactg taaaaaatca tagtaatttt 102241 gtataacata aaaggattat agtttttcgg attcaaaatc tggataagaa cattcatgat 102301 gctgaaagtt aaactgaata tcttgcaaac atttactgtt aatgagagag tgcacttctt 102361 gtgccttgat tcttgatggc taagtgtcta caaggtagat gggagcacca gcatttgcgc 102421 tacctctaat ccatcaagtt gggaggtagc aaaattggcc ggaaaatttt ccagtgtgtg 102481 ttctgtgttt ggagaactgt ctaattaggt accctcctgt agcccctgta tttttacgtt 102541 ataaagtata ggattatcaa tcttccttta aatcaattct caggttaaca taagatcaca 102601 atgaaggtgt tcattctgaa gtgaaggcat gggtgcaaaa gctttaaact catcaggttg 102661 atggtttaaa gctatgtgta attaaaagaa ctgtacaaat acctaccacg gtgtgccaga 102721 tggattttaa atctgccgta acacttaaat atttgtcaag ttggctacgt ttgaggtttt 102781 gcagacttga agcggcattg taacactttc ataatcttga atgctgtcat gactggtctc 102841 aacagacaaa aggaccgata agaccagtgg cattaaaata cagatgactc ttttggggtg 102901 gggtgcagga tacgtgcgga ttgctggggt taagcacaat atttgaagat taaatagtca 102961 caaagattag taacaattca tatcacgacc caagaaccta atttataaca tttttaaact 103021 ctgaattaca accataagat ttatatctcc tgtagcaaat gtattttgta ataatgcaac 103081 atgtagtaga accactgtct cctaagtgat ctacttaaaa catctcacat gttgctgtgt 103141 atttcagtgt ttccggaaca atacatcctg ttccccacta ctgaagatgc aagaatattg 103201 cacttttccc tttaggaggt accaataaca aaagctgagc tgagtgatca caacagccat 103261 ttttacaata ctcacagaga aggaaggagt aagagatacg ggcagtcttt ttcaacatcc 103321 agacacaagc aaacaaaatc ttagccagaa ctctctgtat tggaactact gtgtagcccc 103381 atcagctggg aaattgtcct tagggactga ctttaggggt agggaaagat agggcctata 103441 aaccaggtgc ttcaacctag gtgctggggt ggcaatctgc caagggcccg agggcaacac 103501 tcctttacat aaagtgaaaa agatggcaac cacacttctg ctgaatacag aggtggagca 103561 tagggtaatg caagtccaaa caagcttctt gctgggctgg ctccccagtc ttgcttagag 103621 aatgtgagct gcccttctgt gctgcgttca gcctgaccac ccctgctctg cactaataat 103681 gaataagctt cgtccctatc tttccctagc ctggcaggcc attgtaaaac tttatgcata 103741 agagtcacac tttaaatttt gtaccctaca cacttccact gaagcagaaa tacaaaagcc 103801 acaataatct caatagccag caggcattcc tctttaggga ctgtaaggtg gtggcttttc 103861 aaaaggtgga tagtagggtg gagagtaacg gggcggtgca cttgaggacc acatgtagct 103921 gggtgaggga gaggcagact ggggctcatc agaaagtcca gagggagggg aggatggaaa 103981 gacaccggaa tgctgtgtga taaaaagaaa acgttagttt aggcagtgag tttgcaattc 104041 cctgaacagt cattactatt taaaaatgaa tatgggccaa agcttttttt ctatttggaa 104101 gaacagagag ctcccaaacc tcacagagtt aatgagcctg gaacaggtgg ttctctcagt 104161 cactcaggct acaagccacc tgcagaaata gggaatcctc ctctgcaacc ccttgcagcc 104221 cacatccaat caattgccaa atcctgttgc ttctacttct gtgttgtctc ttgaatccat 104281 cactaccttt tttacattta taactcaagc ctccaatacc tctcccttaa gctatggcaa 104341 caatgtccca actagtcttc aaaaggtaat ttttggcttt gagaacattt aatgatatta 104401 agccagtgat ccctgggaaa gtctttgcta aaagagtata tcacagagag gtagtacagt 104461 ggtcaggagc ccctctcata gacaagaaac taaggtcaga gaggtgaaat agcttgctcg 104521 gatctggagt cttctgactc caagtccaac attttctact ccaccttaac tgctgccccc 104581 tcatccccaa gtgcagagta ggttggagag caaagagtaa caaaggcatg gaggtgagaa 104641 tgaggagcaa tatgtcaggt tgcagagaca ctgagtattc agttttactt tgtgttgtct 104701 tactttggga aaggaatgga aatgaatttt ttaaaatatg cagtgaattg gcaggagtca 104761 tttgcccctt tctctccctg gcccacgtat atccttgact ctagtgattt ccctgctgaa 104821 ggagaggtgg cctgttgtgt ttgttgctgt atcctccaga gctcagaaca gtacttgcca 104881 agtaataggt gctcaataaa tcttgaatga gtgaatacaa cttcaggagc atagtagatg 104941 aacttggcat gcttttaaaa aataaaatct aggccgggcg cggtggctaa cgcctgtaat 105001 cccagcactg taggaggctg aggtgggtgg agcacctgag gtcaggagtt tgagaccagc 105061 ctggccaaca tggcgaaacc ccatctctac taaaaataca aaaattagct gggagtggtg 105121 gtgggtgccc gtaatcccgg ctactcagga ggctgagtca cgagaatcgc ttgaacctgg 105181 gaggcagagg ttccagtgag ctcagatcac gccactgcac tccagcttgg gtgacagagt 105241 gagactccgt ctcgaaaata aataaataaa taaataaaac ctatgtccag atttctaagc 105301 cttgcttgag acacagaaaa atcttcctta ccacataatg ttaagactga gaagagatgc 105361 tagctgccat catcccctgc ctcccaaatc ccatccatta tttcactcca aagaaatggc 105421 agggtcgtag aacctggaga gcccttaaag gcaaaacaac ctgaggcagt ggactcaaga 105481 caatctagat tccttcttaa aaactcaatt ttcctcctcc ctccaagaag ataccctttg 105541 ttaaaaacat cggtatagtg atttcctgat tttccatctt gcacccttca aactctccct 105601 tcctcctcct ctccctccct cctgttctgt tgctttcctc aactaagaac tctctcctct 105661 taaccagaga ggggaacaaa gacccaccag ggcaaagaat ctctgtgcct gggcatatgg 105721 cactcactgc ctctctaaat cccactacag agatgacatg agaggcagca ggccagtttt 105781 aggtgttccc agccacagaa gggagaggca agtttcctga agtgaagcta aaggcagttc 105841 tgtcctgaga ggcattaaaa ggcttctgtg ggcagcagct gctgtgggtg gcagttgtac 105901 actttgataa taaaaggggg tggtacaacg gcccttgggt tttcacttct tagttttgtg 105961 tttgagaagt atggggggag ggggtcatga ttctcacttc cctttgagga tgctgggagt 106021 cttaaggata aagctgaagg gttcctggca gagctaggga gctaactggc tgcagtccac 106081 acacccttgg ggtttagcgg gggaaaaggc ccatagtacc agtgcatagt ttcctagcaa 106141 caggcaggca ccttgggaaa gatctggctt tgtcgcagcc agggagttgg cctggtgtat 106201 tctggttcct gcctgagagt gtgactttaa taatgctgtg ggcagctcag gagaacagaa 106261 ctttgaatcc ttgtgcacca ccaagcacga agagcatcta aatagcctac tgctgggcct 106321 agtcagctcc agcttcctga gcggcttctt agaggctggg gtggccaaca ggctgatggt 106381 gcttgaagag gagcaccacc ctttcaggga cacagatggc tttcctcttc caccttagcc 106441 tgcctcctgc ctgcctgatg gctgacccac acccttcaag atcaatgcgc atttccgtca 106501 gggcatgctt ctctccagga gccttgtggg acagtgactt cacaaagagc ccaggaactg 106561 gctggtgata attactagtt aaagacactg tgtcccaagg gtaaaatccc agcatctgtc 106621 ggtattgtgc attccctctc tgttctgccc ctggaagcaa aggctaggga gatggagaga 106681 agagtgggta aaagaaagag gaaaaaaaaa aggatgagtc aggaacaagg gaaaggaggg 106741 gacgaaagga aagacttgac agcggggtgg cgggcgggtg gcggggggaa gaaaaagttg 106801 agtgatagca acaacagcct tctaaggctc tcagcaacct gtgccataaa ctgtaccctc 106861 ctaggatcaa agcaatcctt ttaataaatg ggcccaattt ataatgagta acttttagtg 106921 gctagggtga aacctagtgt accctcgtga ttaaataaag aggaaaaata gaggcacttt 106981 tgaattgcat aggttgagac tttcgtttta agcttaaact tggctggtca ccaaatggat 107041 aggaaaggga ccattccatg gttagagtaa agtaggggta ctggcccagt atgccagact 107101 gggaaatatt gctgggaacc ttgaggggtt tataaatagc tgggtttaat ttcccagcag 107161 aagtgaaaac tgatgctttg taccctacag attttggact acttggctaa agctgaaggg 107221 gtggtgggga aatagtgcaa aggacaagag aaatgagtgt tggacagatg agcgaggagg 107281 caaaggtagt ctcagcttat atccttcctg ctttggagta tgaggagttc ctgagaagca 107341 aggacaccct ggagcccagt gtttgaaaga atccagtttc tctaagctca gcactcaaaa 107401 cttcctcagg gccagctgat cagacccaaa tccttgctca tgtctgacca caagaagtgt 107461 ttcccaagct tcctcactgg acagctaccc aataagggta tgactttgag gggttaaaac 107521 aaaacattat aagtcttttt tcctcatcct cctattagcc tcagggtcaa aaggaagttg 107581 aaatcaccaa acccaaagtc acagaaatag tcctacttgg aaataaccaa aatgggaagt 107641 cattcagaag aaagtctaga aaggtccatt ttctggctgt aacttaggtc attttacaat 107701 cagaagagca tgagggagct ctggcctttt gggggtgggt ctgtgcttca ggtggtttta 107761 ttgtcagatc tggagtgcat ttgctgactc ctttttattt atctgggaat ataagcagac 107821 gtttcctgcc ctcttgactt tccttttttt ttttttttta tacagcgctt ctctcttgtt 107881 gcccaggctg gagtgcaatg gcgtgatctc ggctcaccac aacctccacc tcctgggttc 107941 aagtgattct cctgcctcag cctcctgagt agctgtgatt acaggcatgc accaccatgc 108001 ctagctaatt ttgtattttt agtagagatg gggtttctcc gtgttggtca ggctggtctc 108061 gaactcccga cctcaggtga tcgcccgcct tggcctccca aaatgctggg attacaggcg 108121 tgagccaccg cgcctggctg actttcttgt gtaaagaatg gtattttcgc ctatgaaact 108181 ataagggaat gaaacttggt gagcaaaaga aagctaaact aaaatactaa aatgaaacca 108241 tttaagttct agagagaaat cacttaaaat cttaccccct caagacttag gctatacaat 108301 tataaagtct tcaaacagga aatactaaag acatcttgag cctagggaaa gaagtgggag 108361 acatataaaa agtgtccaag cagcccctga attcccctct gaggaatgca tgaaggaagg 108421 gaatcctttt tcagtgggga tcaagagtaa agagaccctt tctgaggacc atgagtccag 108481 tagaaagtat aaatatccag tagaaagtat aaatatccca aaattaactg gggaaagagc 108541 tgtgctccgc ttagattcac aacttcaaaa atcccccaac ccaatctctg gaatcagtca 108601 gccctagata taaatcctta ctctttcact taccacctct gtgactttgg tgaagtcact 108661 taatctctcc atgattcagt catcttatct taaaaatggc aattgcaaga gtacttctct 108721 taaagaggta aatgattgaa tgatgatata tgtaaagcac ttagcacaat gcttagtaca 108781 tagtaagtac tcagtgaata gtagctgtta ttaaaaagta agggtgggcc aggcacggtg 108841 gctcatgcct gtaatcccag cactttggga ggccgaggcg ggtgaatcac gaggtcagga 108901 gtttgagacc agcctggcca acatggtgaa accctgtctc tacaaaagat acaaaaaaaa 108961 aaaaaaaaat agccgggcat ggtggcacgc acctgtaatc ccagctactc gggaggctga 109021 ggcaggagaa tcgcttgacc ccaggaggtg gaggttgcag tgagccaaga ttgtgccatt 109081 gcactccagc ctgagcgaca gggcaagact ctgtctcaaa aaaaaaaaaa aaggagaggc 109141 ttctaacaat ccccaaagtc tctgctacaa gcccatagta taggtttgga ctaggtagct 109201 aaatctggat cccttttctg ggttaggtat gctgtttcca tagcattctg ggtaatgttt 109261 tctttacttt tttgtcccca ctactgtccc ctgacccccc acccccaagt ccctagacta 109321 acgcagggac tgtctcattt aatttctatt tccacagctc ctggctcaat gcgtggcaca 109381 tggaacacag agtaaacatt gattgaactg aaaccaacaa tccaagtcac cgttggcatt 109441 ctcaatcttg ctttcatttc tggggtagga tttgaatgag gaatgatggc tccataccca 109501 aaagagaggt ttcctatctt ataatttgtg gtaaccatca ccagatgtat tattgtccct 109561 acttaaaggg ttcattataa agtgcaaatt cctgaaatca atgtaattta ctttgatata 109621 aaagtagagg aggcctcttg ggctagaccc atgtgtggct cctctattta aggatgactt 109681 tttattatag ggaccccaag agaatgatga ccttccttgt tttagcttca acatccctgc 109741 attctcttga agaattcctt gttgcctcgg tcactattga aattatctat gcccttcttg 109801 aacatatttt tgttttccag tctataccac ctatttgggt aacaagttac ctgagttgtc 109861 tacctgtgta cacagtaggc tttagaacga attttatcat tatattttct aagctttaac 109921 ccatttatgc ctagtgtccc attattggaa cgctaagctt gtgggagtta tttatatcct 109981 cctgctcaag gtcatcgcca aggtctgatt tttcacaaaa aaatttgcaa cctctggcat 110041 caatgggtta atggatacct ttttgaacta gtactatatt ttataaaagt gaacaagtac 110101 atgtgcacac tatatgtatt attattgcat tatggagcca cagacctctc tgaacggtat 110161 ctaaaccaac cacaacaaca ctttattcaa cttaatcaga tttttaagtt tcagttattt 110221 tgccaaactg aagttgaaat taggttgccc aagggctttg tattgccaaa ctggctggtt 110281 agttggaagt acccccagat agttgtactt ttatgtcccc atggaacatt catggtctct 110341 tgatcaatag aagcaaattt attatgagtt tcccttcctt aaaaggtatt tcctttggtc 110401 tctaaagttt gaattaacta gattttagtg taaccttgct cttctatggt caagggttta 110461 taatcgttat acatacctaa aggaccattt cacttgtttg cccaaagtgt ctagttcagt 110521 gttctgcatg caatgaattc cttttaagta agtgcctttt ttgggtctgg ttttctacta 110581 gtagtcacag ttcaaaagga aaaggggaaa gtgcaaattt gttaacacta tttctgactt 110641 acgtgataac agggcaaaag ggggaggggt gttataaaaa tactctgtgt attgctctgt 110701 gtagtcctag ttaggaggcc tggagaaaaa gaatgggact tcttcctcat tacaggtttg 110761 attcaaaccc ttgtgtggct cagaatgccc ttgtgttaag gagcaggatt aggcctctcc 110821 ccatatcagg ggaatggaaa aagccactgt tgcacagggg tccatggtag tgtactaatg 110881 agctgccatt tcccaactag cttatcttct ccttggtgta gtttaaggca atttgggata 110941 ccttgatcct acctccctct ctcccttttt cttccatctc tctgtttctt ctcttgcgca 111001 gcttccaccc gcccatctct gccatcgttg cctgcctaca tagtccttaa gctggcaacc 111061 acaggctgct ggccattagg gagatgcatg gctacctcat acatatgcag acaatggaaa 111121 ggttccataa ttgaatttag ggagagaaca agagagcatg agtgagaaca agggtaatac 111181 agagttcttt tttctcccaa aacatacctg aaagtcataa gcagaatatg gtggcaggtg 111241 aggagggcta tggtagtagg tattgtagga tgccacttgg ggatgagcat agtaagaaac 111301 tgttggatgg gtattttcaa cagaacagtt cagtgctggg agagttgggt tctgtagaaa 111361 aacacaaatt agaaacaaca taaacagtca tcactaggag aatggttaaa taaacttagt 111421 acatctatac tatggaatac catgcagcta ttgataaaca atgatgtgga cctatttaga 111481 gaaggttgtc caggatatat ttttatgttt taaaagccat tatgttacat taatataaca 111541 tgatcccaat ttatgtttta aaaaaagacc cccaaaatta tacatgtatg tatgtttgta 111601 aatgcaaaga aattaacctg gaaggatgca tagcaaacta ttaactgtgg tttatctctg 111661 gggagcagag tagaattagg aggtatatag ggattcttaa ctttctacta tatatgtcaa 111721 aatgtttaca actaatttgt attgtgtcat tttccccaaa ctttttaact tgaaaaattt 111781 aaacctatag aaaagttgaa atcatttgtg tgatttttaa aaacaagaat taaaaagaaa 111841 aaataagact caaaatgcaa aacatcaaat agagaaggcc gaaccagcta gtgttcttgc 111901 tgtgacagag tgtgttcaat cacaaagaat ttgtaaaagt tgtaaggatt cttagagatc 111961 ccaaccttgc taagggggtc tcctgaagac atgaagggta ttgatttctc atcagccctg 112021 tagagtagag gcaaaatcct agcagtgcac agaagccatg aaagaccaag aaaaatatat 112081 tgatcccccc accttttttt tttttgcaaa taaaacgatg cagacataat tttatatcct 112141 tttcacaaca tggtggtgtg tgtagccctt caagggttaa agctaacctt gtccagaatg 112201 gaggccctgc tctcatcaga gccaaagcac tgctaaagga agttaccact actcattagt 112261 tcccaagtag gactgtgctt tgtgccacat ggatcaaggt gtggggtaaa tgatttttcc 112321 atcttctatg tcattgatac ctcccccatt catgtgtata attgagagct gtctgaaaaa 112381 tcaatgtgtc ttccttttgg aacatagcta cagcacatta acagagaagc agaagcttcc 112441 tctcaagtga tgaaaatgtg taggcaggct ttggcagtca caaaggcaac catgttaggg 112501 tgttggcagg gggaaactcc cttccagcaa gtttcaacac agttctgttg ctctcttcct 112561 tggagtgggt tactgtttac atctctagga ggtttgaaga gctacaaacc tcagtcataa 112621 tacagctcac agacagccag ataaatgttt gtcagcctct ttaatgtgtg aaagagggct 112681 gagatc // LOCUS HSAF000545 1020 bp DNA PRI 18-MAY-1997 DEFINITION Homo sapiens putative purinergic receptor P2Y10 gene, complete cds. ACCESSION AF000545 NID g2104786 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1020) AUTHORS Bohm,S.K. TITLE Putative purinergic receptor related to P2Y5 and P2Y9 is localized on the X chromosome JOURNAL Unpublished REFERENCE 2 (bases 1 to 1020) AUTHORS Bohm,S.K. TITLE Direct Submission JOURNAL Submitted (20-APR-1997) Dept. of Surgery, University of California, San Francisco, 521 Parnassus Ave., San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..1020 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone="J333E231" CDS 1..1020 /note="G-protein coupled receptor" /codon_start=1 /product="putative purinergic receptor P2Y10" /db_xref="PID:g2104787" /translation="MANLDKYTETFKMGSNSTSTAEIYCNVTNVKFQYSLYATTYILI FIPGLLANSAALWVLCRFISKKNKAIIFMINLSVADLAHVLSLPLRIYYYISHHWPFQ RALCLLCFYLKYLNMYASICFLTCISLQRCFFLLKPFRARDWKRRYDVGISAAIWIVV GTACLPFPILRSTDLNNNKSCFADLGYKQMNAVALVGMITVAELAGFVIPVIIIAWCT WKTTISLRQPPMAFQGISERQKALRMVFMCAAVFFICFTPYHINFIFYTMVKETIISS CPVVRIALYFHPFCLCLASLCCLLDPILYYFMASEFRDQLSRHGSSVTRSRLMSKESG SSMIG" BASE COUNT 240 a 259 c 210 g 311 t ORIGIN 1 atggctaacc ttgacaaata cactgaaaca ttcaagatgg gtagcaacag taccagcact 61 gctgagattt actgtaatgt cactaatgtg aaatttcaat actccctcta tgcaaccacc 121 tatatcctca tattcattcc tggtcttctg gctaacagtg cagccttgtg ggttctgtgc 181 cgcttcatca gcaagaaaaa taaagccatc attttcatga tcaacctctc tgtggctgac 241 cttgctcatg tattatcttt acccctccgg atttactatt acatcagcca ccactggcct 301 ttccagagag ccctttgcct gctctgcttc tacctgaagt atctcaacat gtatgccagc 361 atttgtttcc tgacgtgcat cagtcttcaa aggtgctttt ttctcctcaa gcccttcagg 421 gccagagact ggaagcgtag gtacgatgtg ggcatcagtg ctgccatctg gatcgttgtg 481 gggactgcct gtttgccatt tcccatcctg agaagcacag acttaaacaa caacaagtcc 541 tgctttgctg atcttggata caagcaaatg aatgcagttg cgttggtcgg gatgattaca 601 gttgctgagc ttgcaggatt tgtgatccca gtgatcatca tcgcatggtg tacctggaaa 661 actactatat ccttgagaca gccaccaatg gctttccaag ggatcagtga gaggcagaaa 721 gcactgcgga tggtgttcat gtgtgctgca gtcttcttca tctgcttcac tccctatcat 781 attaacttta ttttttacac catggtaaag gaaaccatca ttagcagttg tcccgttgtc 841 cgaatcgcac tgtatttcca ccctttttgc ctgtgccttg caagtctctg ctgccttttg 901 gatccaattc tttattactt tatggcttca gagtttcgtg accaactatc ccgccatggc 961 agttctgtga cccgctcccg cctcatgagc aaggagagtg gttcatcaat gattggctaa // LOCUS HSAJ97 3859 bp DNA PRI 08-JAN-1998 DEFINITION Homo sapiens mRNA for EYA1B gene. ACCESSION AJ000097 NID g2661374 KEYWORDS EYA1B gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3859) AUTHORS Abdelak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Levi-Acobas,F., Cruaud,C., Le Merrer,M., Mathieu,M., Koenig,R., Vigneron,J., Weissenbach,J., Petit,C. and Weil,D. TITLE Clustering of mutations responsible for Branchio-Oto-Renal (BOR) syndrome in the eyes absent homologous region (eyaHR) of EYA1 JOURNAL Hum. Mol. Genet. 6, 2247-2255 (1997) REFERENCE 2 (bases 1 to 3859) AUTHORS Abdekhak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Weil,D., Cruaud,C., Sahly,I., Leibovici,M., Bitner-Glindzicz,M., Francis,M., Lacomde,D., Vigneron,J., Charachon,R., Boven,K., Bedbeder,P., Van Regemorter,N., Weissenbach,J. and Petit,C. TITLE A human homologue of the Drosophila eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel gene family JOURNAL Nature Genet. 15 (2), 157-164 (1997) MEDLINE 97172972 REFERENCE 3 (bases 1 to 3859) AUTHORS Abdelhak,S. TITLE Direct Submission JOURNAL Submitted (03-SEP-1997) Abdelhak S., Unite de Genetique des Deficits Sensoriels, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris, FRANCE COMMENT Related sequences Y10260, AJ000098. FEATURES Location/Qualifiers source 1..3859 /organism="Homo sapiens" /db_xref="taxon:9606" allele 1..124 /gene="EYA1C" /citation=[2] /replace="aaaccaataaggttaggacaagagaatagctgtggtttgcgttgcaaa a accaaaaaaaaaaaaaaaaaaaaaaagaaagccccgaggctccatgggcagacctaca a ggctgcgcaaacaaatcgagggatgagattctgctgtttctttgtctagggttctcag a tgctatctgccgctgctgtttggtggggaaggagcgctgggcgcaaagctgttaccaa a cagaacggtgggagctgatggctccgagtttggggcgaggtagaaactctccagtgcc a cttccgactttaagccttcctgttgccgtccactgtggcgggtttcttcctggggaac a cgttttcgctcagtcgctcggcagcccgagcctgcggcagcggccaggcgcctgcccc c tgcgccgagctttcccctgcagaggcgctccactcccagaagcgccgcggctgcacca g agcgcctgagagcccccgcgcgtacccatccaggagcaaaactatgtcaggaatggag g tttgctaacccagaaaattcgaaggaacacattaaactggtggatgcagcagatgtaa g cgctg" gene 1..124 /gene="EYA1C" allele 171..300 /gene="EYA1A" /citation=[2] /replace="ca" gene 171..300 /gene="EYA1A" gene 178..1956 /gene="EYA1B" CDS 178..1956 /gene="EYA1B" /codon_start=1 /db_xref="PID:e1198519" /db_xref="PID:g2661375" /translation="MEMQDLTSPHSRLSGSSESPSGPKLGNSHINSNSMTPNGTEVKT EPMSSSETASTTADGSLNNFSGSAIGSSSFSPRPTHQFSPPQIYPSNRPYPHILPTPS SQTMAAYGQTQFTTGMQQATAYATYPQPGQPYGISSYGALWAGIKTEGGLSQSQSPGQ TGFLSYGTSFSTPQPGQAPYSYQMQGSSFTTSSGIYTGNNSLTNSSGFNSSQQDYPSY PSFGQGQYAQYYNSSPYPAHYMTSSNTSPTTPSTNATYQLQEPPSGITSQAVTDPTAE YSTIHSPSTPIKDSDSDRLRRGSDGKSRGRGRRNNNPSPPPDSDLERVFIWDLDETII VFHSLLTGSYANRYGRDPPTSVSLGLRMEEMIFNLADTHLFFNDLEECDQVHIDDVSS DDNGQDLSTYNFGTDGFPAAATSANLCLATGVRGGVDWMRKLAFRYRRVKEIYNTYKN NVGGLLGPAKREAWLQLRAEIEALTDSWLTLALKALSLIHSRTNCVNILVTTTQLIPA LAKVLLYGLGIVFPIENIYSATKIGKESCFERIIQRFGRKVVYVVIGDGVEEEQGAKK HAMPFWRISSHSDLMALHHALELEYL" BASE COUNT 1153 a 842 c 758 g 1106 t ORIGIN 1 tctccttttt ctcttttggt taaaagaggg cattgtcgtt ctcagccatg tgctctgtat 61 aattaagagc tgacactgaa gcagagtaac aacatcttct aattttttta cccctgatca 121 caggtgcaaa catctcaagc cagttcagat gttgctgttt cctcaagttg caggtctatg 181 gaaatgcagg atctaaccag cccgcatagc cgtctgagtg gtagtagtga atcccccagt 241 ggccccaaac tcggtaactc tcatataaat agtaattcca tgactcccaa tggcaccgaa 301 gttaaaacag agccaatgag cagcagtgaa acagcttcaa cgacagccga cgggtcttta 361 aacaatttct caggttcagc aattgggagc agtagtttca gcccacgacc aactcaccag 421 ttctctccac cacagattta cccttccaac agaccatacc cacatattct ccctacccct 481 tcctcacaaa ctatggctgc atatgggcaa acacagttta ccacaggaat gcaacaagct 541 acagcctatg ccacgtaccc acagccagga cagccgtacg gcatttcctc atatggtgca 601 ttgtgggcag gcatcaagac tgaaggtgga ttgtcacagt ctcagtcacc tggacagaca 661 ggatttctca gctatggcac aagcttcagt acccctcaac ctggacaggc accatacagc 721 taccagatgc aaggtagcag ttttacaaca tcatcaggaa tatatacagg aaataattca 781 ctcacaaatt cctctggatt taatagttca cagcaggact atccgtctta tcccagtttt 841 ggccagggtc agtacgcaca gtattataac agctcaccgt atccagcaca ttatatgacc 901 agcagcaaca ccagcccaac gacaccatcc accaatgcca cttaccagct tcaagaaccg 961 ccatctggca tcaccagcca agcagttaca gatcccacag cagagtacag cacaatccac 1021 agcccatcaa cacccattaa agattcagat tctgatcgat tgcgtcgagg ttcagatggg 1081 aaatcacgtg gacggggccg aagaaacaat aatccttcac ctcccccaga ttctgatctt 1141 gagagagtgt tcatctggga cttggatgag acaatcattg ttttccactc cttgcttact 1201 gggtcctacg ccaacagata tgggagggat ccacccactt cagtttccct tggactgcga 1261 atggaagaaa tgattttcaa cttggcagac acacatttat tttttaatga cttagaagaa 1321 tgtgaccaag tccatataga tgatgtttct tcagatgata acggacagga cctaagcaca 1381 tataactttg gaacagatgg ctttcctgct gcagcaacca gtgctaactt atgtttggca 1441 actggtgtac ggggcggtgt ggactggatg agaaagttgg ccttccgcta cagacgggta 1501 aaagagattt acaacaccta caaaaataat gttggaggtc tgcttggtcc agctaagagg 1561 gaagcctggc tgcagttgag ggccgaaatt gaagccctga ccgactcctg gttgacactg 1621 gccctgaaag cactctcgct cattcactcc cggacaaact gtgtgaatat tttagtaaca 1681 actactcagc tcatcccagc attggcgaaa gtcctgctgt atgggttagg aattgtattt 1741 ccaatagaaa atatttacag tgcaactaaa ataggaaaag aaagctgttt tgagagaata 1801 attcaaaggt ttggaagaaa agtggtgtat gttgttatag gagatggtgt agaagaagaa 1861 caaggagcaa aaaagcacgc gatgcccttc tggaggatct ccagccactc ggacctcatg 1921 gccctgcacc atgccttgga actggagtac ctgtaacagc gctcggcact ttgacagcgc 1981 acagctgctc tgtgaccagg gacagatcca gcaggcccca gtctcgcatc agcgccggcc 2041 tccagaactt agcaatttcc gcctggtgat gcgcagttgc tgtcagtctt gacctctgcc 2101 tttgtggtga atggaggacc acgtctattt catcagaaca gctgttgact ctagtactgt 2161 gaatccagtg aaaataagcc atgagaatgt tttagcacag cgttatgtgt ctgccacatt 2221 aactacacgg ttcaaacctg tgaagaaagg acctgcaaac gcttcagttg ttagcatttt 2281 caatgtgata taaacagctt ctccaataca gcaaacctaa ttgcacaaca gagactgaaa 2341 tgtgtttcct gaataccagt ggaggaattt tcttgtaaag aaggtttact ttttggtgtc 2401 tcatacccag ggtaatctgt acatctctac ttatttatga acagactttt tttaaaaaga 2461 taaaaaaaca gctttattga ggtataattc acccaccaga cttttttaaa catcaaataa 2521 ttgaggagac aatagcatta gaaataagtg attaaaggcc tctgcctcac aacatggcaa 2581 gtacagtact ttgaatttta gcacattgca taatagtttt aagtatgtct aatttaaacg 2641 tataatatgt acatcactga gacaatcatg tacagaaaga atttttggtg taaatttgta 2701 ataatggata attcttttac atattgttta gggaaatgat attgaaaggt agcaatgcct 2761 ggatagtgaa gcatgaggca gcacgtgcac aaattcatgt gccgtgcctt atctgagttt 2821 tcggtataaa tatgtagata atggattttt ttttagataa tgttgtcaag accaaaagca 2881 tggatgtcaa gtgtcagtaa ggattttgtt tctaaaattt tttcctgcat cagttcttct 2941 gagggccttg atgaaataac acagcagttt cttaaacaat ttgaaacaaa atgagctctc 3001 ctaccacctc actttttcat ttccacacta atgtattata tgtaactact tggaaaaaat 3061 aattattcaa atgcttcttc ccacaaagaa tatagatgat agtagatata ttttattaat 3121 aaaatggttc atgaatcgga gactaacaaa gttttcatgt gctcagaatt attaattatc 3181 gtgtctgcat tttctttcga taaaggaaga cacacgatgc taatccggaa atcagcaaac 3241 tttgcattac tccctatgtg cgtattttct ctttcttcct gtcaccctga ggaaggttca 3301 ttgccattgt catcaccatg gaaacaacgt tcctctccac ctgcattatg tactacatga 3361 caggcatcaa tctggggaaa taataaaatt atcacctttg tcagaccata agagtttctc 3421 caaaagtggt cagtttggct gggcaatatt ttctctcatc taacaaacac aatccattgt 3481 catgaaatta cccttaggat gagtcttctt taatcaatca tatattgggc ggaaaaaaca 3541 ccagctttga cccgaagtag ttgaagagct acttcattct tttctgaagt tgtgtgttgc 3601 tgctagaaat agtcatttgt gaattatcca aattgtttaa attcacaatt gaattagttt 3661 tttcttcctt ttggcttgaa gcaaacagtt gaccattttt aaccttttca ttttatgttt 3721 ttgtactctg cagactgaaa agacaaagtt tatcttggcc ttactgtata aaggtatgct 3781 gtgtccaccg ttgtgtacag aatttttctt cattaatttt gtgtttaagt taataaaatt 3841 tatttgtgat gtactgtaa // LOCUS HSART3 1104 bp DNA PRI 26-FEB-1997 DEFINITION H.sapiens ART3 gene. ACCESSION X95827 NID g1495418 KEYWORDS mono-ADP-ribosyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1104) AUTHORS Koch-Nolte,F., Haag,F., Braren,R., Kuhl,M., Hoovers,J., Balasubramanian,S., Bazan,F. and Thiele,H.G. TITLE Two novel human members of an emerging mammalian gene family related to mono-ADP-ribosylating bacterial toxins JOURNAL Genomics 39 (3), 370-376 (1997) MEDLINE 97224466 REFERENCE 2 (bases 1 to 1104) AUTHORS Koch-Nolte,F. TITLE Direct Submission JOURNAL Submitted (23-FEB-1996) F. Koch-Nolte, University Hospital, Dept. of Immunology, Martinistr. 52, 20246 Hamburg, FRG REFERENCE 3 (bases 1 to 1104) AUTHORS Koch-Nolte,F., Braren,R., Haag,F., Khl,M. and Thiele,H.G. TITLE Molecular characterization of the gene for murine skeletal and cardiac muscle ecto mono(ADPribosyl)transferase Art2 JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1104 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="P1 Genome systems" /chromosome="4" gene 1..1104 /gene="ART3" CDS 1..1104 /gene="ART3" /EC_number="2.4.2.31" /note="expressed in testis" /codon_start=1 /product="mono-ADP-ribosyltransferase" /db_xref="PID:e223921" /db_xref="PID:g1495419" /translation="MKTGHFEIVTMLLATMILVDIFQVKAEVLDMADNAFDDEYLKCT DRMEIKYVPQLLKEEKASHQQLDTVWENAKAKWAARKTQIFLPMNFKDNHGIALMAYI SEAQEQTPFYHLFSEAVKMAGQSREDYIYGFQFKAFHFYLTRALQLLRKPCEASSKTV VYRTSQGTSFTFGGLNQARFGHFTLAYSAKPQAANDQLTVLSIYTCLGVDIENFLDKE SERITLIPLNEVFQVSQEGAGNNLILQSINKTCSHYECAFLGGLKTENCIENLEYFQP IYVYNPGEKNQKLEDHSEKNWKLEDHGEKNQKLEDHGVKILEPTQIPAPGPVPVPGPK CHPSASSGKLLLPQFGMVIILISVSAINLFVAL" BASE COUNT 342 a 239 c 233 g 290 t ORIGIN 1 atgaagacgg gacattttga aatagtcacc atgctgctgg caaccatgat tctagtggac 61 attttccagg tgaaggctga agtgttagac atggcagata atgcatttga tgatgaatac 121 ctgaaatgta cggacaggat ggaaattaaa tacgttcccc aactgctaaa ggaggaaaaa 181 gcaagccacc agcaattaga tactgtgtgg gaaaatgcaa aagccaaatg ggcagcccga 241 aagactcaaa tctttctccc tatgaatttt aaggataacc atggaatagc cctgatggca 301 tatatttccg aagctcaaga gcaaactccc ttttaccatc tgttcagtga agctgtgaag 361 atggctggcc aatctcgaga agattatatc tatggcttcc agttcaaagc tttccacttt 421 tacctcacaa gagccctgca gttgctgaga aaaccttgtg aggccagttc caaaactgtg 481 gtatatagaa caagccaggg cacttcattt acatttggag ggctaaacca agccaggttt 541 ggccatttta ccttggcata ttcagccaaa cctcaggctg ctaatgacca gctcactgtg 601 ttatccatct acacatgcct tggagttgac attgaaaatt ttcttgataa agaaagtgaa 661 agaattactt taatacctct gaatgaggtt tttcaagtgt cacaggaggg ggctggcaat 721 aaccttatcc ttcaaagcat aaacaagacc tgcagccatt atgagtgtgc atttctaggt 781 ggactaaaaa ccgaaaactg tattgagaac ctagaatatt ttcaacccat ctatgtctac 841 aaccctggtg agaaaaacca gaagcttgaa gaccatagtg agaaaaactg gaagcttgaa 901 gaccatggtg agaaaaacca gaagcttgaa gaccatggtg tgaaaatcct tgaacccacc 961 caaatacctg ctccaggtcc agttcctgtt ccaggtccca aatgccatcc ttctgcatcc 1021 tcgggcaaac tgctgcttcc acagtttggg atggtcatca ttttaatcag tgtttctgct 1081 ataaatctct ttgttgctct gtag // LOCUS HSBDKRBI2 3332 bp DNA PRI 29-SEP-1996 DEFINITION Human bradykinin B1 receptor gene (BDKRB1), gene, complete cds. ACCESSION U48231 NID g1388120 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3332) AUTHORS Bachvarov,D.R., Hess,J.F., Menke,J.G., Larrivee,J.F. and Marceau,F. TITLE Structure and genomic organization of the human B1 receptor gene for kinins (BDKRB1) JOURNAL Genomics 33 (3), 374-381 (1996) MEDLINE 96299633 REFERENCE 2 (bases 1 to 3332) AUTHORS Bachvarov,D.R., Hess,J.F., Menke,J.G., Larrivee,J-F. and Marceau,F. TITLE Direct Submission JOURNAL Submitted (02-FEB-1996) Dimcho R. Bachvarov, Centre de Recherche, Hotel-Dieu de Quebec, 11 Cote du Palais, Quebec, QC G1R 2J6, Canada FEATURES Location/Qualifiers source 1..3332 /organism="Homo sapiens" /db_xref="taxon:9606" gene join(U48230:1..1417,1..3332) /gene="BDKRB1" intron <1..226 /gene="BDKRB1" exon 227..347 /gene="BDKRB1" intron 348..1247 /gene="BDKRB1" exon 1248..3332 /gene="BDKRB1" CDS 1258..2319 /gene="BDKRB1" /note="G protein-coupled receptor" /codon_start=1 /product="bradykinin B1 receptor" /db_xref="PID:g1388122" /translation="MASSWPPLELQSSNQSQLFPQNATACDNAPEAWDLLHRVLPTFI ISICFFGLLGNLFVLLVFLLPRRQLNVAEIYLANLAASDLVFVLGLPFWAENIWNQFN WPFGALLCRVINGVIKANLFISIFLVVAISQDRYRVLVHPMASGRQQRRRQARVTCVL IWVVGGLLSIPTFLLRSIQAVPDLNITACILLLPHEAWHFARIVELNILGFLLPLAAI VFFNYHILASLRTREEVSRTRVRGPKDSKTTALILTLVVAFLVCWAPYHFFAFLEFLF QVQAVRGCFWEDFIDLGLQLANFFAFTNSSLNPVIYVFVGRLFRTKVWELYKQCTPKS LAPISSSHRKEIFQLFWRN" BASE COUNT 771 a 891 c 753 g 917 t ORIGIN 1 tgacaatggc ccctaccttg aatggctgcc aggaagatta aatgagaaaa tacttaaatg 61 tgaaacactt agaatggcgc ctggaacaca gaccattaat accatctaac agatgttagt 121 tgttatcctt atttattact catgctttcc tttctctttt ttcttttctc tctctctctc 181 tccttttttt tttttttttt tgttgttgtt gttgttgttg agacagggtc tcagtccgtc 241 ggcccagact gaagtgcagt ggcacaatca tagctcgctg cagcctcgac cttccaggct 301 taaacgattc tcccacctca gcctctcgag ttgctgggac cacaggtatg caccaccatg 361 cccagctaat ttttgtattt tttgtaaaga caggatttca ccatgttgcc caggctggtc 421 ttgaactcct gggttcatct gatccatctg ccttggcctc ccaaagtact gagattacag 481 gtgtgaacca ccacacccgg ccaatactca tgtttttcaa gcctgtaaga ggaacctcct 541 agcactgtcc ccaccccggc caccacactg gtcccagccc caacaggtca gcttcctttg 601 ctgttcccgg gcttccctca gctctatctc aagccatgac ctctgcctcc atgtctgcag 661 ccccatgagg ctggggctgc tctgtcctgc atatctccag tgcctggcaa ggggctggca 721 agaggtagag gctcattaaa tgcctgttaa aaccctaata gtaataataa taatggtaca 781 gttgttacta agactaatca ctaccttcca aagtctttcc tctatgcaag gcacggagct 841 aagcaccctg tagatcctga caacagccct cccgtgatcc cacaagagag acacgattct 901 ctccaatgtt taaaaaggaa agtaaaagtc aatggttcta agtagctcac actgagtgat 961 tgtaccggga tttggactca ggcaccatcc cttaaaccag aggtcagcaa acttttcccc 1021 ctcaggaaca gatggtaaat atcttcagct ttgcaagcca gacagtctct gttacaagcg 1081 ctcgactccc ccttgtagca tgaacacagc tgtagataat acgtccagga atgggtgtgg 1141 ctctgtgcca ataaaacttt attgtccaaa aacaggtgac aggttggttt ggctcatagg 1201 ctgtagtctg ccacttcctg ttttattcta ccttctgttc atttcaggtc actgtgcatg 1261 gcatcatcct ggccccctct agagctccaa tcctccaacc agagccagct cttccctcaa 1321 aatgctacgg cctgtgacaa tgctccagaa gcctgggacc tgctgcacag agtgctgccg 1381 acatttatca tctccatctg tttcttcggc ctcctaggga acctttttgt cctgttggtc 1441 ttcctcctgc cccggcggca actgaacgtg gcagaaatct acctggccaa cctggcagcc 1501 tctgatctgg tgtttgtctt gggcttgccc ttctgggcag agaatatctg gaaccagttt 1561 aactggcctt tcggagccct cctctgccgt gtcatcaacg gggtcatcaa ggccaatttg 1621 ttcatcagca tcttcctggt ggtggccatc agccaggacc gctaccgcgt gctggtgcac 1681 cctatggcca gcggaaggca gcagcggcgg aggcaggccc gggtcacctg cgtgctcatc 1741 tgggttgtgg ggggcctctt gagcatcccc acattcctgc tgcgatccat ccaagccgtc 1801 ccagatctga acatcaccgc ctgcatcctg ctcctccccc atgaggcctg gcactttgca 1861 aggattgtgg agttaaatat tctgggtttc ctcctaccac tggctgcgat cgtcttcttc 1921 aactaccaca tcctggcctc cctgcgaacg cgggaggagg tcagcaggac aagagtgcgg 1981 gggccgaagg atagcaagac cacagcgctg atcctcacgc tcgtggttgc cttcctggtc 2041 tgctgggccc cttaccactt ctttgccttc ctggaattct tattccaggt gcaagcagtc 2101 cgaggctgct tttgggagga cttcattgac ctgggcctgc aattggccaa cttctttgcc 2161 ttcactaaca gctccctgaa tccagtaatt tatgtctttg tgggccggct cttcaggacc 2221 aaggtctggg aactttataa acaatgcacc cctaaaagtc ttgctccaat atcttcatcc 2281 cataggaaag aaatcttcca acttttctgg cggaattaaa acagcattga accaagaagc 2341 ttggctttct tatcaattct ttgtgacata ataaatgcta ttgtgatagg ctaaatgatt 2401 actcccgtag attggggggt acctaatccc tggacttgat gaatgttacc aaattaaggg 2461 tcttgagatg gggagatgat cctgaattat ccaagtgggc cctatataat cacaagggtc 2521 cttataggag ggaggcagga ggctcagagt caggagatgt gactatggaa gcagaggcca 2581 gaggaattca ggacggccac tacgagccaa ggattgcagg caccctgtag aggctgtaaa 2641 gggcaaggaa atggcttctc ccctggagcc tccagaagga atgggtcctg ccaactccct 2701 gtcttcagcc cagggaaaca gatttaggat ttctggcctc cagaactgtt agaggataca 2761 tttgtgtttt gttttgcttt gtttgctttg ctttgctttg ctttgctttt ttgagatggg 2821 gtctcgctct gtcacccagg ctggagtgca ctggcacaat cacggctcac tgcagcctca 2881 acttcccaga ctcaagggat cctcccacct cagcctcctg tagctgagac tacaggtgtg 2941 caccaccatg cctggctaac ttttctattt tttgtagaga tggtgtcttc ctgcattgcc 3001 taggctggtc tcaaactcaa gggctcaagt gatcctccac tttggtctcc catagtgcta 3061 ggattatagg cgtggccact gcgcctggcc ccatttgtat tttaagccac cgagtttctg 3121 gtaatttgtc atagcagcag caggaaacaa ataacaagta tcgggtaatg gcctctctta 3181 ttacacttcc atttgtctat tcaaagcttt ctaggctaac tgccaggaat acagggatgg 3241 tatagctaga agttctatat tcaaggaact tacatacatc taaatattat aatataaaaa 3301 atgaaatgag aaattgcatg aacacaccac gg // LOCUS HSCAMF3X1 41154 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens Y chromosome cosmid cAMF3.1 containing Yp pseudoautosomal boundary, PAB1. ACCESSION X96421 NID g1216157 KEYWORDS PAB1; pseudoautosomal boundary; repeat; repetitive DNA; Sry gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 41154) AUTHORS Whitfield,L.S. TITLE Direct Submission JOURNAL Submitted (05-MAR-1996) L.S. Whitfield, Department of Genetics, University of Cambridge, Cambridge CB2 3EH, U.K REFERENCE 2 (bases 1 to 41154) AUTHORS Whitfield,L.S., Hawkins,T.L., Goodfellow,P.N. and Sulston,J. TITLE 41 kilobases of analyzed sequence from the pseudoautosomal and sex-determining regions of the short arm of the human Y chromosome JOURNAL Genomics 27 (2), 306-311 (1995) MEDLINE 96044437 REFERENCE 3 (bases 1 to 41154) AUTHORS Ellis,N.A., Ye,T.Z., Patton,S., German,J., Goodfellow,P.N. and Weller,P. TITLE Cloning of PBDX, an MIC2-related gene that spans the pseudoautosomal boundary on chromosome Xp JOURNAL Nature Genet. 6 (4), 394-400 (1994) MEDLINE 94332149 REFERENCE 4 (bases 1 to 41154) AUTHORS Weller,P.A., Critcher,R., Goodfellow,P.N., German,J. and Ellis,N.A. TITLE The human Y chromosome homologue of XG: transcription of a naturally truncated gene JOURNAL Hum. Mol. Genet. 4 (5), 859-868 (1995) MEDLINE 95360005 REFERENCE 5 (bases 1 to 41154) AUTHORS Ellis,N.A., Goodfellow,P.J., Pym,B., Smith,M., Palmer,M., Frischauf,A.M. and Goodfellow,P.N. TITLE The pseudoautosomal boundary in man is defined by an Alu repeat sequence inserted on the Y chromosome JOURNAL Nature 337 (6202), 81-84 (1989) MEDLINE 89082660 FEATURES Location/Qualifiers source 1..41154 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cosmid cAMF3.1" /chromosome="Y" /map="Yp" repeat_region 1..142 /partial /note="Alu repeat: matches 170..308 of consensus" repeat_region 519..806 /note="Alu repeat: matches 1..308 of consensus" repeat_region 1202..1505 /note="Alu repeat: matches 308..1 of consensus" repeat_region 1657..1880 /note="MER17 element fragment" repeat_region 1977..2075 /note="MER17 element fragment" repeat_region 2095..2392 /note="Alu repeat: matches 308..1 of consensus" repeat_region 2459..2757 /note="Alu repeat: matches 1..308 of consensus" repeat_region 2847..3144 /note="Alu repeat: matches 308..1 of consensus" repeat_region 4011..4336 /note="Alu repeat: matches 1..308 of consensus" repeat_region 5647..5717 /partial /note="Alu repeat: matches 237..308 of consensus" exon complement(9027..10217) /citation=[4] /product="YTPEP11" repeat_region 10664..10810 /note="L1MB7 element fragment" exon complement(12332..12486) /citation=[4] /product="RACE6" CDS 12363..12977 /codon_start=1 /product="SRY" /db_xref="PID:e225620" /db_xref="PID:g1216158" /db_xref="SWISS-PROT:Q05066" /translation="MQSYASAMLSVFNSDDYSPAVQENIPALRRSSSFLCTESCNSKY QCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLT EAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKMLPKNCSLLPADPASVLCSEVQLD NRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRYSHWTKL" exon complement(12913..12971) /citation=[4] /product="RACE6" repeat_region 14459..14512 /note="MER31 element fragment" repeat_region 14546..14629 /note="MER31 element fragment" repeat_region 14929..15012 /note="L1MA9 element fragment" repeat_region 15822..16145 /note="Alu repeat: matches 1..308 of consensus" repeat_region 16511..16585 /note="L1ME2 element fragment" repeat_region 16688..16982 /note="Alu repeat: matches 308..1 of consensus" misc_feature 17942..18175 /note="pseudoautosomal boundary: ALU-proximal region" /citation=[5] repeat_region 18181..18485 /note="Alu repeat: matches 308..1 of consensus; Alu inserted at pseudoautosomal boundary" misc_feature 18486..18490 /note="pseudoautosomal boundary: ALU insertion site" /citation=[5] repeat_region 18943..19059 /note="MER4A2 element fragment" repeat_region 19203..19229 /note="MER4A2 element fragment" repeat_region 19233..19350 /partial /note="Alu repeat: matches 308..193 of consensus" repeat_region 19358..19515 /partial /note="Alu repeat: matches 175..1 of consensus" repeat_region 19930..20013 /note="MER2 element fragment" repeat_region 20018..20317 /note="Alu repeat: matches 1..308 of consensus" repeat_region 20616..20756 /partial /note="Alu repeat: matches 308..168 of consensus" repeat_region 20899..21186 /note="Alu repeat: matches 1..308 of consensus" repeat_region 21391..21700 /note="Alu repeat: matches 308..1 of consensus" repeat_region 22049..22183 /partial /note="Alu repeat: matches 149..1 of consensus" repeat_region 22341..22564 /partial /note="Alu repeat: matches 308..121 of consensus" repeat_region 23092..23297 /partial /note="Alu repeat: matches 308..101 of consensus" repeat_region 23851..23951 /partial /note="Alu repeat: matches 103..1 of consensus" repeat_region 23962..24279 /note="Alu repeat: matches 308..1 of consensus" repeat_region 24505..25069 /note="L1MB7 element fragment" exon complement(25214..25237) /citation=[3] /number=3 /product="PBDX" unsure 26088..26815 /note="minisatellite repeat" repeat_region 26088..26815 /note="minisatellite" repeat_region 27288..27557 /note="L1MB7 element fragment" repeat_region 27564..27821 /note="L1MB3 element fragment" repeat_region 28838..29158 /note="Alu repeat: matches 1..308 of consensus" exon complement(29400..29441) /citation=[3] /number=2 /product="PBDX" repeat_region 29589..29654 /note="L1MB7 element fragment" repeat_region 29674..29805 /note="L1ME2 element fragment" repeat_region 29724..29861 /note="L1MB7 element fragment" repeat_region 31183..31505 /note="Alu repeat: matches 308..1 of consensus" repeat_region 33127..33428 /note="Alu repeat: matches 1..308 of consensus" repeat_region 33547..33834 /note="Alu repeat: matches 1..308 of consensus" repeat_region 33841..33990 /note="MER39 element fragment" repeat_region 34630..34911 /note="Alu repeat: matches 295..1 of consensus" repeat_region 34912..35219 /note="Alu repeat: matches 308..1 of consensus" repeat_region 35560..36080 /note="MER41 element fragment" repeat_region 36666..36953 /note="Alu repeat: matches 1..304 of consensus" repeat_region 38043..38147 /note="MLT1A element fragment" repeat_region 38116..38272 /note="MLT1B element fragment" repeat_region 38290..38461 /note="MLT1B element fragment" repeat_region 38528..38584 /note="MSTC element fragment" repeat_region 38568..39209 /note="MLT1B element fragment" repeat_region 39223..39524 /note="Alu repeat: matches 308..1 of consensus" repeat_region 39528..39632 /note="MLT1B element fragment" repeat_region 39787..39860 /partial /note="Alu repeat: matches 308..236 of consensus" repeat_region 39861..39972 /partial /note="Alu repeat: matches 264..153 of consensus" repeat_region 40677..40863 /note="MER20 element fragment" BASE COUNT 11133 a 9623 c 8363 g 12035 t ORIGIN 1 tgcctgtaat cccaactact tgggagggtg aagcaggaga atcacttgaa cctgggaggt 61 gcaggtttca gtgagccaag attgtgccat tacactccag cctgggcaac aagagtgaaa 121 ctccatttca acaaaagaaa aagataaagt taatgcaaca ctttgagaca gttgctgatg 181 cttatcgact aataagacag aacttgatgc tggactccag acagatttgc ttgagaaaat 241 gttcccttct ggcctcaaat gtgacctcca taatgttccc ctcaacatga ctgaaagcaa 301 ccaggacagg tccatccaga tattaatgga caattaagcc taacttcaaa atggttgatc 361 agcaatgctt tcagagaaag gtctttatca aaatggaaaa acgtaaaagt tgattataca 421 agttgggtca ctcgtgatag ccaactgaaa cagagttgaa aggctggggt ttgtggggcc 481 ataatgggtg aggaaacagg ttacataaca ttgttccaga ctggctgtgg tggctcatgc 541 ctgtaatccc agcactttgg gaagccaagg ctggcggatc acttgaggct aggagtttga 601 gaccagcctg gccaacatgg ggaaaccctg tctctactaa aaatgcaaaa attagctggg 661 catggatgca cacacgtgta atcccagcta cttgagaggc tgaggcagaa ttgcttgaac 721 ctgggaggca gaggttgcag tgagccaaaa ttgcaccact gcactccagc ctggggacag 781 agctagactc tctctcaaaa caacaacagc aacaaaagat tgttccaaga atgtaactct 841 cagctagcct ggctactgaa acaacttgct aaaaatctaa gactagtttt atgcaccacc 901 atcagcttgc ttgccagctc tccaaaacct gaatagtgcc aatgaacttt ctcaaggaac 961 aatacaccat ttctcttttt cttttataaa atcttaaagc ttgtgtttgt tcatcagaca 1021 tactgatgac cacactgtct gcatgtaggc accagattgc aagtcttact tcccaaataa 1081 aacgttttta ttcagagatt catcttcaca ttttcaacag agcaaaacaa aatcacacaa 1141 tgaatttata agcatctcaa ataagaaaag gtgagcccag acaaaatata gacagagaca 1201 gttttttgtt ttgttttgtt ttgtgttttt gagatggagt cttgctctgt tacccagact 1261 ggaatgcagt ggtgcaattt tggttcactg taacctctgc ctcccaggtt gaagcaattc 1321 tgcctcagcc tcccaaatag ctgggattac aggtgtgcac acacgcccag ctaatttttg 1381 tatttttagt agagacgggg tttcaccatg ttggccaggc tggtctcgaa cttctgccct 1441 tgtgatctcc ctgccttggc cccccaaagt gctgggatta caggcatgag ccaccgcact 1501 cagccagaga caaggttttt atctaacagg agtgtgtgtg tgtgtgtgtg tgtgtgtatt 1561 tgtgtgtgtg ttaacaagaa taaaagacaa tccagatatc aaactgctaa tcatgcaaac 1621 tacttcaaat gaattcacat ttaagctata gtcctgcatc aatcactcaa taatggggat 1681 acctactgag aaatgcatca ttaggtgatt tcctcattgt aagcacatga taaagtgtat 1741 ttacagaaac ctagggggtg tagcctatta cacacctagg ctgtatggta gagcctagca 1801 ctcctaggct acaaacctgt agagcaagtt actgcaagaa atccagttgg caattgtaac 1861 ccaatggtga gtatttgtgt gtaaacatgg aaaatataca gtaaaaaata tagtagaaaa 1921 catgaaaaaa agtaatacag ctatacaggt aggatacttg ccatgattag cacttgcggg 1981 ctggacgttg ctctgggtga gccagtgagt gagtagtgag tgaaggtgaa ggcccaggac 2041 cctactgtgt aatactgtat gcttcacaaa cacttaggct acactaaatt tatgtttatt 2101 tttttatttt gagacagagt ctcgctctgt cacccaggct ggattgcagt gatgccatct 2161 tgtctaactg taacctctgc ctcccaagtt caagtgattc tcctgcctca gtctcccaag 2221 tagctgggat tacaggcgtg agccaacatg cccagctaat ttttgtgctt ttagtagaga 2281 cggggtttca ccatgttggc caggctggtc ttcaactcct gacctcaagt tatctgcctg 2341 ccttggtctc ctaaagtgct gggattacag gcattagcca ctgggcccag tcattaaatg 2401 taattgttaa tcaacaacaa attaatctta gctcactgta actcttttta ctttagaagg 2461 tcgggcgtgg tggctcatat ctgtattccc agcactttgg gaggccaggc ggattgctta 2521 agctcaggaa tttaagatta gtctgggtaa catactgcaa tgctgtctct acagaaaata 2581 caaaaactag ccaggcatcg taggcagagc ctgtgaaccc agctgcttgg gaagctgaga 2641 tgagaggatt gcctgagccc agaaggtcaa ggttgaggtg agcctgcact ccagcctgtg 2701 tgacagagtg agaccctgtc tcactagaaa aaaaaaaaag cgagaaaaag aaaataatgc 2761 cattttgaag ttacaactgc aaacagtgat gcaatccgat accttgcttg ggtagctgat 2821 ggttttgcct caggtcatga gaaatctttt tttttttttt tttttgagat ggagtcttgc 2881 tctgtctccc aggctagagt gtaatggcat gatcttggct tagtgcaacc tctgcccagg 2941 ttcaagcaat tctcccgctt ctgtgtaatc cctcagtggc tggaattaca ggcgcatgcc 3001 accacacccg gctaatttgt gtatttttag agaccgggtt tcaccctgtt ggccaggctg 3061 gtcttgacct cctgacttca ggtgatctgc tagcctcggc ctctcaaagt gctgggatta 3121 catgtgtgag ccacggtgcc cagctgagaa atcttaattc agtcaaaaca aacacagagc 3181 tggcaccata cccatgactt gggaccaccc cagaaaaaag ttcatcaacc atagttgtga 3241 aaaagatccg aggagacata tttctaccag agctggtaaa agatcaatgg caaatgttat 3301 tcagcattcc ccaagatctt cagccaagag ataggacaat tgaatatgga ctggactggc 3361 aacttcccac agattagata gggtatttct tgtcagaaag taaggaatcc cctgggcagc 3421 tacagtgccc tctattgatc ctgttggagt ttgggccaaa acgttccaca taccaggaca 3481 ccagaataca gcccctttgg tcagctgttt aatacggttg tttgatgttg cttatagata 3541 acttcacccc tcaggcaaaa tgcttggtat gtactcccag cccataatcc tcaggctgcc 3601 aatattctaa taatgactga gataaaagtt ccactgtcat tttgtttgat gggaaagaac 3661 tgctctgcca agtacctaca aaacactgct atttcggccc ataatcttct gttcttcttc 3721 tctgacctat gaaaggaaaa atgaacctct tcctacagag ggctcaacat tatgctgaca 3781 acttacagaa gaggcctgct gggtttgcat tttgttaccc ttctcaacac ttctcgtctg 3841 ccttggtggg catcgtctat gcaggaaaac gactggcaac acttaaaaag cattcattaa 3901 agctcttcaa cattagaaca gcacattagc tggtatgaca ggggatgatg tgattaattg 3961 acctactgat aagactcatt tcagtaaatg ccacacaaga atgtataata ggctgggtgc 4021 tgtgggtcac acctgtaatc ccagcccttc gagaggtcaa ggcgagcgga tcacagggtg 4081 aagagattga gaccatcctg gccaacatgg tgaaactggg tctctactaa aaatacaaaa 4141 aattagctgg gcgtggtgac atgtgcctgt aatcccagtt actcgggagg ctgaggcaga 4201 agaatcattt gaactcatga ggcagaggtt gcagtaagct gagattgcgc cgctgcaccc 4261 cagcctggca acagagcgag actttgtctc aaaaaaaata aataaataaa taaataaata 4321 aacaataata aaaaaagcgt aatagctagc ctatcctacc ctatattcta aaattcaaaa 4381 gtaatggttt ttgttatgaa atctcgtaag tcttgccata aagagagttt tcttcaaatt 4441 cgggtgacac caactgcagg gcacttaact cagttggtgc ccctatgtta ataataaggg 4501 aaccacaccc atgactcata acccaccctt actctgggag tagcgtcaat acattataat 4561 cctgcaggcc atgcactcac ttgctacaga ccggtcctga tgatccagct taaactggtg 4621 tgctccaaat ggtagtgaat gattacatga gtggctctaa tctctgtcct tgggtaccac 4681 aaggttggct agaatgctgc actttaggtt ctccatgggt ccaaggatga tggacaaatt 4741 tagttggcag acctgctaat ctcctacatg tgattaatag ctagacaaga gctatttttc 4801 attggaatga tcatttagca ccatctttgg gcattaaaga cattacctat taatggcatg 4861 tagaagctct cactaagaat attcagcagg ctctcaatga aagtttccaa agtatttctt 4921 gaaagaactc tgaaatatac ctcatgtata cactgtttgg tgaaattgca cagcactgga 4981 tatcttaatt gctgcacaat agagaaccag tgcaacaggt tattataact gaatgctgtt 5041 tatatattct agataattct gacaacataa tggtagccct gcaagacatg aaaagccaga 5101 taactgcaat gtttgaccct accctccctt tatcccactg gctttctctt tggtttagtt 5161 ctgggggctt acaatggcaa aagcagttaa taattcttgt tattgtttac attgccattg 5221 tggccttttc ttgaaactag aacataaatt tattaataca atgtccggca ctggatgcct 5281 ccttctaaag tctcagggac catctcagaa gaggtcagca catgacattg gaacagctga 5341 attagaggta gagtgtgggg acaagaatta ctttagttta aaatctgaat ctggctaaca 5401 ttgagtctag aaataacctc caaaatgtct agtttttgta ttactcttta tgtagaaata 5461 tctattaatt gttaagtttc ctcccaaata acccctgttg ttgtagaaat tataggctgt 5521 catgcctgca gctaactaca cattccttcc tgagcatgta ttctttcttc cccaagatat 5581 atgccatgga tatgggagat tgcgatgcaa atatctacct gtctagtgga cacccaagac 5641 cactcggcag tgagccgaga tcacaccatt gcactccagc ctgggcaaaa agagcgaaac 5701 tctgcctcaa aaaaacatac aaacaaacaa aaacacatag ctttctgttt ctttgggtct 5761 tcatttttga aggttctcat gtcatgtaaa acatttacta aataaatatg agtgttattc 5821 tcgtttatct gtctttcatc atagtgtgtc acctattggc tttgttgaca ggtgagaaaa 5881 atatattact ttttctctct cctaggttgg gggcaactcc ctactaatgt aagattttga 5941 aagatgcaaa atgttcatct cacatcacaa tcgtaacaat gtacatgcag ccaaaaataa 6001 caaagagttt catgaatcac attttcactg ggacttgtat attaatacca ttgctttaga 6061 cttctcctac atcattactt tagggtaaaa acatttaaat ggagcatgaa aagaaattac 6121 tcaccgtaaa taagcctagt tatagtttta ttttaaccaa caattttttc gtgttcttca 6181 tctgtagctt ctttgtgacc tgaacaaggt ttgagtagaa ataacttctc ccaaaaagcc 6241 aaaaggaatt acaaagagga ttagcaaaag aagaagccct atttctttta aaagcttgca 6301 atattaagtg ccctcttgct ggtatagatt aaatacaaat ataataaagc aaatgtaagt 6361 gatattataa atgagaaatt agatataacc aaagctatga gagaggcagt aaaaactaat 6421 tacaagacgc ttataataac ttataacaat taaaattatt tgaaaatgga aagtaagtat 6481 tgtttttacc acaaaataca aactactaac catcatagat cagacaaaga cttaccactt 6541 tttaattacc agactcaaac aattgtttta gcctttttct aatcaaattt caaataacat 6601 ataattgcta ggtaatacaa aaatttgaaa agcccggaat tcctcagatt tttcctgttg 6661 tggttattgg atacttgctg caggtttctg cataaaggct attttacaaa caagcaacat 6721 tttttagcag taagatacta aagccatcgt cattcaaaga aggcaaaaac gggcacaaat 6781 tataataatc attgttatat tattgttttg aaggttaaga taaaaaggat aaatgcaatt 6841 ttaagtccaa aatcaggaaa gtatttttga aatgtatgac agctaaagga taatatctct 6901 taactataaa tcactaaagg aacaatattt ttcagaagaa aaatagaact attcatgatt 6961 tgaccaagcc tttggacact aaacaaaata tttcaaaaca ttaataaaaa gaaagcagtc 7021 tctaatctta atgcaacaga ctgagaaaaa ataatgtacg gaagggtaca acagaatgct 7081 tccatgaggt atctttaaaa actgtttcct aaaatggttc ttggtcaaac agctgctggc 7141 tcactagaca aagaggagtg tagttagcct tggacaaaaa ggaactgaag gaagaccctg 7201 gaaaagatca aaagtgacct tcattttatg gagagaaaca agctataaca tgtagtatct 7261 aagctgatta gaagaactaa aaagagaagc tcatacttgt gcatcagaag gtaaatgaaa 7321 gagtgaagtt acctctttgt tttaaggaag aaaggaaaat tgtggatgtc atctgttttc 7381 tgtttacata tttcaggcat ggatagccac aatgtgattt taagacggtt agttacaact 7441 gatttgaaaa aaaaaaaaaa tgcttcactc tatgagaaat ttcttcccaa gtatgaaacc 7501 ttgtttttac aggcaatttc ctatactttg aaaaaatcaa aataataaag taaaagaaaa 7561 ataattcagg tgaagttaga gaaaaaaaca ggcagcatta ttttaaagtt gtaaactatt 7621 ttgtttactt atagtttaat ttacatgtag tagatatgca tttgtaaggt tcttcggctc 7681 aggtaggaga tcattctatt tcccactgca ccctacttca tcctcccact ggcaaataat 7741 tagattatcc ctgggaaaaa aagatgccag taaaattgat catgtttaaa tgcatcagtt 7801 gctaggtgat ttatctgatt aagtcttgaa acagtagaac ctagcaatta aagtgagcat 7861 taacttctac ctaccaaatc agaagactat tctaactttt tgagaattag atgttgaaaa 7921 tatggcccat gaatttagca tggttaaaat aaaaaacatg caaacaaaac aaacccaaca 7981 tcttgaaagg acatttgact ctaaagtccc aaaaataatc acaagtctaa aaatcctaag 8041 tttagtgtta ctctattaca cctttttatt tgtaagtgtc ctttcacaaa agttttaaat 8101 tttgctcttg tgcattttat ttaccttttc ttttgttgtt tgtgtctttg gtgacctgcc 8161 aaccattaga cttcaaaaaa cagcctatag ccaagctgca ggataaatga acacataagt 8221 tgacttagaa tagtcaactc tgtctagtat acaatttatg ggggatggtt tatgaccaca 8281 tatatttcta ctttgatggg aatatcttga gataaaatta gagagaatga gtggagtaat 8341 attcacaaca tttttgctgc attcatccct gaatttgaag aaataccaaa gtacatcttg 8401 tgaggagaaa aaataaataa attcatataa aatgttgtgg gttttattct ttatgcagtg 8461 gtaaactgtg tttgcataca ccatagcaat taaattaggg ctacaaaggg tatttaacta 8521 atgagcataa aataccttaa tgtacctcaa atgcaattaa ttgcattgga ccaatctaag 8581 ttactattct tcagttttca tttttatttc attattcatt tcatttttat tctgatataa 8641 aaatgaacca ggatctgtgt gaaattattt gaatctaatg tctttgaaca tttttcttac 8701 cataccttaa gattaaaaaa acaaaaaaaa atcccttagt ttggcaactt ttgctgttgg 8761 ttaagcccgt ttggatttaa cattgacagg accagctaac ttcctaccag ttaacattgc 8821 ttgtgggcct gactgatgta aaaatattta acaactgttg agaaatagtc atcagtgaat 8881 cttaaaggtc gtactcatat aaaacaatat agcgctttaa ctttttattt actgaatttg 8941 agagaaagga atagtgttcc agaaactggg atcactcagg cgctcaggga agcatatggt 9001 tagctgatga tataaaggcc caatattgga agaactgtta atgtagtccc ctggagacag 9061 tgagtgaatt attagccctg gttgagagat taaaatgtgt tgatatgata ttatagaggc 9121 aaatttaatg ctctcggcct ccccctgcat ccttattttt gcatgtaggt ttattacttt 9181 acaggagaac caacactttt ttcctttttt tttttttaag tggttgagtt tgtttttctg 9241 aacccagagg catgtttaca catttcatta atttaagtac cagagaagta tactgagaac 9301 ctactacatg tcacacacta tcctaggagc tgtagacaga aaaataataa aatctcgtgt 9361 ctttatttaa attttgttga ctgccacccc caagtgagtc tcctcacaca ccccgtcctg 9421 taatatatgc atctgggagg tcttttttgc cttcttaaaa acatatagat ggttggacat 9481 atgtatataa gaatataaaa ttcaccactt tatcttttgt gaatgtgtgc tgtgaagaac 9541 tcctttactg gggtgatgga accagtggct acaaagtaag gagctggttt actgctgtaa 9601 agggttcgcg gctttgaatt tcaagctctg gttctgtgtc cttgggcacc tgcgcgtgaa 9661 tcgttgccgc gaggctgggc caagttaagg ccccacgcag tttggcttcc gggccaagga 9721 agccccacag ggtgccccac agggtgaagc cccatgccct acagggtgaa gcggctgaag 9781 ctggtagtgg ttccgaggaa gcggtcaaag tcccgctcca gaggttcctc ttcttggttg 9841 tcactcccgg aaccccgcca ggggtctggc ttccccatcg acacctcctc ctgttcagtc 9901 accatcacgt atcccacgag gccggacggc actgccacct cgtcgccccg tagactgcgg 9961 ccccgaaact acacttaaga gtccctcggg gccctggcgg atggcaggcg tgatgaagcg 10021 ccacaccgtg gtgggccgca gggtatcagg tgtagtgtgg cgggggcggc gtcctgcagt 10081 gtggctgggc gcaactggac gctgtacctc tccatagccg cctagtcgcc gctctccatc 10141 ctcgctcctg tgcgacgcca ctttatccct cttaaactgg acgattcagt agtaccgggg 10201 aacgaaggca acaagaccta accatagcaa caacattttg tttttgaaac ccatcttgca 10261 actgcctgac tgcaaaatgt acttgcagat tctgcagaca aaaattgctg ccacttatta 10321 aaagacttca tagtgaaaac tgacttttta cccaaaccac tgattcaagg acttttgcgt 10381 agaaactcat gaactgttaa gataaccctt cattaccaat taagataata ggttgtaagt 10441 aacctaatga gtaccaactt tctcttattt cctttaaaaa ctgtctctca aaacagccgg 10501 tcacgacaca atgtcagatg cttctggaat ctgtttctct aattacagtt ctaaaactgc 10561 caacaaactc cttatttgga cttcagtttc tctgactctt tggttcacca tgttgtgcag 10621 ccatcacctc tctctagttc cagagcatat tttatcaatc ctcaaaagaa accgtgcatc 10681 caccagcagt aactccccac aacctctttc atccagtcct tagcaaccat taatctggtt 10741 ttagtctcta ttcatttgcc tttcctggat atttcatata catgggatca ttcagtatct 10801 ggcctcttgt atctgacttt ttcacacagt ataacatttt caaggttcac ctatgtggtg 10861 ccttgtgtca ttttttgcta ttttcatgtc aggttttgaa atggtatgtt ttttctcatc 10921 ttgtaaggga tttcctgttg tagaaaatct gtttctgtgt atgactgcaa tttaaccatt 10981 ttatttgcta tccagataca tgttttaact tgagccaact atatttccac tttgtataat 11041 gaatactaaa ttttgtatct tagatactaa actttagtag ataactagga tacaaattac 11101 agttttcata tttttctttt catggaatat aattttttaa agtttctgat attaattcat 11161 tataacattt actggacaaa ccaaccacat ttggttctat gaactctttt catattttac 11221 gggattttta ttttgttgtc ttaggaattg tggcatcatt cagttttgct gagagtttat 11281 ttttattaat aggttctgtg tactatgcaa cctgtagttg ttactcctgt tttgcaaata 11341 tggcattaaa acattggtga gtggggggac tttcctccca ttttatttca gcaaagaaat 11401 cttgtcattt ctcttcagac tttaagattt attgaggaga gttattcact tgtttagtct 11461 ggtaaactgt gacctttaaa ttttgaagga ggttgaacta aaaggtgaaa tatgataaat 11521 tgatggctct atttgttgtg aatggaaatt taatttagac ttgtaaaaga ttgtttttta 11581 aattgctaat tgtgtgacag tgaagagaat gacttcaaaa tatcccacta tacaatctgc 11641 aaagaaaaga attgtgtccc ctttttagtg tagcttaaca cttcactgaa actgttttga 11701 gttcttaggt catatttttt tttctctaaa cgaaacaatt acttttctaa aagtcaaatg 11761 ttagccatcc tagaagttgg gcataaaata cttgtaagta tatgctaata ttctgatact 11821 taatgcctgt gaaaaatgtg tatagaattt tcaattttta aatagaagtg aagaaaaagc 11881 gataataatt actataaatt caatatgcag ttatgtatgt atgtgtgtgg ttaagacaat 11941 taggttctca ttaagctttg tttttttaaa gataacatac acatatattg ataatgataa 12001 acaattcata tagctttttg tgtcctctcg ttttgtgaca taaaaggtca atgaaaaaat 12061 tggcgattaa gtcaaattcg catttttcag gacagcagta gagcagtcag ggaggcagat 12121 cagcagggca agtagtcaac gttactgaat taccatgttt tgcttgagaa tgaatacatt 12181 gtcagggtac tagggggtag gctggttggg cggggttgag ggggtgttga gggcggagaa 12241 atgcaagttt cattacaaaa gttaacgtaa caaagaatct ggtagaagtg agttttggat 12301 agtaaaataa gtttcgaact ctggcacctt tcaattttgt cgcactctcc ttgtttttga 12361 caatgcaatc atatgcttct gctatgttaa gcgtattcaa cagcgatgat tacagtccag 12421 ctgtgcaaga gaatattccc gctctccgga gaagctcttc cttcctttgc actgaaagct 12481 gtaactctaa gtatcagtgt gaaacgggag aaaacagtaa aggcaacgtc caggatagag 12541 tgaagcgacc catgaacgca ttcatcgtgt ggtctcgcga tcagaggcgc aagatggctc 12601 tagagaatcc cagaatgcga aactcagaga tcagcaagca gctgggatac cagtggaaaa 12661 tgcttactga agccgaaaaa tggccattct tccaggaggc acagaaatta caggccatgc 12721 acagagagaa atacccgaat tataagtatc gacctcgtcg gaaggcgaag atgctgccga 12781 agaattgcag tttgcttccc gcagatcccg cttcggtact ctgcagcgaa gtgcaactgg 12841 acaacaggtt gtacagggat gactgtacga aagccacaca ctcaagaatg gagcaccagc 12901 taggccactt accgcccatc aacgcagcca gctcaccgca gcaacgggac cgctacagcc 12961 actggacaaa gctgtaggac aatcgggtaa cattggctac aaagacctac ctagatgctc 13021 ctttttacga taacttacag ccctcacttt cttatgttta gtttcaatat tgttttcttt 13081 tctctggcta ataaaggcct tattcatttc agttttactg gtatttcatt ttaaacttaa 13141 tttcaagaca agttgtgtca acacgattaa catgcaaaga aataagacat ccagaagtga 13201 gcctgcctat gtttgtggcc gtcagagtac taacttgata caaacggaca ctgtggctta 13261 ctttaaatgc tctaatgaga aacacacttg aaaattgtac caaaaaaaat cacacttcta 13321 tatgcagcgt gttaagcagt cctctctaga ccgtgtattc attggtcttt cagctacttt 13381 gtacgtgtct ataaattgca ggtaactaag gaatggatat gtaagcagga tcaaacttgt 13441 ttctttctct ccccttcacg ctgtggaaaa aaccagtttt acctccactt gcaattcagt 13501 tcctttactc catataaatc caaacggttg acatttcctt tcaactagtt ataaaatgcc 13561 tctggtaaaa caaaatattt aattccttgt catttttgta tctctatgaa acttatcatt 13621 ttgcctttct tctgaaaact atcttttaaa atggcaatct acttgtttcc atggcctatt 13681 aacttttaag cctgtggaat gaaaaataca gaattttcat tctcaacagg aatgttgaga 13741 acgccgacta cccagattat ggagatggaa gggcgggggc tggtgaccca actggtttaa 13801 tttgatggga tttaaataaa aatgtaatgc caagacttca taaaatttgc acataagcta 13861 aagcaggaaa ctagaagttt acaaataact gtaaaccaca gtttaaagta taaattacaa 13921 ataaattttt ctacaataaa aataaatcaa aagtcaaaat taatggtatg tatcaaagtc 13981 agctttccaa tatttcaaaa cttttcagag acattacctt aaataataaa actcaaacac 14041 taaaagagtg tatattagag atggatgtca tgtttctcaa agttcagggt ttaaaaaaag 14101 ctttattcca gagacaatat tacttaatct gagctttatg ccaaatgaga actgtatatt 14161 gcccattacc ttttaaacac tgactcaaaa ctactttgta atatttaaaa ccactctttt 14221 cactctccca aaaataacac ccattctgac cttcattacc atacattaat tttccttttc 14281 ttgaacctct gacacagatt ctctgtttga ccaaacttta gtcaggtttc tgaaccttct 14341 cctagactcg cctgtgcact tccttgccaa actcagtttt ggcaaagaac tctcctaagc 14401 caggttagcc agactgtacc cccaccgcat atctgatcac actcaacact tgatcgatca 14461 ccctcaatat ctgatcaggt ttctcatcct ccaccaccct ctaggtgatg tctgatcagc 14521 ctagcctgtc ttcagcaaga atcctcctta tctatgatgt cttctcttag tgattgtcca 14581 tccattgacc ctactccctg ctccttggct ataaatcccc acctacccag ctgtgtatgt 14641 ggagctaagg ccaatctctc ccccagtgca agacctcatt gctgtgctgt tggtcgtgaa 14701 tagtcttcct tatcaagttt gaacaagtat cactgaatac catttttcct taacaatgtg 14761 aataaatata aatacaaaat attatgtgga gcttcttttg cttaatacat tttggtggtt 14821 ttcagattat taaaaattat gacatcatgc atgtatacat tttggaaaga ctaaatcaag 14881 ctaatgtatc tactacctca tatacttccc tttccttttg cactgggaac atttaaattg 14941 tattctctta gcaattttca agcatacaat acattcttat tacctatatt caacatgctt 15001 tacaatagat ctgaactaat tgcttctatc tagatgaaat attataattt ttgaataaca 15061 atactcatct cttcacctga ccctactaga gggtagccac aattctactc ccaaaaatat 15121 gaaagacttc acatttgaat accctcctgg cccagggaca gtgttattct tctccatacc 15181 attctaattt tagtgtatat gctgcccaaa ctagaatttt taagaaaaaa ttgttgtgcg 15241 tatttcattc tatttttgtt ggacaaatga atggacactt actggtttca attattacaa 15301 acagtgcttt tttatacatt cttatacatg tacaatggag aacatttata agaatttctg 15361 tgaagcacac tcccaggaga ggaattgata ctttatgtat ggtttactta ggaattactg 15421 aaatgaagca tttttgtatt tttgtgatgt ttaaacaaac atcaataaac aaacaaattc 15481 taactataag tatcaagtgt gctttggctg ccaatggatg tttgctatat tttaaaatgc 15541 caacacttaa aatacattaa gagggagctt ggtactgaat aaagaacaca ggctctggaa 15601 ggtaaaccct tcagctttgg ctcacttgag tgccacttac aggcttgagc ttgaacaatt 15661 tacttcgctg tctgagtctc agttttttca attctgtaat aattttgtta gttgaaatgc 15721 tatttactaa taaaaccaac ccatgaagag atgtgaaggc tacatgaatt aaggtacaca 15781 agcattaaat aagtgagtgt atataagaaa tacttagaac aggctgggag cagtggctca 15841 cgcctgtaat cccagcactt tgggaggcca aggtgggtgg atcacgaggt caggagattg 15901 agatcatcct agctaacatg gtgaaacccc gtctctacta aaaatacaaa aaattagcca 15961 ggcttggtgg cgggcgcctg tagtcccagc tacttgggag gctgaggcag gagagggctt 16021 gaatccagta gatggagctt gcagtgagcc aagatggcgc cactgcactc cagcctgggc 16081 gacagagcaa gactccatct caacaaaaaa aaaaaaaaaa aaagaaaaga aaagaaaaaa 16141 agaaatactt agaacagaca tcagcattag tgaagaatta actaaatatt tgctattatt 16201 aacctgaaag taattgctcc atctaaaaaa tatcttagtg aagagtaatg tagatattca 16261 tgattgatta gacattattt cattaggcat tgaagtgggt aatgtttttc ccaatgtcac 16321 gattttactg atgtgaaata cattaatttt ccatatttgt tccaaatctg ttccacaaag 16381 atttgtttct acaaggtgta gaaacaaaag gtagttatca gagttctagt atttcaagaa 16441 atttaattaa tattttattt catattggac tatgtaataa tgtcaagagt aaaaatcttc 16501 atgagatatt gtgcacagat cttttattta tagatgcatg aactttggca aatgcatatg 16561 cctgagtaac tatcacccca actaggttac aagctattta catcattcct tcaaccctta 16621 aaatttcctg tgcccttttg caattcctct gtcaattgca tgatttctat catgcaattg 16681 agagatcttt tttttttttt tttgagacag agtcttgctc tgtcactcag gatggagtac 16741 agtggcccaa tctaggctca ccacaagctc cgcctcccgg ttcacgccat tctcctgcct 16801 cagcctccct agtagctggg actacaggcg cccgccacca cgcctggata attttttgta 16861 tttttagtac agccagggtt tcaccgtgtt agccaggatg gtctcgatct cctgacctcg 16921 tgatctgccc acctcgacct ctcaaagtgc tgggatgaca ggcgtgagcc accacgcccg 16981 gctgagttca aggctttata tcaacggagc tgtattctga agggcatctg gcttctttca 17041 ttcaggttga aatttgcatg aatttaccac agtttatgta cactgttggt ggtgaacact 17101 gggctatttt gagcttgagg cttttattaa taaagctgct atcaacagtt gagctcaagt 17161 ctctaagtgg acatctgttt ttaatgtacc tgggccaaca tctaggagtg aaattttcag 17221 ctcagttaat tttgtgtgta cctttccaag aattttccag ggtttttgtg ctatttcaca 17281 ctgccacagc aatgtctgag ctttccgttg ctatacatgc tcaccaaaat ttggaactgt 17341 tagtgtctta aattttaacc acagaatgaa ggtatgtagt taaaataatt aaagtggtta 17401 taaaaatata cttgattttc tgctagaaca agttacccct cattactctt cttcctcaga 17461 atttctaagt tataatcagt tttcttaaat ttgaatattg ctgatgttcc ttaaaatctc 17521 tatgctcttt agaaatgacc agaaagttct gctttttaga cagagatcac ataaagttgg 17581 tagattacag taaggaaatt aacagaacta caaatacgtt tagcaaaaac aacaacaaaa 17641 aaagttctgg attgctgttt gttagtttaa taatgttctt catatctgtc cagaaaattt 17701 ttgtaagtct aaaatctatg taatttgagt tttccttgtt gtcaggtaag aaagtaatcc 17761 ttcctcctac ttttcattag agtactacct ttagaaaact agtattttcc cctctgtttt 17821 ttgaggtata tgtaaatctt tttaagaact aactaagcca gatggaagga tggcacccca 17881 acaggtcttt cttaatggcc ttgagccttc tctttgaagt gtaaggagtg ttcccaagtc 17941 cccttgtctt tcctgagcaa attgagggtc aggctgctat ttttttgtgg cccaataatg 18001 agatgcagat gaactgagaa ggaagagagt ttttatttct acaagtggtt acagggagaa 18061 ggcatgtaga taatattacc agacccactc aaaatgataa agttttatag agcttatata 18121 ccttctgagc tatacgtcta tgtgtaagtg cccatttatg taacgacata aatgattaac 18181 tttttttttt tttttttttt tttgaggcgg agtctcgctc tgtcgcccag gctggagtgc 18241 agtggcgcaa tctcagctca ctgcaagcgc cgcctcccgg gttcacgcca ctctcctgcc 18301 tcagcctccg gcgtagctgg cactataggc gcccgccact acgcccggct gatttttttg 18361 tatttttagt agagacgggg tttcgcggtg ttagccaaga tggtctcgat ctcctgacct 18421 cgtgatccgc ccatcttggc ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc 18481 cggccttaac ttcttttaat ctataactaa ggtctgagtc ctgagcatca tcctcaggag 18541 cctcagtaaa ttgacttaat ctaatgggtc caggtgccgg ggtgattacc ctcatgttgt 18601 ctcctgctaa atccaggagg tttgcagagt tccttcaaag ccccaataaa cttgtttgtg 18661 gaggcctggg gagtttctgc agacccccag taaaacttgt ttaatcctaa atgggtcctg 18721 ttaagaattc attcattatt ttgtcatgct ttaaggccct ggaaagtcct gggaaaaact 18781 ctcggtgggc ttttgttaca ttccagcctt tgtataacag cactggcttt taatatttaa 18841 ccactcagtc aatactgaaa caggtgttgt gggggcctgt gttagtgaga cctggcctgc 18901 gacacttcca ctctcagttt ccaagtgctc tcactcatac actgtgaagg gaacataaat 18961 ctcgggaccc ccaagtcact aagctaaagg gaaaagtcca gctgggaact gcttagggcc 19021 aacctgcctc ccattctatt caaagtcccc cttctgctcc ctgaggcaga ttcgtatctg 19081 attgcctcct ttggaaaggc taatcagaaa ctcaaaagaa ggcaactgtt ttgtctctca 19141 cctgtttgtg acctgaaaac cctctccctg cttccagtct tcctgccttt gcttcaagtg 19201 ctccaccttt ccagacggaa ccaatatact tctttttttt tttttgagac agagtcttgg 19261 tctgtcgccc aagctggagt gcagtggcat gatctccgct ccctgcaacc tccgcctcct 19321 gggttcaagt gattctcctg cctcagtctc acagcctcag gcacacacca ccatgcccag 19381 ctaacttttg tatttttagt agaaacaggg tttcaccatg ttggccaggc tggtctcaaa 19441 ctcctgacct tgtgatccac ccacctcggc ctcccaaagt gccgggatga caggtgtgag 19501 ccacctcgcc tggcctaatt gacttttttt tttttttaga aaaaaaaaaa aaaagcccaa 19561 ttgtttaaat cggtttaaat ttaaatagtg gcttaaattg gctttcaatg aatatgtgtg 19621 accctgagca tttctaacac acatcagctg gtcaattgca taattccctc ttctttgaat 19681 ttttcttgtc agcgaatgga atgatcatat ccctcttctt ttagtcgcct catagtctct 19741 ctcctccatt ttctttttct tttcttttct tttctttttt tcaccatccg tctgttttgg 19801 taggagctat ctatacacag tttgtgctct ataaccagaa tttctgcatc tatagatcca 19861 accagtcttg tattgaaaat atttcaaaaa ataaaatata gcaatccaat aaaaagtaat 19921 acaaataaaa aataatatag tataacaact atttgcatag catttacagc gtatcaggag 19981 ttatgagtaa cctagagata atttaaagta taccagaggc tggacacggt ggctcatccc 20041 tgtaatccca gcactttggg aggccaaggt gggcagatta tgtgaggtca ggagttcgaa 20101 accagcctgg acaacatggt taaaacccgt ctctactaaa aatacaaaaa ttagctggga 20161 gtggtggcgc atacctgtaa tcccagctac ttgggaggtt gagtcaggag aatctcttga 20221 acccgggagg tggaggttac agtgagccaa gatcacgcca ctgcactcca gcctgggcaa 20281 cagagtgaga ttctgtctcc aaataaataa ataaaaatat acaggaacat atacttgcat 20341 tatatgcaag tactgcacgt tttatatcag agacttcagc agctgtggat tttggtgtct 20401 gcgggtaaga aggcgtctta aaactaatcc cccatgaatt ctgaaggatg actgcatttt 20461 gcatattaaa ctttccctgg tggtatgagc agtaaggagc ctcttctagt tgacagctca 20521 tcttttgact tttttcagag cctgaagttg gaacattatt attattatta ttatttaaat 20581 gtagacatag gctaacccat ttttaaaggt tatgcttttt tttttttttt tatactctca 20641 ctctgtagcc tcagctggag tgcaatggca ggatctcggc tcactgcaac ctccacctct 20701 cgagctcaag tgattcttat gcctcagcct cctgagtagc tgggactatg ggcgtggtta 20761 tgctttttgt attgtatcta agaaaaacct cctaactctg aggttataag acatttccct 20821 ctatttttca aaaaaaaaag ttttctttga tattcaggtt tgtaatccac ctgtagttca 20881 tttttaagta ttacatgagg ctgggtgtgg tggttcacac ctgtaatccc agcactttgg 20941 gacaccgagg tggacagatc acctgaggtc aggagttcga gaccagcctg gccaacttgg 21001 tgaaaccccg tctctactaa aaaaatgcaa aaattagcca ggcgtggtgg cacatgcctg 21061 tagtcccagc tgaggcacga gaattacttg aacccgggag gagaagttta tggggaggtg 21121 agatcacacc actgcactgc agcctgggtg acagagtgag actccatctc aaaaaaaaaa 21181 aaaaaagtat cgcatgaggt tgagatttga tcacgtatca tccacaagaa cagccaattg 21241 tctcagtcct gcctatgtgg aatttggttt cttctctcat tgttttatag ctgcgacctc 21301 tctcatatgc caagacacaa ttcagtctgt ttcaaaccct tctgaaccta tgcattgctc 21361 agtttctctg tctctgagct acggttttca tttctttctt cttttttttt cttttctttt 21421 gagacagggt cttgctctgt catgcaggct ggagtgcagt ggcgtgatca tggctcactg 21481 cagcctcagc ctgggcacag gtgatcctcc cacctcagcc tcctgagtaa ctgggactag 21541 aggtgcatgc caccatgtcc actaattttt tttaattttt ttgtaaagat ggggtctcac 21601 tatgttgtcc aggctagtct caaactcctg ggctcaagaa atcagccagt ctcagcctcc 21661 caaagttctg ggattacagg cgtgagccac cacgcccagc tccacatctt tattcttaat 21721 aacatttcca tctacagatc ctggtatttc tggggaaatg ctttttaaaa tctcacactt 21781 agatctttaa atttccactt ttaacttaac ctgaggtgca ggctgggtca agattccatt 21841 atcattcagg gaatcattct tttagtggga agaacctcta tttgggcctc aaagttgttt 21901 gcacctcgaa ggttattttg tctgattttg ctagcccaac ttggctttct ttctattggt 21961 tatttgcttt gcgcattttt tttccagcta ttttcacctt ctctttggca ttttgttgca 22021 aatgcttctc tcgtaaatag catgaaacat tttatttgta tttttagtag agatggggtt 22081 tcaccgtgtt agccaggatg gtctcgatct cctgacctcg tgatccaccc gcctcggctt 22141 cccacagtgc tgggattaca ggcgtgagcc accgcgccca gccgaaacat tttaaattgc 22201 atcatctcat ctccgtcttt ttacagattt attaaatcac tgaacatcta acataactaa 22261 ggattcacta ttcctactat tttattttac ttttcttctt ttttaattcc ttccttctta 22321 acttccattt agtattcagg tttttaattt attatttttt tttatttttg agacaaagcc 22381 tggctctgcc acccaggcgg aagtgcagtg gcgcaatctc agctcactgc aacctccacc 22441 tctggggctc aagtgatcct cccacctcag cctcctgagt agctgggact acaggcattc 22501 accacaacac ttggctaatt ttggtattta attttactct tcattttttt tgtatttaat 22561 tttattcttc attcttcatt ttcatccact agttcaggag ttatatgttc aatttttatt 22621 tttagcagga acacttaacc ttaaaatgtc aaaatccagc aatatctaac tacttttact 22681 cagttcttcc tttcttatcc tctctgtgaa ggtagcctca tcttcagaag ttaagtttca 22741 ctttgttgtt aacaccagtt attattattt ttttaaacag ctaacaatca cttagtttta 22801 ccggcttgtt tgcaatgtct ttggctggca ttgctattcg catcctgctg tgctattcag 22861 ggtgtgaatt tcttcttatg gcagagcatc atttggaaga tttttcagca aaatttgcag 22921 agcttcttgt tggaaatctt tatgctatca tcactctcca acagcaattt agctgcattt 22981 tcaattcttg cttgatagtg aacatttcct gaacaaccta taaacacccg tccattgact 23041 tcctgacatc tctcatggca gtcaaggagt ctgctggaag tctgggagtt atttcttttt 23101 gttgttgttg ttgagacgga gtctcgctct gtcgcccagg ctggagtgca gtgggagatc 23161 tcggctcact gcaagctccg cctcccaggt tcatgccatt ctcctgcctc agcctcccaa 23221 gtagctgaga ctacaggcgc ccgccaccat gcctggcaaa ttttttttgt attttttagt 23281 agagacgggg tttcacctgt aatgcccaac ctggttttta ctaacctgtt tttagactct 23341 cccttttcct ctactcacct agccttcttt ccacctgaat ggactctccc ttagctaaga 23401 gagcccgcca gactccacct tggctctttc actggcagcc ccttcctcaa ggactgaact 23461 cgtgcaggct gactcccagc acatccaaga atgcagttaa ctgataagat actgtggcga 23521 gctacatccg cagttcccgg gaattcgtct gattagtaac gcccaaagcc tcgcgtctat 23581 caccttgtaa tagtcttaaa gcccctgcac ctggaactgt ttactttcct gcaaccattt 23641 atccttttaa ctttttgcct actttacttc tgtaaaatgg ttttcactag acccccagcc 23701 ctccccttcc taaaccaaga tctaaaagtt aatcaagccc cttcctcagg accgagagaa 23761 tattaagcat tagccgtctc tcggtcgctg gcttataaag gactcttaat ttgtctcaaa 23821 gcgtttttct aactcgctca ggtacaacat accgtgttag ccaggatggt ctcgatctcc 23881 tgaccttgtg atccacccgc cttggcctcc caaagtgctg ggattacagg cgtgagccac 23941 tgcgcccggc cccgggaatt atttcttttt cttttctttt cttttttttt tttttttgag 24001 acggaatctc actctgtcac ccaggttgga gtgcagtggc atgatctcgg ctcactgcaa 24061 cctccacctc ccgggttcaa gtgattctct tgcctcagcc tcccgagtag ctgggattac 24121 agacatgtgc catcatgccc agctaatttt tttatattta gtagagaccg ggtttcacca 24181 tgttggccag gttggtctcg aacccctgac ctcagatgat ctgcccacct cggcctttca 24241 aagcgctggg attacaggtg tgagccaccg cccccagcca aaggttacct ttttaattga 24301 aaataaataa ataaattgaa gtcaaattta catataatta accatttaaa agtgaataat 24361 tcaggggcat ttagaatatt cacagtgttg tgcagccatc acctctgtgt acttccagaa 24421 tgctttcgtt cccccaaaaa gaaacctgga atttatcaag cagtcactct ctctttcccc 24481 tccagcccct ggcagtcacg acattgcttc tggttctagg gattcaccta ttcttgaaat 24541 ttcatcttca taaaaatata cattatgtga tcatttgtgt ctggcttctt tctctttgca 24601 aaatgttttt gaggctcaac cactttgtag caattttcag tgcttcattc ctcttcaagg 24661 ctaaataata ttccattgtg tgaatggacc acattttgtt tatccattcc ttccactgat 24721 taacatgtga gttatttcta tcttttagct ataatgaata gtgctgctct gaacatttgt 24781 gtacacattt ttcttggaac ccgtgttttc aattatttgg agtacatacc taggagtgga 24841 attgctggtt catagagtaa tcctgtattt aacttttgga ggatccacca aagcatttgt 24901 tcacagtggc tgtaccattt tagactccca ccagcactgt acaaacattc cagtttcatc 24961 acatccttgc caacacttgt tattttccat tttagatttt agccatctca gtggggacga 25021 agtggtgctt cactgtggtt gtgatttgca tttccctaat caccacattt tgctttcaat 25081 atctgctatc tggtacagtt gttgcatctc tttcatatac cccacagttc tgtaaaacat 25141 cctcaaccat cccctccgta agcatcagtt caaggacgtt gcctcagttt ttcccagctg 25201 aagagacact cacctgagtt tggcttcttg gtgggttctg tgaacaaaga gagaaaataa 25261 atgcaatatg cattgaagat tcactggaag catgaagaaa ggccatgaac gccctgatga 25321 aaggtgaaaa gggagtgaca aagccataca aagtaaatga tatagacaca aattccatgt 25381 tgaattgaga aaaaaggcaa gaaaacaccc ttaaaatcca ttcaaaaaca atccaccctt 25441 gaggctgtga ggatgcccac aggatggcca aagtagctga gaacatccaa gacaagtaag 25501 tgagccacct gcccgtagag ccaccacagg cacagggccg gacggtggag ggtgaggcag 25561 ggggaggcaa aagaaaagaa ggagaaagac ttgcagcttc cagaatgatg gcttatctca 25621 ctttctttca ttcattcatt tattcattca gcaaatatgt attgcatgga tgtggatatc 25681 aggtcaggag tgctcagggt gcagcctgtg actgccctgc ttagtgaacc tgggtgtgca 25741 gtgtggatgg gcagacgagg gcaggaggag gtttgaggaa gatggctcaa aggaggtgga 25801 gtcggccatg gctgggaaaa ttggagcttc agggctgtgc aatggccaga cttcttccaa 25861 acagaggaaa caccgtattt gaaagcatgg tttacataac agcctagcat tgccttggta 25921 ggaggcggct ggaacacggg tttgtgtgtg gtgcggacat tcactgggga tgcagagact 25981 gcaagaagga gagcttgaac ctccaaacgg ggcatatggt ttcgcatgtc ctgtatttta 26041 atagaaggtt tctcatcacc ttttctcctt cctgccacct ccctctccct tcctgccctc 26101 ccttcttcct tccttccttc tctccttcct tccttctctc cttccttcct tctctccttc 26161 cttccctctc tccttccttc cctctctcct tccttccctc tctccttcct tccctctctc 26221 cttccttcct tctctccttc cttccttctc tccttccttc cttctctcct tccttccttc 26281 tctccttcct tccttctctc cttccttcct tctctccttc cttccttctc tccttccttc 26341 cttctctcct tccttccttc tctccttcct tccctctctc cttccttccc tctctccttc 26401 cttccctctc tccttccttc cctctctcct tccttccctc tctccttcct tccctctctc 26461 cttccttccc tctctccttc cttccttctc tccttccttc cttccttccc tctctccttt 26521 tcccccttct tctttccttc ctccccaccc tcctttcttc cttcctcccc tccaaccttt 26581 ctcctctccc tctctccttc cttccctccg tccttccctt attctttttc tttcttcctt 26641 tcttctccta tttttctttc tttccttccc ccctccattc ttcgcttcct ccttcattcc 26701 tctccttctt ccttcctccc ctcagtcttt tcttatctcc tctcctttct tcttccttcc 26761 ttccctccat cgttccctct cttcttcctt ttcttccctc cttgtttcca tcctttttcc 26821 tccctccctc cctgttttcc tctgtccttt cattgtttcc ttcttttctt tcttctcttc 26881 cttttccctt ccttcctcct cccaccctcc ctccctctct cccatccttc ctttctctgt 26941 cttcttttta aaatgtaaaa ttataacgat tttttaaaat agtgatacac tcacagagtt 27001 tcttaaagcc caggcagtat agacagtgaa aagtgaattc cccacagtcg ccatgcaggt 27061 gactgtggtt cctctgcact ggggcttcct tcccatgagg aagggctgtg gatgaacagg 27121 cacatgcctg cccattcatg gggggctctt ggtagcaaaa cgatgcattc aaagatgtgt 27181 ttgcaggcac tggcagcctt ctaggtgagc aatgaaaagg tagtagcagg tgagtggaaa 27241 ataagtttat ttatgtgatt tttaaaaaat taatgtgaaa ctcacatata aaatttgaca 27301 ttttaaagtg tgcaattcaa tggtgtttca tatattcaca atattgtaca atcaccacct 27361 gtatctagtt ccagaacagt ttcatccccc caaaaggaga ccccatagcc acgagcagtc 27421 cctccctttt gtcccctctt ccagcccaac aatcgctaat ccactttctg tctctctgaa 27481 tttgcctatt ctggacattt catattaatg ggatcctaca ccctgtggcc ttttgtgcct 27541 ggcttgtttc actgagcaca gtttcaaggc tcatgtagca cgcatcagaa cccccttcct 27601 ttttacagcc aaatcatatt ctgttgcctg gagggaccac attctgttca ttcattcatc 27661 ccctgatgga cagttgggtt gtttccacct ttggcttatt gtgaatattg ctgctgcaaa 27721 cgtgggtaca caaataactg tttgagtccc tgctcccaat tctcttgggt atctgtctag 27781 aagtggaact cctgggtcac atgtcagtga tctttcagat caataacatg tcttgttctg 27841 tatctgagcc taccagccag cccctccttc actattgcac agggttctct tgcagatatg 27901 cctcattctt tatccagacc aaacgctggt gcaacacact cagtctgaat tttggagata 27961 atcccccaat gcttccaaag tggttgcaac cacacacctg cctctaaccc ccacccctca 28021 atctgccttg acagggtagg attcaacctt caccttttct aacttcatgg gagaaatggc 28081 atctccatgt tggggtgagt cattacattt tcctgttgca ctgacaacaa ttttgtatct 28141 gccttgtagg ataattatta gcaaaaacat cactgtgtgc agaaaactgg gagagagatg 28201 agttcctatc atcttgttgt tggcttttta ggatcagatc tggttatcat tctctcttcc 28261 tccaacaaac accattatgt aactgcactg gtcaaccagt tgttcccaat gctaaagccc 28321 aatggattct tcctctgtgt acacaaaagt ttatttttct ttcaaaataa atgtttggtt 28381 ccattgtgcc tgcatcagaa accaaatctt tccctttctc ccactcaccg cggtctgaat 28441 acacctgaat ctcaattaga ctatctgcct caagcaggac tttctgcagg gcccaggtca 28501 catattttca gcattctggt cagccagaaa ggttctagct gaacttggac tgatgaaaac 28561 acagaatggg aagattccag gtcctcagtg ccctggagaa gaaaaactca cagccccgtc 28621 agtgaaggta accagagaag ttgaggggag aacggaatga agtgtgctct gggaggggag 28681 attggaggcc atggattggg atcggggtat gggacaatga gggatatgaa cagaagtgta 28741 agaaaagcag aaactgggga gattggaggc catggattgg gatcggggtg tgggacaatg 28801 agggatatga actgaagtgt aagaaaagca gaaactgggc caggcatgtt gaggcacgtc 28861 tgtaatccca gtgttttgag agaccaaggt agaaggattt cttgagccca ggggttcaag 28921 accagcctgg gcaacatagt gagaatccgt ttctacaaaa aagaaaaaag aaaaaaatta 28981 gcttggcatg ggggcacaca cctgttgttc cagctacttg ggaggctgag gtgggaggat 29041 cacttgagcc aggaagtcga ggctatgtgg taggctatga ttgcaccact gcactccagt 29101 ctgggcatca gaacgagact cccatctcaa caaaataaag taaaataaaa taataaaata 29161 ctagcatatt cctaaaccac aggtttagaa gcacagacct accaagaaat gagtctgaga 29221 cttttagaga gtgtgtaaag gtttgtctca aggtgtctgc aatctcctgc aaagaacaga 29281 gatctcaaga ccaagaaatt cccaacactg gcccaatcca accccaaagt aactatacag 29341 ctgatgctgt tagatccaag ctgaaaaggg aaggctcccc ttgaaatatc ggcacttacc 29401 agggtcatca agggcatctg ccaaatcaaa gtctctttga cctattggga gcaaacaggg 29461 cataagtcac cccatgatgg aaaggccagg atcccccttg ttcctcacct catgctgctc 29521 ccctcctcct gcaagcctag ggtcgcccag agcactaaaa ctaacctgac catgagctac 29581 gtgggagatg caaccaccac ctccacctag tcccagagca tactcatcac ccaaaagaag 29641 acccctgtcc ccattcagca gttagttccc ctcccgcctc cccagcccct gacaaccact 29701 atccactttc tgtctccatg aattgtctgt tctgcaaatt tcatataaat agaattctac 29761 aatatgtggt attttgtgcc tggcttcttt cattgagcat gatgctcttg agttgcgttc 29821 acgttgcaac ctgtgtcaga gcttcattcc tgctcatggc caggaaactt tgaaggaagc 29881 ctctctccaa gcacatagac ttctctcctg gactgtccca ctcacagctt aggacaggca 29941 tatccccgcc ccccaacgcc ccccccaccc ctccacacac gcacagagtc taggttgaga 30001 ggggctgccc tagaagaagg tgaagcacaa caatgtgagg gtcctggatg cctgagagac 30061 tatatggagc ataggacacc tttgccacct cacctggctg caacacaaac aagagataaa 30121 cttttaattc tttgggggct gtttgttata gcagctagcc tacactgaca aatacagggt 30181 caggaagcaa cctgtgtgga caagctgagg ttcctacctg atccaaaagc agttaatcta 30241 atgaatgtgt ctgcagcaga gagaagagaa cattcacatc tttgggacac atcattaaaa 30301 acacactgca gaatttgtga gggtgaaggg tccctcttgc tgacatatat gcacaccttg 30361 atcttgttcc tcacttgccc ctgccagtac tagccttgta gagaaaacac caccagagaa 30421 tttatcagga agtagaggat gcagccagga gcacattgag tcagcactta ctacgcattg 30481 agttctgttg cagtaatgcc aggcacccaa gagtctcatc tgcccaccca aggatgttgg 30541 tatgaccagc atgcatattt cccaaaggaa gaaactgagg ttctgggaag ttcagggcaa 30601 ggcaggagca gagtcagaat ctacagaatt gtccaggggg gacaatctct cactcaaaca 30661 ctaagccctg gaggaggcta agcctggggg aaccaggtga gccctaggtg ctgggtctcc 30721 cccgctctga tgtctgtctc cctcccaaaa atcggtctgc ctcggtgacg cttgctctgg 30781 gggaaggttc aagggctcca acatagattc gtataggctg tggcatggct ccagtcagat 30841 gccacccggg caggctttgc cgctccttgt ggaggcaatt agggcctctc cggagcagaa 30901 gacgcatccc cagacaggaa tccctaagtc aggacaaagt cacaaacctt ggagggaagg 30961 ggaactctga gcccagggtg cacacagcag ctgcccctgg gcccagactg gaggctggga 31021 gctggcggtg ccatcacagg tcaaatccca cacggccgtt tctctcaaac tctgggagcc 31081 cccacgtccc tctcagatga cgttctctgt taccagcgat cactacaggc tcgcctttaa 31141 gaacaaccca gcaaatcttt gctccctcca gctctcttct tattttttct ttattgttgt 31201 tttctttttg tttgttttgt ttttagatgg cgtctctctc tgtcgcctag gctggagtgc 31261 agtggcacaa tctcagccca ctgcaactcc cacctcctgg gttcaaacga ttcttgtgcc 31321 tcagccgtct gagtaactgg gattacaggt gcatgccatc acgcccagct aacttttgta 31381 ttttcagtag agacagggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca 31441 agtgatccac ctgccttggc ctcccaaagt gctgggatta caggcgtgag ccaccgcacc 31501 tggcctcctc cagcgctctt gatcatcata gtccaagctg ctcagactgg gctgggggac 31561 aggaaccgtg tcagcctctt ggatctgtgc aacatctccc ctccaaggct gccagcagtt 31621 ccagtcgaga cacctgtcca ggacgccaaa gttaagcgtc ccggctatgc acacaccctg 31681 agaatttaca ttccaatctg tgttcgccgt tgaccaatat ctttcactgt gttatgtctc 31741 cgcccatccc agcaatccca ggcatcccct cagcaaccac ttaatctcct ccaatctgtg 31801 cacccggcca ctccctgctg ttgccctaaa taaccaaagg cctcggtgtt gtctctcccc 31861 tcccagtccc tcctctctga aatgctgttg cataaaacag gcaggtggca cgggagggct 31921 gcagacagct cgtctcatcc tgcagcagga ggccaggttg cagcggaggc tccttcagct 31981 ttgagatgct ccggatgacc ctgagagatg acaggaattg ggggtgggga gggtcacaga 32041 ggaagtggag cctgggggct ttaggctttg cttggtttca ggggttgcac tacagccaga 32101 agttgctgca cgcgtgactc acacacacga ggccgttgtt ggtcacatcc agacctcgtt 32161 acataaacca ggcggactcc ggccagcatc agacggcctg ggcactgaga cagtctccag 32221 tcctgcatgc caggcagcaa atctatctcc caggcagggc cccctgtcct ctctcctctc 32281 cagggaagtt gcagcaaagc cacgcctggg gtcctcacgg gtctggagta agacccctgg 32341 gctgggagat gtcccaggta gacaacgcta actctggggt ctctgtctcc aaggtcgggt 32401 tgcacatgac gccattgctt tcacggggtt tattttccac tgggaacata agacggggca 32461 ggaacaggag atctatttct gatttgcttt gcttatagtt agtgaactga gcggctttta 32521 atcaaatcct tgtggtctgg gatccctaaa cccccaagaa gaaagcagcc tcattttgtg 32581 ctgggactat gacaacggtc ctcggttgct ggcatcctgg agatgtcgat taagtggctc 32641 ctgttggtgg atgggtgggc cgagcgttac acatgagccc ccggtaaagc tgtgctcggg 32701 acccttcgga gcaaagccat gtgatacagc actttgaagc attcttctgt agcctgcagt 32761 caccaaaccc ttctggggcc caaattccac ccctgccgga ttcaaagatg cttcttattg 32821 gagttcccct gcctcccacc caaattcttt atgttgattc agacctgcac gtggttcagg 32881 cttatgaaac aataccgcca agtgcaggcc tcagcagcag cctcagaagc agaagtttct 32941 ctcggaactt ctccagcccc catgtctctg agtcccattc tcccctaagg caccgaagga 33001 actagaatcc ctcttcccca agacgggtcc cagaaacaag aacgcctttc cccccgaagc 33061 cagccataaa acctaaaaac aggaatctaa ctttccctct atcctatctg tataaagagt 33121 ggcacaggct gggcgcggtg gctcacacct gtaatcccag cactttggga ggccgaggca 33181 ggcggatcac gaggtcagga gatcgagacc atcctggcta acacggtgaa accccgtctc 33241 tactaaaaat acaaaaaaat tagccgggtg tggtggcggg cgcctgtagt cccagctact 33301 cgggaggctg aggcaggaga atggcgtgaa cctgggaggc ggagcttgca gtgagccgag 33361 ttcacaccag tacactccag cctgggtgac agagcaagaa tctgtcttga aaaaaaaaaa 33421 aaaaaaaagt ggccataagg aaattccctg acctgccttg tttgggcgtc ctaagacccc 33481 catcccagag aagctcccag ccccataccc ggaaggaagg agcactgttc agagaggcca 33541 agaagaggcc gggtgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggag 33601 ggcagatcac aaggtcaaga gatcgagacc atcctggcca acatggtgaa accccgtctc 33661 tactaaaaat acaaaactta gctgagcgtg gtggtgcatg cctgtaatcc cagctactca 33721 ggaggctgag gcaggagaat tgcttgaacc tgggaggtgg agattgcagt gagccaagac 33781 tgtgccactg cactccagcc tggcaacaca gtgagactcc gtctcaaaaa aaaagagaga 33841 gagagaggcc aagaagaatc tagacacaca ggcctggctg ggtttcccca ctcaggccat 33901 tagcattgga tcaggccctt tttgtccaac cctatttcta cacagctgtc cagacttgga 33961 tgaaccaaag cataaatata gacaatctcc ccttgtaact tggggtcttc attctgaatg 34021 ctcccgtgta tacacattaa atacatttgt atgtcttttc tccagttaat aaatctgctt 34081 catgtccatg attttcagta atgcttcagg ggccacgacc ccagaaagtc aatataacaa 34141 acagcaattt tgcaaagcaa agaacagtac cattgctcga ataataatgc aatgtcgagg 34201 ccactgccca ctgaatatta gctgccacca gcacagctgc cacgcagcac aggagagcaa 34261 tggacccaca gcaaaacctc tgctccccac tgaattattt acctcgcttt aaagaagaaa 34321 atcagctccc agcaggctct gagcagaaaa tgaaaaggaa cacacagaga ctatctaaat 34381 ccctgacccc aattacagct attgccagca gaatttatac accacggaac tgaattagag 34441 ccaactgcaa tcattacggg acgtgcaacc acattagtaa attcagacag gctcttctgt 34501 ttcattattc aatccaatgt cctttgatct gcgtgccctg gtgaaacaaa gccctttcaa 34561 gcaattgttt ccttccttcc ttccttcctt ccttccttcc ctccctccct ctttctctct 34621 ttctctcata cagaacctca ctatgttgcc caggctgcag tgcaatggcg caatctctgt 34681 tcactgcaac ctctgccccc caggttcaat tgattctccc gcctgcctca gcctcctgag 34741 tagctgggat tacaggcgcc ccctactatg cccaaccaat tttttttatt tttagtagag 34801 atggggtttc accatgttgg ccaggctggt ctcaaactcc tgacctcgtg atccacccac 34861 cttggcctcc caaagtgcgg agattacagg catgagccgc cgtgcctagc cttttttttt 34921 tttttttttt ttttttggat aaagaatctc actatgttgc ccaggctgca gtgcaatggc 34981 gcactctcgg ctcactacaa cccctgcctc cctggttcga gcgcttctag tgcctcagtc 35041 tcctgagtag ctgggactac aggtgcatgc cactacgccc aggtaatttt tgtgttttta 35101 gtaatgacgg ggtttcacca tgttggccag gatggtcttg aactcctggc atgaagtgat 35161 tcacccgcct cggcctccca aagtgctgta attacaggca tgagccaccg cccccggcct 35221 attttttatt tctttacatc ggattgtact ggtttcctct gaagtgtggg ttcatttaaa 35281 tcaactttct tgggtcagct gctgcagatg aagccactta gtgggtacta agcccataga 35341 gccctcctca gcttgaggaa gcccacagag gggaaagttc cgaggtttta caaagctggc 35401 acagggaccc cgagaaggac aaatcttgac ttccttcagg caaatcctaa ggtttctctt 35461 tgactccagt tcagcaatta aaaaatacct atgtgtatat atagaaagca tatgataatg 35521 gttcttgtga ctgaacttgt gaccggaaag aggtctggaa tcagacccca ggacagggtt 35581 cttggacctt gcacaagaaa gaatttgcgg cgagtccata aaatgaaagc aagtttatta 35641 gaaaagcaga ggaataaaga atggctcctc catagacgga gcagccccga gggccactgg 35701 ttgcccattt gtatggttct ttcttgagga tatgctaaac aaggagtggc ccattcatgc 35761 ctcctctttt tagaccatat agggtaactt cctgacattg ccgtggcatt tgtaaactgt 35821 catggtgctg gtgggaagac accagtgagg accacccgag gtcactctcg tcgccatctt 35881 ggatttggta aaatttggcc gacttcttta cggcaagctg tttcatcagc aaggtcttta 35941 tgacttgttt cttgtgtcaa cctcctatct catcctgtgc cttagaatgc cttgaccatc 36001 cgggaatgca gcccggcagc tctcggcctc attttacaca gcccctactc aacctggagt 36061 tgccctggtt cgaacacctc ccactaactc aacttaggtc caccggccct gtgcagtaaa 36121 gccgaacacc gccattggga ttgcagcgtg aggaagtgag gcgtttattc cgagctattt 36181 agagggcatc aagcaacctc cttgatgcct caagacccaa cctcctgggt ggccagtagc 36241 tgagggtttt caatggcaag gaggcagagg ttacaggcaa agccataata catgcaggct 36301 atacattgct ttgatctaaa aaggcgggat acctggaagc aggggcttaa tgtggattca 36361 aagtttctct gatttgtcat tggttaagga ggccaagttt gtctgagcca tttggggtca 36421 gcagaaatca atgttagttg tggccattgg tgtgacttcc tccagcacgt cccctcaccc 36481 cccttccacc cccctcccac cgccacatgc cccagggaag gaatttagaa caaacatttg 36541 taatgagaat tcaggcctca gtttctctta cccaacgtct atgagccagc agatggcatt 36601 ttttatttgg tgggggtctg ggttcctgaa aatcaactca gggccatata gtaagatgtt 36661 atcttggctg ggcatggtgg ctcacacccg caatcccagc actttgggag gccagggcag 36721 gcagattacc tgaggtcagg agttccagac cagactggcc aacatgggga agccctgtgt 36781 ctactaaaaa tacaaaaatg tgggcgtggt ggtgtgcacc tgtagtccca gctacttggg 36841 aggctgatgc atgagaatca cttgaaccca ggaggtgctg gaggttgcac tgagccaaga 36901 ttgcaccact acactgcaac ctgggtgaca gagcaagact ccatctcaaa aaaggatgtt 36961 atctttagtc tttacaggaa actaaacata ttgaggctct aacttccttg gctattgttc 37021 tacactacaa tcaccttctt gcttatcaag ctgctccatg tacttctcaa ggccagcgag 37081 gtgcctggaa tttcccttga aggaactcaa ggttttcctt tatttccatg ctagggagag 37141 tgcctggcag gcccctaaga gggttgtcct tgctccatct cactctctcc caggacggca 37201 tgattcacag agtaaaaaaa attaaatcat atgactaggt taggaccgca ggttgcaaca 37261 gaaagtgaca aggaaggctg ctgccccaat ggggaatctt attcccgggg ggcttgtttg 37321 ccaagggctg aattctactt tagccccatg gataagatga aatccattct gttctctaat 37381 tggttaagaa gtttatattt ccagcaggac aggaaatcaa acacagctgg agtctcctcc 37441 tccacacctc aacatgaata tccagggact ccctaatgct caagctcacc ggggctgcag 37501 acaccagagc tgagcaggcc gccctggtgc atgatctgct cctcggaacc catgaaccct 37561 ccacggtctc agcatccaag cagcccagcc tctccccaga aggtcaacca actggccccg 37621 ttcttgactt cccagtgacc atattatgca gttgcagatt gaagctctat tagagcagac 37681 attggtaatg agaattcagg cctcagtgtc tgtctgtaac ccaacagacg gtgtctgcag 37741 agatcgaagt attttgtcgt cgaagaggaa ggaatgatca ttcatcacaa aaagcaagac 37801 atctttggtg caaggaaaac tcgaggaaaa taccgcagac catgcaatga ggcactggtt 37861 gacggtgtgt tataaacccg tcttcccaga gtggcatgca cacggatccc tcaggacatg 37921 ggtgacacac agactatgct tcagcaggtc tgtctgggcc caagacacag tgtttctcat 37981 cagctcccag gggatgtcaa ggctgcagat ccatggatct cactttgcag gacagagact 38041 tggtaatggc ttcccagagt tgttacaatg caatcccaaa gactgggcag cttaaacaac 38101 aaccttgatt ctcccacagt cctggaagct ggaagtctga gatcaaggtg tgggcagggc 38161 cggttcctcc tgagtcctct ctcctgggct tgtagatgcc gtcttctccc tgagtcccca 38221 cgtggtcatc cctctgtgtg cgtctgtgtc ctcatctcct cttcttatga ggtgtcttag 38281 tccatttcag gctgctgtca cagcatacca tagactgggt ggcttataag caacagacat 38341 tgattctccc acagccctgg aggctggacg tcttgagatc aggatatggg caaggctgtt 38401 tcctcctgag gcctctgtcc tgggcttgta gacaccatct tctccctgtg tccccacgtg 38461 gtcatccctc tatgtgcatg tctgtgtcct catctgctct tcttatgaga tgtcttagtc 38521 cattgcaggc tgctatcaca gaataccata ggctgggtgg cttacaaacc acagactttt 38581 attctcccac agtcctggag gctggaattc tgagatcaag gcatgggcag agctggttcc 38641 tcctgaggcc tctcttcttg acttgtagac accctcttct ccctgtgtct ttacagggtc 38701 atccctctgt gtgtgtctgt gtccttatct actcttttta taaggacccc agtcctattg 38761 gatcagggca caacctcctg agctcattgt acccttctca cctctttaaa gaccccatca 38821 ccaaacacag tcatgttctg aggtcctagg gattaggcct tcaatatata aattttggag 38881 acgcacaatt caacccttac agaggtaact cattcattgg aatacagatg tgtttgtcca 38941 ctgtggctgc tgccatagca aagtcccaca gaccaggctg tttaaacagt ggacattgat 39001 tctcccacag gcctggaagc tggaagtctg agatcaaggt gtgggcaggg ctggttccta 39061 ctgagacctc tctccttggc ttgtggatgc catcttttcc ctaagtcctc acaggatcgt 39121 ccctgtgtgc atgtctgtgt cttcatctcg tctttttata aggtctgcag tcctgttgga 39181 tcagggccca acctagtgac ctcattttgc ctgaatcacc tctttttttt tttttttttt 39241 tttgagacag agcctcgctc tgtcgcccag gctggagtgc agtagtgcga tctcggctca 39301 ctgcaacctc cacctcccag gttcaagcga ttctcctgcc tcagcctccc aagtagctgg 39361 gactacaggc acgcccagct aatttttttt ttttttttgt atttttagta gagatggggt 39421 ttcaccgtgt tagccaggat gatctccatc tcctgacctc atgatctgcc agcctcagcc 39481 tcccaaagtg ctgggattac aggcatgagc caccgcaccc agcctgaatc acctctttaa 39541 agaccccatc tccaaacaca gtcacaattg gaggtcctgc aggttaggac ttccatttgt 39601 aaaattgagg agggtacaat ttggctcaca acagatttaa accatggaac atggtcaatt 39661 acaatgggga tgccattgga cagcctggtt cagtggagat aaccagggtt tgggaatcta 39721 gagagacagt ttcatctcag caacatcacg cgcctggcct acgctctcaa taatactctt 39781 ccttaatttt ttttttctga gacagagtct cgctctgttg cccaggctgg agtgcagtgg 39841 tgcaatctca gctcactgca gtgcaatggc acaatctcag ctcactgcaa cctctgcctc 39901 ccaggttcaa gcagttctct cacctcagcc tcccgagtag ctgggattac aggcacacgc 39961 ctccatgcct ggatcagacc agaatccagc cacatgcaga ttccagatca gcccagaatc 40021 tgattctgaa gcattcccag aggttgggat tcacagacta cagcagagtt tcccagcctc 40081 agcaccgtgg ctgcacacat cgctcacagg tttaggtgcc tctggcccct gatccatcgg 40141 gggaatcccc aggagctttg ccgtgtctgg cagagcccca cctgcaggca gaaccccact 40201 tttctacccc ctgtgtgcca atgagaaaga ggaaaatggc tggaatgagg ggggccctca 40261 caggaagggt cagttgttat ccaagaaagg ggagacattt cttggactcc gtgcttgtct 40321 gctaattggc tccaatattt gccagatgtc ttcacactca ggtgccaaac agccatagac 40381 tttttctgca cagccccttc ctactccaag aaggaaacta tggaatgctc agcttctggg 40441 ttatatgtgc catggctcta tgccctatgg ggaaaagatc ctacaagtgc attctgacag 40501 taagatccat ttagaaatgc ccagacaact ataccgtacc tcctatactg tacaggacaa 40561 ctatacctcc agacaactat acctccagac aactatacct cctacagcgt acaggagcat 40621 gacatttcca acctgtcccc tttcgcaggg ttagaagtta taactaacag caggttctcc 40681 actacagcac taatgacact tggggctgcg taactctctg tcatgggtgc tgtcctgtgc 40741 actgtaaggt gttgaacaac atgccttgtc tccacccacc gaatgcgaga acacccatcc 40801 cagtgcaact accaaaattg tttccagaca ttgccaagtg ttacttgagg aacacagtta 40861 tcccagggta gcagaagaat gtcatagata gatgatggac tgctagataa atagatagat 40921 tgatagatgg atggatagat acatagatag atacatagat acatagatag gtaatagaga 40981 tgagagttgg atagagaagt aggtagaaag atagatagat aaatagatag ataatagatg 41041 acagaaaatc attgacagat agattaggta gatgataggt gtatataata gagacagata 41101 ggtcgatgga taatggtaga cgatagatag atctagagat aggcagacag acag // LOCUS HSCCR5AB2 6059 bp DNA PRI 03-JAN-1998 DEFINITION Homo sapiens CC chemokine receptor 5 (CCR5) gene, complete cds. ACCESSION AF031237 NID g2739497 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6059) AUTHORS Mummidi,S., Ahuja,S.S., McDaniel,B.L. and Ahuja,S.K. TITLE The human CC chemokine receptor 5 (CCR5) gene. Multiple transcripts with 5'-end heterogeneity, dual promoter usage, and evidence for polymorphisms within the regulatory regions and noncoding exons JOURNAL J. Biol. Chem. 272 (49), 30662-30671 (1997) MEDLINE 98049523 REFERENCE 2 (bases 1 to 6059) AUTHORS Mummidi,S., Ahuja,S.S., McDaniel,B.L. and Ahuja,S.K. TITLE Direct Submission JOURNAL Submitted (23-OCT-1997) Medicine, University of Texas Health Science Center at San Antonio, 7703, Floyd Curl Drive, San Antonio, TX 78284, USA FEATURES Location/Qualifiers source 1..6059 /organism="Homo sapiens" /db_xref="taxon:9606" gene join(AF031236:1..1976,1..6059) /gene="CCR5" mRNA join(1..57,794..847,2751..6059) /gene="CCR5" /product="CC chemokine receptor 5B" mRNA join(1..57,559..847,2751..6059) /gene="CCR5" /product="CC chemokine receptor 5A" mRNA join(750..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(773..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(777..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(779..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(784..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(795..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(798..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(804..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(805..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" mRNA join(806..847,2751..6059) /gene="CCR5" /product="CCR5 truncated isoform" CDS 2762..3820 /gene="CCR5" /codon_start=1 /product="CC chemokine receptor 5" /db_xref="PID:g2739499" /translation="MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFG FVGNMLVILILINCKRLKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMCQ LLTGLYFIGFFSGIFFIILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFAS LPGIIFTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGIL KTLLRCRNEKKRHRAVRLIFTIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLD QAMQVTETLGMTHCCINPIIYAFVGEKFRNYLLVFFQKHIAKRFCKCCSIFQQEAPER ASSVYTRSTGEQEISVGL" BASE COUNT 1753 a 1210 c 1393 g 1703 t ORIGIN 1 cttcagatag attatatctg gagtgaagga tcctgccacc tacgtatctg gcatagtgtg 61 agtcctcata aatgcttact ggtttgaagg gcaacaaaat agtgaacaga gtgaaaatcc 121 ccactaagat cctgggtcca gaaaaagatg ggaaacctgt ttagctcacc cgtgagccca 181 tagttaaaac tctttagaca acaggttgtt tccgtttaca gagaacaata atattgggtg 241 gtgagcatct gtgtgggggt tggggtggga taggggatac ggggagagtg gagaaaaagg 301 ggacacaggg ttaatgtgaa gtccaggatc cccctctaca tttaaagttg gtttaagttg 361 gctttaatta atagcaactc ttaagataat cagaattttc ttaacctttt agccttactg 421 ttgaaaagcc ctgtgatctt gtacaaatca tttgcttctt ggatagtaat ttcttttact 481 aaaatgtggg cttttgacta gatgaatgta aatgttcttc tagctctgat atcctttatt 541 ctttatattt tctaacagat tctgtgtagt gggatgagca gagaacaaaa acaaaataat 601 ccagtgagaa aagcccgtaa ataaaccttc agaccagaga tctattctcc agcttatttt 661 aagctcaact taaaaagaag aactgttctc tgattctttt cgccttcaat acacttaatg 721 atttaactcc accctccttc aaaagaaaca gcatttccta cttttatact gtctatatga 781 ttgatttgca cagctcatct ggccagaaga gctgagacat ccgttcccct acaagaaact 841 ctccccggta agtaacctct cagctgcttg gcctgttagt tagcttctga gatgagtaaa 901 agactttaca ggaaacccat agaagacatt tggcaaacac caagtgctca tacaattatc 961 ttaaaatata atctttaaga taaggaaagg gtcacagttt ggaatgagtt tcagacggtt 1021 ataacatcaa agatacaaaa catgattgtg agtgaaagac tttaaaggga gcaatagtat 1081 tttaataact aacaatcctt acctctcaaa agaaagattt gcagagagat gagtcttagc 1141 tgaaatcttg aaatcttatc ttctgctaag gagaactaaa ccctctccag tgagatgcct 1201 tctgaatatg tgcccacaag aagttgtgtc taagtctggt tctctttttt ctttttcctc 1261 cagacaagag ggaagcctaa aaatggtcaa aattaatatt aaattacaaa cgccaaataa 1321 aattttcctc taatatatca gtttcatggc acagttagta tataattctt tatggttcaa 1381 aattaaaaat gagcttttct aggggcttct ctcagctgcc tagtctaagg tgcagggagt 1441 ttgagactca cagggtttaa taagagaaaa ttctcagcta gagcagctga acttaaatag 1501 actaggcaag acagctggtt ataagactaa actacccaga atgcatgaca ttcatctgtg 1561 gtggcagacg aaacattttt tattatatta tttcttgggt atgtatgaca actcttaatt 1621 gtggcaactc agaaactaca aacacaaact tcacagaaaa tgtgaggatt ttacaattgg 1681 ctgttgtcat ctatgacctt ctctgggact tgggcacccg gccatttcac tctgactaca 1741 tcatgtcacc aaacatctga tggtcttgcc ttttaattct cttttcgagg actgagaggg 1801 agggtagcat ggtagttaag agtgcaggct tcccgcattc aaaatcggtt gcttactagc 1861 tgtgtggctt tgagcaagtt actcaccctc tctgtgcttc aaggtccttg tctgcaaaat 1921 gtgaaaaata tttcctgcct cataaggttg ccctaaggat taaatgaatg aatgggtatg 1981 atgcttagaa cagtgattgg catccagtat gtgccctcga ggcctcttaa ttattactgg 2041 cttgctcata gtgcatgttc tttgtgggct aactctagcg tcaataaaaa tgttaagact 2101 gagttgcagc cgggcatggt ggctcatgcc tgtaatccca gcattctagg aggctgaggc 2161 aggaggatcg cttgagccca ggagttcgag accagcctgg gcaacatagt gtgatcttgt 2221 atctataaaa ataaacaaaa ttagcttggt gtggtggcgc ctgtagtccc cagccacttg 2281 gaggggtgag gtgagaggat tgcttgagcc cgggatggtc caggctgcag tgagccatga 2341 tcgtgccact gcactccagc ctgggcgaca gagtgagacc ctgtctcaca acaacaacaa 2401 caacaacaaa aaggctgagc tgcaccatgc ttgacccagt ttcttaaaat tgttgtcaaa 2461 gcttcattca ctccatggtg ctatagagca caagatttta tttggtgaga tggtgctttc 2521 atgaattccc ccaacagagc caagctctcc atctagtgga cagggaagct agcagcaaac 2581 cttcccttca ctacaaaact tcattgcttg gccaaaaaga gagttaattc aatgtagaca 2641 tctatgtagg caattaaaaa cctattgatg tataaaacag tttgcattca tggagggcaa 2701 ctaaatacat tctaggactt tataaaagat cactttttat ttatgcacag ggtggaacaa 2761 gatggattat caagtgtcaa gtccaatcta tgacatcaat tattatacat cggagccctg 2821 ccaaaaaatc aatgtgaagc aaatcgcagc ccgcctcctg cctccgctct actcactggt 2881 gttcatcttt ggttttgtgg gcaacatgct ggtcatcctc atcctgataa actgcaaaag 2941 gctgaagagc atgactgaca tctacctgct caacctggcc atctctgacc tgtttttcct 3001 tcttactgtc cccttctggg ctcactatgc tgccgcccag tgggactttg gaaatacaat 3061 gtgtcaactc ttgacagggc tctattttat aggcttcttc tctggaatct tcttcatcat 3121 cctcctgaca atcgataggt acctggctgt cgtccatgct gtgtttgctt taaaagccag 3181 gacggtcacc tttggggtgg tgacaagtgt gatcacttgg gtggtggctg tgtttgcgtc 3241 tctcccagga atcatcttta ccagatctca aaaagaaggt cttcattaca cctgcagctc 3301 tcattttcca tacagtcagt atcaattctg gaagaatttc cagacattaa agatagtcat 3361 cttggggctg gtcctgccgc tgcttgtcat ggtcatctgc tactcgggaa tcctaaaaac 3421 tctgcttcgg tgtcgaaatg agaagaagag gcacagggct gtgaggctta tcttcaccat 3481 catgattgtt tattttctct tctgggctcc ctacaacatt gtccttctcc tgaacacctt 3541 ccaggaattc tttggcctga ataattgcag tagctctaac aggttggacc aagctatgca 3601 ggtgacagag actcttggga tgacgcactg ctgcatcaac cccatcatct atgcctttgt 3661 cggggagaag ttcagaaact acctcttagt cttcttccaa aagcacattg ccaaacgctt 3721 ctgcaaatgc tgttctattt tccagcaaga ggctcccgag cgagcaagct cagtttacac 3781 ccgatccact ggggagcagg aaatatctgt gggcttgtga cacggactca agtgggctgg 3841 tgacccagtc agagttgtgc acatggctta gttttcatac acagcctggg ctgggggtgg 3901 ggtgggagag gtctttttta aaaggaagtt actgttatag agggtctaag attcatccat 3961 ttatttggca tctgtttaaa gtagattaga tcttttaagc ccatcaatta tagaaagcca 4021 aatcaaaata tgttgatgaa aaatagcaac ctttttatct ccccttcaca tgcatcaagt 4081 tattgacaaa ctctcccttc actccgaaag ttccttatgt atatttaaaa gaaagcctca 4141 gagaattgct gattcttgag tttagtgatc tgaacagaaa taccaaaatt atttcagaaa 4201 tgtacaactt tttacctagt acaaggcaac atataggttg taaatgtgtt taaaacaggt 4261 ctttgtcttg ctatggggag aaaagacatg aatatgatta gtaaagaaat gacacttttc 4321 atgtgtgatt tcccctccaa ggtatggtta ataagtttca ctgacttaga accaggcgag 4381 agacttgtgg cctgggagag ctggggaagc ttcttaaatg agaaggaatt tgagttggat 4441 catctattgc tggcaaagac agaagcctca ctgcaagcac tgcatgggca agcttggctg 4501 tagaaggaga cagagctggt tgggaagaca tggggaggaa ggacaaggct agatcatgaa 4561 gaaccttgac ggcattgctc cgtctaagtc atgagctgag cagggagatc ctggttggtg 4621 ttgcagaagg tttactctgt ggccaaagga gggtcaggaa ggatgagcat ttagggcaag 4681 gagaccacca acagccctca ggtcagggtg aggatggcct ctgctaagct caaggcgtga 4741 ggatgggaag gagggaggta ttcgtaagga tgggaaggag ggaggtattc gtgcagcata 4801 tgaggatgca gagtcagcag aactggggtg gatttggttt ggaagtgagg gtcagagagg 4861 agtcagagag aatccctagt cttcaagcag attggagaaa cccttgaaaa gacatcaagc 4921 acagaaggag gaggaggagg tttaggtcaa gaagaagatg gattggtgta aaaggatggg 4981 tctggtttgc agagcttgaa cacagtctca cccagactcc aggctgtctt tcactgaatg 5041 cttctgactt catagatttc cttcccatcc cagctgaaat actgaggggt ctccaggagg 5101 agactagatt tatgaataca cgaggtatga ggtctaggaa catacttcag ctcacacatg 5161 agatctaggt gaggattgat tacctagtag tcatttcatg ggttgttggg aggattctat 5221 gaggcaacca caggcagcat ttagcacata ctacacattc aataagcatc aaactcttag 5281 ttactcattc agggatagca ctgagcaaag cattgagcaa aggggtccca tataggtgag 5341 ggaagcctga aaaactaaga tgctgcctgc ccagtgcaca caagtgtagg tatcattttc 5401 tgcatttaac cgtcaatagg caaagggggg aagggacata ttcatttgga aataagctgc 5461 cttgagcctt aaaacccaca aaagtacaat ttaccagcct ccgtatttca gactgaatgg 5521 gggtgggggg ggcgccttag gtacttattc cagatgcctt ctccagacaa accagaagca 5581 acagaaaaaa tcgtctctcc ctccctttga aatgaatata ccccttagtg tttgggtata 5641 ttcatttcaa agggagagag agaggttttt ttctgttctt tctcatatga ttgtgcacat 5701 acttgagact gttttgaatt tgggggatgg ctaaaaccat catagtacag gtaaggtgag 5761 ggaatagtaa gtggtgagaa ctactcaggg aatgaaggtg tcagaataat aagaggtgct 5821 actgactttc tcagcctctg aatatgaacg gtgagcattg tggctgtcag caggaagcaa 5881 cgaagggaaa tgtctttcct tttgctctta agttgtggag agtgcaacag tagcatagga 5941 ccctaccctc tgggccaagt caaagacatt ctgacatctt agtatttgca tattcttatg 6001 tatgtgaaag ttacaaattg cttgaaagaa aatatgcatc taataaaaaa caccttcta // LOCUS HSCENPB 3717 bp DNA PRI 17-FEB-1997 DEFINITION Human hCENP-B gene for centromere autoantigen B (CENP-B). ACCESSION X55039 NID g29860 KEYWORDS centromere associated protein; centromere autoantigen B; chromosomal protein; hCENP-B gene; helix-loop-helix protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3717) AUTHORS Sullivan,K.F. TITLE Direct Submission JOURNAL Submitted (29-JAN-1991) K.F. Sullivan, Department of Molecular Biology, Research Institute of Scripps Clinic MB6, 10666 N Torrey Pines Road, La Jolla CA 92037, U S A REFERENCE 2 (bases 1 to 3717) AUTHORS Sullivan,K.F. and Glass,C.A. TITLE CENP-B is a highly conserved mammalian centromere protein with homology to the helix-loop-helix family of proteins JOURNAL Chromosoma 100 (6), 360-370 (1991) MEDLINE 91372020 REFERENCE 3 (bases 1 to 3717) AUTHORS Earnshaw,W.C., Sullivan,K.F., Machlin,P.S., Cooke,C.A., Kaiser,D.A., Pollard,T.D., Rothfield,N.F. and Cleveland,D.W. TITLE Molecular cloning of cDNA for CENP-B, the major human centromere autoantigen JOURNAL J. Cell Biol. 104 (4), 817-829 (1987) MEDLINE 87166180 COMMENT See also X55038 and X05299. FEATURES Location/Qualifiers source 1..3717 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral lymphocytes" /clone_lib="lambda EMBL 3 genomic" gene 936..2735 /gene="hCENP-B" CDS 936..2735 /gene="hCENP-B" /codon_start=1 /product="centromere autoantigen B (CENP-B)" /db_xref="PID:g29861" /db_xref="SWISS-PROT:P07199" /translation="MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLST ILKNKRAILASERKYGVASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKE KALRIAEELGMDDFTASNGWLDRFRRRHGVVSCSGVARARARNAAPRTPAAPASPAAV PSEGSGGSTTGWRAREEQPPSVAEGYASQDVFSATETSLWYDFLPDQAAGLCGGDGRP RQATQRLSVLLCANADGSEKLPPLVAGKSAKPRAGQAGLPCDYTANSKGGVTTQALAK YLKALDTRMAAESRRVLLLAGRLAAQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVK GHYRQAMLLKAMAALEGQDPSGLQLGLTEALHFVAAAWQAVEPSDIAACFREAGFGGG PNATITTSLKSEGEEEEEEEEEEEEEEGEGEEEEEEGEEEEEEGGEGEELGEEEEVEE EGDVDSDEEEEEDEESSSEGLEAEDWAQGVVEAGGSFGAYGAQEEAQCPTLHFLEGGE DSDSDSEEEDDEEEDDEDEDDDDDEEDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDR VQSHILHLEHDLVHVTRKNHARQAGVRGLGHQS" conflict 2683 /gene="hCENP-B" /citation=[3] /replace="t" conflict 2709 /gene="hCENP-B" /citation=[3] /replace="c" conflict 2713 /gene="hCENP-B" /citation=[3] /replace="t" conflict 3209..3210 /citation=[3] /replace="cca" conflict 3224..3226 /citation=[3] /replace="ct" conflict 3237..3239 /citation=[3] /replace="gt" conflict 3496 /citation=[3] /replace="n" polyA_signal 3520..3525 BASE COUNT 705 a 1133 c 1214 g 665 t ORIGIN 1 tagatttaca actgtaatgc tgattttaaa aataacttaa gtgtgtaaca ataggttaca 61 tacgctagtg ttcattcata gctggaatgc tacacaatca ttaaaaagcg tgttggcaat 121 gatacttacc gatatggcaa aattctcact acttgatgcc aggtaaaaca gcaggagatg 181 ccagatatac agtctataca gtctgatgcc agttttatat aataaatttt taaaaagaaa 241 acgacatgca tctagagaaa agactggaag gaaatgtatc aaaatgttac ctgtgcatct 301 ctccaggtga cgtgaaaagc gtttttcaac ttcttgaatt tttcaggttc cgcaaaaaaa 361 aaagtgcgtt atttttacct taattaaaaa aaaaaaaaac aaacaaaaac cctggttctt 421 attctttttg agcagcaagt cgctgtccgc cgggcgcagt agcgatgcac cgcgtccccg 481 cccgcgtcgc tgccgggtcg ccggaggccg cagcacgggc aggtccagca ggccgcagcg 541 cccccccgcc ggtatgtgcc cggcggcggg aagggggtgg gcgcggaggg cggggaggag 601 gcgccgggcg tccccgcgct tcccgcgaga tcccgctccg ccccgctcgc cgcgcgtccc 661 agctccccgc gccggccact tcctgccttc cgcgcccgcg ccgcccgccc tcgtctgcgc 721 ccgtcgcctg ccgcccgccg cccgggacgc ggcccgccgg cgtccccgga ggtgcccggc 781 ccggccgggt cgtcgccccg ccgccgcgcc gcgagccgct ttgtctcggg cggggcgcgc 841 gggagaggcc gccaggtgcc ccccgccacg ggcccgggcc cccgccccgg ggcggcggcg 901 gcggcgccgg gccccggggc ggggggcgcg ccgggatggg ccccaagagg cgacagctga 961 cgttccggga gaagtcacgg atcatccagg aggtggagga gaatccggac ctgcgcaagg 1021 gcgagatcgc gcggcgcttc aacatcccgc cgtccacgct gagcacgatc ctgaagaaca 1081 agcgcgccat cctggcgtcg gagcgcaagt acggggtggc ctccacctgc cgcaagacca 1141 acaagctgtc tccctacgac aagctcgagg gcttgctcat cgcctggttc cagcagatcc 1201 gcgccgccgg cctgccggtc aagggcatca tcctcaagga gaaggcgctg cgcatagccg 1261 aggagctggg catggacgac ttcaccgcct ccaacggctg gctggaccgc ttccgccggc 1321 gccacggcgt ggtgtcctgc agcggcgtgg cccgcgcccg cgcgcgaaac gctgcccccc 1381 gcaccccggc ggcgcctgcc agtccggccg cggtgccctc ggagggcagt ggcgggagca 1441 ctactggttg gcgcgctcgg gaggagcagc cgccgtcggt ggccgagggc tacgcctcgc 1501 aggacgtgtt cagcgccacc gagaccagtc tatggtacga cttcctgccc gaccaggccg 1561 cggggctgtg cggaggcgac ggacggccgc gtcaagccac ccagcgcctg agcgtcctgc 1621 tatgcgccaa tgccgacggc agcgagaagc tgcccccgct ggtggccggc aagtcggcca 1681 agccccgcgc aggccaagcc ggcctgccct gcgactacac cgccaactcc aagggtggtg 1741 tcaccaccca ggccctggcc aagtacttga aggccttgga cacccgaatg gctgcagagt 1801 ctcgccgggt cctgctgttg gccggccgct tggctgccca gtccttggac acctcgggcc 1861 tgcggcatgt gcagctggcc ttcttccctc ccggcaccgt gcatccgctg gagaggggag 1921 tggtccagca ggtgaagggc cactaccgcc aggccatgct gctcaaggcc atggccgcgc 1981 tagagggcca ggatccctca ggcctgcagc tgggtctcac ggaggccctg cactttgtgg 2041 ctgccgcctg gcaggcagtg gagccttcgg acatagccgc ctgctttcgt gaggctggct 2101 ttgggggtgg ccctaatgcc accatcacca cttccctcaa gagtgaggga gaggaagagg 2161 aggaggagga ggaagaagag gaggaggaag agggtgaagg agaggaagag gaggaggaag 2221 gggaggagga ggaggaggaa gggggggaag gagaggaatt gggggaggaa gaggaggtgg 2281 aggaggaggg tgatgttgat agtgatgaag aagaggagga agatgaggag agctcctcgg 2341 agggcttgga ggctgaggac tgggcccagg gagtagtgga ggccggtggc agcttcgggg 2401 cttatggtgc ccaggaggaa gcccagtgcc ctactctgca tttcctggaa ggtggggagg 2461 actctgattc agacagtgag gaagaggacg atgaggaaga ggatgatgaa gatgaagacg 2521 acgatgatga tgaggaggat ggtgatgagg tgcctgtacc cagctttggg gaggccatgg 2581 cttactttgc catggtcaag aggtacctga cctccttccc cattgatgac cgcgtgcaga 2641 gccacatcct ccacttggaa cacgatctgg ttcatgtgac caggaagaac cacgccaggc 2701 aggcgggagt tcgaggtctt ggacatcaaa gctgagtcac tggacctagc tgtgccccca 2761 acctagattg gcagcaccac cccagggcag aggactctct gggcacccgc tgtgcatgga 2821 gccagagtgc agagccccag atcctttagt aatgcttccc ctggtcctgc aacaggcccg 2881 gtcacctcgg ccgggcccgg ggctgaggtc agcctcactg cctgcttatt gcctctttct 2941 cagaatcctc tttcctcccc atttggccct gggctcaggg gaccaggtgg ggcgggtggg 3001 gagctgtccg gtgctaccac accgtgccct cagtggacta accacagcag cagccaggga 3061 tgggccctgg aggttcccgg ccggagagtg cctctcccct ctgccatcca cgtcaggtct 3121 ttggtggggg gaccccaaag ccattctggg aagggctcca gaagaaggtc cagcctaggc 3181 cccctgcaag gctggcagcc cccaccccca ccccccaggc cgccttgaga agcacagttt 3241 aactcactgc gggctcctga gcctgcttct gcctgctttc cacctcccca gtccctttct 3301 ctggccctgt ccatgtgact ttggcccttg gttttctttc cagattggag gtttccaaga 3361 ggccccccac cgtggaagta accaagggcg cttccttgtg ggcagctgca ggccccatgc 3421 ctctcctccc tctctctggc agggcccatc ctgggcagag gggcctgggg ctgggcccag 3481 agtccagccg tccagctgct cctttcccag tttgatttca ataaatctgt ccactcccct 3541 tttgtggggg tgaacgtttt aacagccaag ggtgcatcct tcatggtctg ggcttgcgtc 3601 tgtcttgggg gacttattcg tcctggctct ctttggtcct tgctctggtg ggacatggag 3661 gcaagtgttg agagggttgc cctgaccgga agaggggcag gaggagacct caagctt // LOCUS HSCHEMR1 1447 bp DNA PRI 02-OCT-1997 DEFINITION H.sapiens ChemR1 gene. ACCESSION Y08456 NID g2465081 KEYWORDS CC-chemokine receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1447) AUTHORS Samson,M., Stordeur,P., Labbe,O., Soularue,P., Vassart,G. and Parmentier,M. TITLE Molecular cloning and chromosomal mapping of a novel human gene, ChemR1, expressed in T lymphocytes and polymorphonuclear cells and encoding a putative chemokine receptor JOURNAL Eur. J. Immunol. 26 (12), 3021-3028 (1996) MEDLINE 97131825 REFERENCE 2 (bases 1 to 1447) AUTHORS Parmentier,M. TITLE Direct Submission JOURNAL Submitted (27-SEP-1996) M. Parmentier, Universite Libre de Bruxelles, I R I B H N ULB Campus Erasme, 808 Route de Lennik, 1070 Bruxelles, BELGIUM FEATURES Location/Qualifiers source 1..1447 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda DASHII" /clone="C1" /chromosome="3" /map="p21.3" gene 333..1400 /gene="ChemR1" CDS 333..1400 /gene="ChemR1" /codon_start=1 /product="CC-chemokine receptor ChemR1" /db_xref="PID:e274870" /db_xref="PID:g2465082" /translation="MDYTLDLSVTTVTDYYYPDIFSSPCDAELIQTNGKLLLAVFYCL LFVFSLLGNSLVILVLVVCKKLRSITDVYLLNLALSDLLFVFSFPFQTYYLLDQWVFG TVMCKVVSGFYYIGFYSSMFFITLMSVDRYLAVVHAVYALKVRTIRMGTTLCLAVWLT AIMATIPLLVFYQVASEDGVLQCYSFYNQQTLKWKIFTNFKMNILGLLIPFTIFMFCY IKILHQLKRCQNHNKTKAIRLVLIVVIASLLFWVPFNVVLFLTSLHSMHILDGCSISQ QLTYATHVTEIISFTHCCVNPVIYAFVGEKFKKHLSEIFQKSCSQIFNYLGRQMPRES CEKSSSCQQHSSRSSSVDYIL" BASE COUNT 363 a 357 c 300 g 427 t ORIGIN 1 agtagagacg gggtttccgc catgttggca aggctggtct tgaacccctg acctcaggtg 61 atctgcccac cttggcctcc caaagtgcta ggattacagg catgagccac agctcccggt 121 ctatcattta accttaatta catctttaaa ggcccaaata gtctcaccga ctccaaatag 181 tcacacccac accggaggtt gagcacttca acacatgaat ttggggagga cacagttcag 241 tccataacat ccccctaatt tttaaaaaat aaaaatgttt ttaaggagtg aatgtctttt 301 atgtgtctct gtgaccaggt cccgctgcct tgatggatta tacacttgac ctcagtgtga 361 caacagtgac cgactactac taccctgata tcttctcaag cccctgtgat gcggaactta 421 ttcagacaaa tggcaagttg ctccttgctg tcttttattg cctcctgttt gtattcagtc 481 ttctgggaaa cagcctggtc atcctggtcc ttgtggtctg caagaagctg aggagcatca 541 cagatgtata cctcttgaac ctggccctgt ctgacctgct ttttgtcttc tccttcccct 601 ttcagaccta ctatctgctg gaccagtggg tgtttgggac tgtaatgtgc aaagtggtgt 661 ctggctttta ttacattggc ttctacagca gcatgttttt catcaccctc atgagtgtgg 721 acaggtacct ggctgttgtc catgccgtgt atgccctaaa ggtgaggacg atcaggatgg 781 gcacaacgct gtgcctggca gtatggctaa ccgccattat ggctaccatc ccattgctag 841 tgttttacca agtggcctct gaagatggtg ttctacagtg ttattcattt tacaatcaac 901 agactttgaa gtggaagatc ttcaccaact tcaaaatgaa cattttaggc ttgttgatcc 961 cattcaccat ctttatgttc tgctacatta aaatcctgca ccagctgaag aggtgtcaaa 1021 accacaacaa gaccaaggcc atcaggttgg tgctcattgt ggtcattgca tctttacttt 1081 tctgggtccc attcaacgtg gttcttttcc tcacttcctt gcacagtatg cacatcttgg 1141 atggatgtag cataagccaa cagctgactt atgccaccca tgtcacagaa atcatttcct 1201 ttactcactg ctgtgtgaac cctgttatct atgcttttgt tggggagaag ttcaagaaac 1261 acctctcaga aatatttcag aaaagttgca gccaaatctt caactaccta ggaagacaaa 1321 tgcctaggga gagctgtgaa aagtcatcat cctgccagca gcactcctcc cgttcctcca 1381 gcgtagacta cattttgtga ggatcaatga agactaaata taaaaaacat tttcttgaat 1441 ggcatgc // LOCUS HSCIC1MCC 3093 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens gene for ClC-1 muscle chloride channel protein. ACCESSION Z25587 NID g397142 KEYWORDS chloride channel protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3093) AUTHORS Pusch,M., Steinmeyer,K., Koch,M.C. and Jentsch,T.J. TITLE Mutations in dominant human myotonia congenita drastically alter the voltage dependence of the CIC-1 chloride channel JOURNAL Neuron 15 (6), 1455-1463 (1995) MEDLINE 96112039 REFERENCE 2 (bases 1 to 3093) AUTHORS Jentsch,T.J. TITLE Direct Submission JOURNAL Submitted (19-AUG-1993) Thomas J Jentsch, Centre for molecular neurobiology, Hamburg, University, Martinistr.52, Hamburg, D-20246, Germany FEATURES Location/Qualifiers source 1..3093 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Caucasian male fetus placenta" /clone_lib="Stratagene Lambda FixII genomic library #946203" /chromosome="7" /map="7q" 5'UTR 1..87 /partial /note="sequence is derived from a genomic clone; shows sequence homology to the rat ClC-1 5`UTR region (GenBank # X62894)" CDS 88..3054 /note="88..596 is derived from a genomic clone; 597..3054 is identical to partial human ClC-1 cDNA clone (GenBank M97820)" /codon_start=1 /product="human ClC-1 muscle chloride channel" /db_xref="PID:g397143" /db_xref="SWISS-PROT:P35523" /translation="MEQSRSQQRGGEQSWWGSDPQYQYMPFEHCTSYGLPSENGGLQH RLRKDAGPRHNVHPTQIYGHHKEQFSDREQDIGMPKKTGSSSTVDSKDEDHYSKCQDC IHRLGQVVRRKLGEDWIFLVLLGLLMALVSWSMDYVSAKSLQAYKWSYAQMQPSLPLQ FLVWVTFPLVLILFSALFCHLISPQAVGSGIPEMKTILRGVVLKEYLTMKAFVAKVVA LTAGLGSGIPVGKEGPFVHIASICAAVLSKFMSVFCGVYEQPYYYSDILTVGCAVGVG CCFGTPLGGVLFSIEVTSTYFAVRNYWRGFFAATFSAFVFRVLAVWNKDAVTITALFR TNFRMDFPFDLKELPAFAAIGICCGLLGAVFVYLHRQVMLGVRKHKALSQFLAKHRLL YPGIVTFVIASFTFPPGMGQFMAGELMPREAISTLFDNNTWVKHAGDPESLGQSAVWI HPRVNVVIIIFLFFVMKFWMSIVATTMPIPCGGFMPVFVLGAAFGRLVGEIMAMLFPD GILFDDIIYKILPGGYAVIGAAALTGAVSHTVSTAVICFELTGQIAHILPMMVAVILA NMVAQSLQPSLYDSIIQVKKLPYLPDLGWNQLSKYTIFVEDIMVRDVKFVSASYTYGE LRTLLQTTTVKTLPLVDSKDSMILLGSVERSELQALLQRHLCPERRLRAAQEMARKLS ELPYDGKARLAGEGPPGAPPGRPESFAFVDEDEDEDLSGKSELPPSLALHPSTTAPLS PEEPNGPLPGHKQQPEAPEPAGQRPSIFQSLLHCLLGRARPTKKKTTQDSTDLVDNMS PEEIEAWEQEQLSQPVCFDSCCIDQSPFQLVEQTTLHKTHTLFSLLGLHLAYVTSMGK LRGVLALEELQKAIEGHTKSGVQLRPPLASFRNTTSTRKSTGAPPSSAENWNLPEDRP GATGTGDVIAASPETPVPSPSPEPPLSLAPGKVEGELEELELVESPGLEEELADILQG PSLRSTDEEDEDELIL" 3'UTR 3055..3093 /partial /note="sequence is identical to a partial human ClC-1 cDNA clone (GenBank M97820)" BASE COUNT 656 a 884 c 853 g 700 t ORIGIN 1 agcaagagca gaggcttaag gagctacact gggggaagga caggggcaag caggccaagg 61 cctggccggg gctcgggggg agggaatatg gagcaatccc ggtcacagca gcgtgggggt 121 gaacaaagct ggtggggtag tgacccccag taccagtata tgccctttga acactgcacc 181 agctacggac tgccctctga gaatgggggc ctccagcaca ggctccggaa ggatgcaggc 241 ccccgccaca acgtccaccc cacacagatt tatggccatc acaaagaaca attctcagac 301 agggagcagg acatagggat gcccaagaag acaggctcca gttctaccgt ggacagcaag 361 gatgaggatc actattctaa atgtcaagat tgtatccacc gcctgggaca ggtggtgaga 421 agaaaattag gggaagactg gatctttctg gtgcttctgg gactgctgat ggctctggtc 481 agctggagca tggactacgt cagtgccaaa agccttcagg cctacaagtg gtcctacgcg 541 cagatgcagc ccagccttcc tctgcagttc ctggtctggg tcaccttccc actagtcctc 601 atcctcttca gcgccctctt ctgccacctc atctctcccc aggctgttgg ctctggaatc 661 cccgaaatga agacaatact tcgtggggtt gtcctgaagg aatacctcac aatgaaagcc 721 tttgtggcca aggttgtcgc cctgactgcg ggcctgggca gtggcatccc cgtggggaaa 781 gagggcccct tcgtccacat tgccagcatc tgtgctgctg tcctcagcaa attcatgtct 841 gtgttctgcg gggtatatga gcagccatac tactactctg atatcctgac ggtgggctgt 901 gctgtgggag tcggctgttg ttttgggaca ccacttggag gagtgctatt tagcatcgag 961 gtcacctcca cctactttgc tgttcggaac tactggagag gattctttgc agccacgttc 1021 agcgcctttg tgtttcgagt gctggcagtg tggaacaagg atgctgtcac catcactgct 1081 ctgttcagaa ccaatttccg aatggatttc ccctttgacc tgaaggaact accagctttt 1141 gctgccatcg ggatttgctg tgggctcctg ggagctgtat ttgtgtatct gcatcgccaa 1201 gtcatgctcg gtgtccgaaa gcacaaggcc ctcagccagt ttcttgctaa gcaccgcctg 1261 ctgtatcctg gaattgttac ctttgtcatt gcctcattca ccttcccacc aggaatgggt 1321 caattcatgg ctggagagtt gatgccccgc gaagccatca gtactttgtt tgacaacaat 1381 acatgggtga aacacgcggg tgatcctgag agcctgggcc agtcagctgt gtggattcac 1441 ccccgggtca acgttgtcat catcatcttt ctcttcttcg tcatgaagtt ctggatgtcc 1501 atcgtggcca ccactatgcc cataccctgc ggaggcttca tgcctgtgtt tgtgctagga 1561 gctgcatttg gaaggctggt aggagaaatc atggccatgc tctttcctga tggtattttg 1621 tttgatgaca tcatctacaa gatcctacct gggggctatg cagtaattgg agcagcagcg 1681 ctgactggtg ccgtttccca cacagtctcc acagctgtga tttgcttcga attaacgggt 1741 cagattgctc acatcctgcc catgatggtg gctgttatct tggccaacat ggtggcccag 1801 agcctgcagc cctctctcta tgacagcatc atccaggtca agaagctacc ctacttgcct 1861 gaccttggct ggaaccagct cagcaaatat accatctttg ttgaggacat catggtacgt 1921 gatgtgaagt ttgtttcagc ttcttacaca tatggggagt tgcgaaccct gctccagacc 1981 accacagtca agactttacc actggttgac tcaaaagatt caatgatcct gctgggctcg 2041 gtggagcggt cggaactgca ggccctcctg cagcgccacc tgtgtcctga gcgcaggctg 2101 cgcgcagccc aagagatggc gcggaagttg tcggagctgc cttacgacgg gaaggcgcgg 2161 ctggctgggg aggggccccc cggcgcgcct ccaggccggc ccgagtcctt cgcctttgtg 2221 gatgaggatg aggacgaaga tctctctggc aagagcgagc ttcctccttc ccttgctctc 2281 cacccctcta ctactgcccc tctgtcccca gaagagccca atgggcctct gcctggccac 2341 aaacagcagc cggaagcacc agagcctgca ggtcaaagac cctccatctt ccagtccctg 2401 cttcactgct tgctgggcag agctcgcccc acaaagaaga aaacaaccca ggattccaca 2461 gatttagtgg ataacatgtc acctgaagag attgaggcct gggagcagga gcagctgagc 2521 cagcctgtct gttttgattc ctgctgtatt gaccagtctc ccttccagct ggtggagcag 2581 acaaccctgc acaagactca taccctgttt tcactccttg gcctccacct cgcttacgtg 2641 accagcatgg ggaagctcag gggcgtcctg gccctggagg agctacagaa ggccattgag 2701 gggcacacca agtctggggt gcagctccgc cctccccttg ccagcttccg gaacacgact 2761 tcaactcgaa agagtaccgg ggcacctcca tcttctgcag agaactggaa cctgcctgag 2821 gacaggcctg gggccactgg aacaggggat gtgattgctg cctccccaga gacccctgtg 2881 ccatctcctt ccccagagcc ccctctctcc ctggccccag gcaaggtaga gggcgagttg 2941 gaggagctgg agctggtgga gagtccaggg ctggaagagg agctggccga catcttgcag 3001 ggccccagcc tgcgatccac agacgaggag gatgaggatg aactgatcct ttgaccccct 3061 cccacgacct cctcataaag accgtggaga ggc // LOCUS HSDOPD1 1689 bp DNA PRI 06-APR-1993 DEFINITION H.sapiens dopamine D1 receptor gene. ACCESSION X55758 NID g288931 KEYWORDS dopamine D1 receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1689) AUTHORS Sunahara,R.K., Niznik,H.B., Weiner,D.M., Stormann,T.M., Brann,M.R., Kennedy,J.L., Gelernter,J.E., Rozmahel,R., Yang,Y., Israel,Y., Seeman,P. and O'Dowd,B.F. TITLE Human dopamine D1 receptor encoded by an intronless gene on chromosome 5 JOURNAL Nature 347 (6288), 80-83 (1990) MEDLINE 90370095 FEATURES Location/Qualifiers source 1..1689 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" CDS 274..1614 /codon_start=1 /product="dopamine D1 receptor" /db_xref="PID:g288932" /db_xref="SWISS-PROT:P21728" /translation="MRTLNTSAMDGTGLVVERDFSVRILTACFLSLLILSTLLGNTLV CAAVIRFRHLRSKVTNFFVISLAVSDLLVAVLVMPWKAVAEIAGFWPFGSFCNIWVAF DIMCSTASILNLCVISVDRYWAISSPFRYERKMTPKAAFILISVAWTLSVLISFIPVQ LSWHKAKPTSPSDGNATSLAETIDNCDSSLSRTYAISSSVISFYIPVAIMIVTYTRIY RIAQKQIRRIAALERAAVHAKNCQTTTGNGKPVECSQPESSFKMSFKRETKVLKTLSV IMGVFVCCWLPFFILNCILPFCGSGETQPFCIDSNTFDVFVWFGWANSSLNPIIYAFN ADFRKAFSTLLGCYRLCPATNNAIETVSINNNGAAMFSSHHEPRGSISKECNLVYLIP HAVGSSEDLKKEEAAGIARPLEKLSPALSVILDYDTDVSLEKIQPITQNGQHPT" BASE COUNT 375 a 451 c 426 g 437 t ORIGIN 1 ttcaggggct ttctggtgcc cttgacagtg acctgcagca agggagtcag aagacagatg 61 tagaaatcaa gagtgaccat ccacgggatt gacttggatt gccactcaag cggtcctctc 121 atggaatgtt ggtgaggccc tctgccaggg aagcaatctg gctgtgcaaa gtgctgcctg 181 gtggggagga ctcctggaaa tctgactgac ccctattccc tgcttaggaa cttgaggggt 241 gtcagagccc ctgatgtgct ttctcttagg aagatgagga ctctgaacac ctctgccatg 301 gacgggactg ggctggtggt ggagagggac ttctctgttc gtatcctcac tgcctgtttc 361 ctatcgctgc tcatcctgtc cacgctcctg gggaacacgc tggtctgtgc tgccgttatc 421 aggttccgac acctgcggtc caaggtgacc aacttctttg tcatctcctt ggctgtgtca 481 gatctcttgg tggcagtcct ggtcatgccc tggaaggcag tggctgagat tgctggcttc 541 tggccctttg ggtccttctg taacatctgg gtggcctttg acatcatgtg ctccactgca 601 tccatcctca acctctgtgt gatcagcgtg gacaggtatt gggctatctc cagccctttc 661 cggtatgaga gaaagatgac ccccaaggca gccttcatcc tgatcagtgt ggcatggacc 721 ttgtctgtac tcatctcctt catcccagtg cagctcagct ggcacaaggc aaaacccaca 781 agcccctctg atggaaatgc cacttccctg gctgagacca tagacaactg tgactccagc 841 ctcagcagga catatgccat ctcatcctct gtaataagct tttacatccc tgtggccatc 901 atgattgtca cctacaccag gatctacagg attgctcaga aacaaatacg gcgcattgcg 961 gccttggaga gggcagcagt ccacgccaag aattgccaga ccaccacagg taatggaaag 1021 cctgtcgaat gttctcaacc ggaaagttct tttaagatgt ccttcaaaag agaaactaaa 1081 gtcctgaaga ctctgtcggt gatcatgggt gtgtttgtgt gctgttggct acctttcttc 1141 atcttgaact gcattttgcc cttctgtggg tctggggaga cgcagccctt ctgcattgat 1201 tccaacacct ttgacgtgtt tgtgtggttt gggtgggcta attcatcctt gaaccccatc 1261 atttatgcct ttaatgctga ttttcggaag gcattttcaa ccctcttagg atgctacaga 1321 ctttgccctg cgacgaataa tgccatagag acggtgagta tcaataacaa tggggccgcg 1381 atgttttcca gccatcatga gccacgaggc tccatctcca aggagtgcaa tctggtttac 1441 ctgatcccac atgctgtggg ctcctctgag gacctgaaaa aggaggaggc agctggcatc 1501 gccagaccct tggagaagct gtccccagcc ctatcggtca tattggacta tgacactgac 1561 gtctctctgg agaagatcca acccatcaca caaaacggtc agcacccaac ctgaactcgc 1621 agatgaatcc tgccacacat gctcatccca aaagctagag gagattgctc tggggtttgc 1681 tattaagaa // LOCUS HSEAR2 2380 bp DNA PRI 25-JUN-1997 DEFINITION Human v-erbA related ear-2 gene. ACCESSION X12794 NID g31064 KEYWORDS DNA-binding protein; ear-2 gene; hormone receptor; receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2380) AUTHORS Miyajima,N. TITLE Direct Submission JOURNAL Submitted (01-SEP-1988) Miyajima N., Department of Oncology, The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai Minato-ku, Tokyo, 108 Japan REFERENCE 2 (bases 1 to 2380) AUTHORS Miyajima,N., Kadowaki,Y., Fukushige,S., Shimizu,S., Semba,K., Yamanashi,Y., Matsubara,K., Toyoshima,K. and Yamamoto,T. TITLE Identification of two novel members of erbA superfamily by molecular cloning: the gene products of the two are highly related to each other JOURNAL Nucleic Acids Res. 16 (23), 11057-11074 (1988) MEDLINE 89083547 COMMENT cell line=TIG-1; clone=lambda A14. FEATURES Location/Qualifiers source 1..2380 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="12-week old embryo" /tissue_type="lung" /chromosome="19" CDS 703..1914 /note="ear-2 gene product" /codon_start=1 /db_xref="PID:g31065" /db_xref="SWISS-PROT:P10588" /translation="MAMVTGGWGGPGGDTNGVDKAGGYPRAAEDDSASPPGAASDAEP GDEERPGLQVDCVVCGDKSSGKHYGVFTCEGCKSFFKRTIRRNLSYTCRSNRDCQIDQ HHRNQCQYCRLKKCFRVGMRKEAVQRGRIPHSLPGAVAASSGSPPGSALAAVASGGDL FPGQPVSELIAQLLRAEPYPAAAGRFGAGGGAAGAVLGIDNVCELAARLLFSTVEWAR HGFFPELPVADQVALLRMSWSELFVLNAAQAALPLHTAPLLAAAGLHAAPMAAERAVA FMDQVRAFQEQVDKLGRLQVDSAEYGCLKAIALFTPDACGLSDPAHVESLQEKAQVAL TEYVRAQYPSQPQRFGRLLLRLPALRAVPASLISQLFFMRLVGKTPIETLIRDMLLSG STFNWPYGSGQ" misc_feature 868..1065 /note="put. DNA-binding domain" BASE COUNT 345 a 837 c 853 g 345 t ORIGIN 1 ctgtgtgtca aaccggccgc agcgcgaggc cggcgcgcgg ggtcggtggc ggtggcggcg 61 gcgagagcga ggcttgggcc caggcgcagg cccaggccag gcaggagcgg ccgcagcggg 121 ggcggcagct ccagaggcgc cggcccagcg gccctgcagg cccggagcgg cagccgcggg 181 cgccggggga cccagggggc gcggcgatca ccccggcgcg cacccgccgc ccctcccccc 241 cgccacgggc ggccgcgcaa cttgcggcca aactttcccc ccggcctccc gcgctcggat 301 ggcggcccgc tgagttggcc gcagcccccg ggcgagcggc ggccaggcca gaggacgccc 361 cctgcgcggc tcgggccggg ccatggccgc gcgcccccgc cggggcccca gggcggcggg 421 acggagtcca tctgggcccc cggcccgcgc ctctagcgtg cccctcccgg ccccgggccc 481 ggccccgcgc acggtgggaa agttggcctg gaaccggccc gaccagttcc gcccgggcgc 541 gaggaccggc cgcagaagtt gctgcaaaac ttttttgggg ggtgcagccc gtgccccccg 601 cgcgccgggg ccgaatgcgc gccgcgtagg gtcccccggg ccgagagggg tgcccggagg 661 aagagcgcgg tgggggcgcc ccggccccgc tgccctgggg ctatggccat ggtgaccggc 721 ggctggggcg gccccggcgg cgacacgaac ggcgtggaca aggcgggcgg ctacccgcgc 781 gcggccgagg acgactcggc ctcgcccccc ggtgccgcca gcgacgccga gccgggcgac 841 gaggagcggc cggggctgca ggtggactgc gtggtgtgcg gggacaagtc gagcggcaag 901 cattacggtg tcttcacctg cgagggctgc aagagctttt tcaagcgaac gatccgccgc 961 aacctcagct acacctgccg gtccaaccgt gactgccaga tcgaccagca ccaccggaac 1021 cagtgccagt actgccgtct caagaagtgc ttccgggtgg gcatgaggaa ggaggcggtg 1081 cagcgcggcc gcatcccgca ctcgctgcct ggtgccgtgg ccgcctcctc gggcagcccc 1141 ccgggctcgg cgctggcggc agtggcgagc ggcggagacc tcttcccggg gcagccggtg 1201 tccgaactga tcgcgcagct gctgcgcgct gagccctacc ctgcggcggc cggacgcttc 1261 ggcgcagggg gcggcgcggc gggcgcggtg ctgggcatcg acaacgtgtg cgagctggcg 1321 gcgcggctgc tcttcagcac cgtggagtgg gcgcgccacg gcttcttccc cgagctgccg 1381 gtggccgacc aggtggcgct gctgcgcatg agctggagcg agctcttcgt gctgaacgcg 1441 gcgcaggcgg cgctgcccct gcacacggcg ccgctactgg ccgccgccgg cctccacgcc 1501 gcgcctatgg ccgccgagcg cgccgtggct ttcatggacc aggtgcgcgc cttccaggag 1561 caggtggaca agctgggccg cctgcaggtc gactcggccg agtatggctg cctcaaggcc 1621 atcgcgctct tcacgcccga cgcctgtggc ctctcagacc cggcccacgt tgagagcctg 1681 caggagaagg cgcaggtggc cctcaccgag tatgtgcggg cgcagtaccc gtcccagccc 1741 cagcgcttcg ggcgcctgct gctgcggctc cccgccctgc gcgcggtccc tgcctccctc 1801 atctcccagc tgttcttcat gcgcctggtg gggaagacgc ccattgagac actgatcaga 1861 gacatgctgc tgtcggggag taccttcaac tggccctacg gctcgggcca gtgaccatga 1921 cggggccacg tgtgctgtgg ccaggcctgc agacagacct caagggacag ggaatgctga 1981 ggcctcgagg ggcctcccgg ggcccaggac tctggcttct ctcctcagac ttctattttt 2041 taaagactgt gaaatgtttg tcttttctgt tttttaaatg atcatgaaac caaaaagaga 2101 ctgatcatcc aggcctcagc ctcatcctcc ccaggacccc tgtccaggat ggagggtcca 2161 atcctaggac agccttgttc ctcagcaccc ctagcatgaa cttgtgggat ggtggggttg 2221 gcttccctgg catgatggac aaaggcctgg cgtcggccag aggggctgct ccagtgggca 2281 ggggtagcta gcgtgtgcca ggcagatcct ctggacacgt aacctatgtc agacactaca 2341 tgatgactca aggccaataa taaagacatt tcctacctgc // LOCUS HSECPG 1669 bp DNA PRI 15-JAN-1991 DEFINITION Human ECP gene for eosinophil cationic protein. ACCESSION X55990 NID g31084 KEYWORDS ECP gene; eosinophil cationic protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1669) AUTHORS Simonsen,C.C. TITLE Direct Submission JOURNAL Submitted (11-OCT-1990) Simonsen C.C., Genelabs INC, Dept of Recombinant expression, 505 Penobscot DR, Redwood City, CA 94063, USA REFERENCE 2 (bases 1 to 1669) AUTHORS Simonsen,C.C., Kennedy,J., Comstock,L., Ashton,N. and McGrogan,M. JOURNAL Unpublished COMMENT See also entries X55987, X55988, X55989. FEATURES Location/Qualifiers source 1..1669 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon" prim_transcript <314..1286 /gene="ECP" mRNA join(<314..371,601..1268) /gene="ECP" exon <314..371 /gene="ECP" /number=1 /product="eosinophil cationic protein" gene 314..1286 /gene="ECP" intron 372..600 /gene="ECP" /number=1 exon 601..1268 /gene="ECP" /number=2 /product="eosinophil cationic protein" sig_peptide 606..686 /gene="ECP" CDS 606..1088 /gene="ECP" /codon_start=1 /product="eosinophil cationic protein" /db_xref="PID:g31085" /db_xref="SWISS-PROT:P12724" /translation="MVPKLFTSQICLLLLLGLMGVEGSLHARPPQFTRAQWFAIQHIS LNPPRCTIAMRAINNYRWRCKNQNTFLRTTFANVVNVCGNQSIRCPHNRTLNNCHRSR FRVPLLHCDLINPGAQNISNCRYADRPGRRFYVVACDNRDPRDSPRYPVVPVHLDTTI " mat_peptide 687..1085 /gene="ECP" /product="eosinophil cationic protein" polyA_signal 1245..1250 /gene="ECP" BASE COUNT 435 a 438 c 352 g 444 t ORIGIN 1 cgaacatctg caccccccgc cgcagtccca ttcacccctt ttaccttacc cctcacgttt 61 ctgccagcag cgtatagttc tcacagagtc cagatcccac cggcaaaact ctgtctaaca 121 caggatgact tggaattaga gtaagtatag cagaaagagc agcagggctg tccttgggta 181 tccgttcgtc agccaagtca tcaaataaaa aggatgattg cacaagtgga ccatgtgtca 241 atctgtgggt ttctgccatg gccagaccca ccaagggaag ctttatttaa acagttccaa 301 gtaggggaga ccagctgccc cagaacaacc agctggatca gttctcacag gagccacagc 361 tcagagactg ggtaagtcaa caatccccca gagctgggac aggaggggca gcgacagggc 421 agcacctgag tgagaggtga gctaagttag tgcttaggag atgtggcaca ctttggggac 481 aggaagaaaa ggaaatgcga ccccagagtg gcagcagagg ggcctgtggg ttgagacact 541 atagagtgtg tcataaccga gaccggatcg ggagtagtta cttctcttct tttcttacag 601 gaaacatggt tccaaaactg ttcacttccc aaatttgtct gcttcttctg ttggggctta 661 tgggtgtgga gggctcactc catgccagac ccccacagtt tacgagggct cagtggtttg 721 ccatccagca catcagtctg aacccccctc gatgcaccat tgcaatgcgg gcaattaaca 781 attatcgatg gcgttgcaaa aaccaaaata cttttcttcg tacaactttt gctaatgtag 841 ttaatgtttg tggtaaccaa agtatacgct gccctcataa cagaactctc aacaattgtc 901 atcggagtag attccgggtg cctttactcc actgtgacct cataaatcca ggtgcacaga 961 atatttcaaa ctgcaggtat gcagacagac caggaaggag gttctatgta gttgcatgtg 1021 acaacagaga tccacgggat tctccacggt atcctgtggt tccagttcac ctggatacca 1081 ccatctaagc tcctgtatca gcagtcctca tcatcactca tctgccaagc tcctcaatca 1141 tagccaagat cccatccctc catgtactct gggtatcagc aactgtcctc atcagtctcc 1201 ataccccttc agctttcctg agctgaagtc cttgtgaacc ctgcaataaa ctgctttgca 1261 aattcatctg gaagtgtctg tgtgtcttcc tcgggctctg ctgtcattta gtgacaatct 1321 gctctagaga tttgggttta tcatgaatct ctccccctca atatctgacc aaattccttg 1381 attcccccat catccttcat gtgatacctg attccaggcc tgccttaaaa aaaaatccaa 1441 ttgagtgaac ttagcattgg tctccctagc cttaatatct cctctaagca attttccatc 1501 cactgactcc tcccccaaca ccaacctata acttgtgtat agatctccac ttgttttagt 1561 tgtattccag aattgagccc aatttgatat tgaggtccat ttgatgcttt atcctccgac 1621 tatggttttt attgacatga tttctatccc aagaaaaaaa gagctgcag // LOCUS HSEYA2 1617 bp DNA PRI 05-SEP-1997 DEFINITION H.sapiens EYA2 gene. ACCESSION Y10261 NID g1834488 KEYWORDS EYA2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1617) AUTHORS Abdekhak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Weil,D., Cruaud,C., Sahly,I., Leibovici,M., Bitner-Glindzicz,M., Francis,M., Lacomde,D., Vigneron,J., Charachon,R., Boven,K., Bedbeder,P., Van Regemorter,N., Weissenbach,J. and Petit,C. TITLE A human homologue of the Drosophila eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel gene family JOURNAL Nature Genet. 15 (2), 157-164 (1997) MEDLINE 97172972 REFERENCE 2 (bases 1 to 1617) AUTHORS Abdelhak,S. TITLE Direct Submission JOURNAL Submitted (23-DEC-1996) S. Abdelhak, Genetique Moleculaire Humaine, URA CNRS 1968, Institut Pasteur, 25 Rue du Dr Roux, 75724 Paris Cedex 15, FRANCE FEATURES Location/Qualifiers source 1..1617 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="9 week embryo" /chromosome="20" /map="20q13.1" gene 1..1617 /gene="EYA2" CDS 1..1617 /gene="EYA2" /codon_start=1 /db_xref="PID:e290305" /db_xref="PID:g1834489" /translation="MVELVISPSLTVNSDCLDKLKFNRADAAVWTLSDRQGITKSAPL RVSQLFSRSCPRVLPRQPSTAMAAYGQTQYSAGIQQATPYTAYPPPAQAYGIPSYSIK TEDSLNHSPGQSGFLSYGSSFSTSPTGQSPYTYQMHGTTGFYQGGNGLGNAAGFGSVH QDYPSYPGFPQSQYPQYYGSSYNPPYVPASSICPSPLSTSTYVLQEASHNVPNQSSES LAGEYNTHNGPSTPAKEGDTDRPHRASDGKLRGRSKRSSDPSPAGDNEIERVFVWDLD ETIIIFHSLLTGTFASRYGKDTTTSVRIGLMMEEMIFNLADTHLFFNDLEDCDQIHVD DVSSDDNGQDLSTYNFSADGFHSSAPGANLCLGSGVHGGVDWMRKLAFRYRRVKEMYN TYKNNVGGLIGTPKRETWLQLRAELEALTDLWLTHSLKALNLINSRPNCVNVLVTTTQ LIPALAKVLLYGLGSVFPIENIYSATKTGKESCFERIMQRFGRKAVYVVIGDGVEEEQ GAKKHNMPFWRISCHADLEALRHALELEYL" BASE COUNT 399 a 488 c 409 g 321 t ORIGIN 1 atggtagaac tagtgatctc acccagcctc actgtaaaca gcgattgtct ggataaactg 61 aagtttaacc gtgctgacgc tgctgtgtgg actctgagtg acagacaagg catcaccaaa 121 tcggcccccc tgagagtgtc ccagctcttc tccagatctt gcccacgtgt cctcccccgc 181 cagccttcca cagccatggc agcctacggc cagacgcagt acagtgcggg gatccagcag 241 gctaccccct atacagctta cccacctcca gcacaagcct atggaatccc ttcctacagc 301 atcaagacag aagacagctt gaaccattcc cctggccaga gtggattcct cagctatggc 361 tccagcttca gcacctcacc cactggacag agcccataca cctaccagat gcacggcaca 421 acagggttct atcaaggagg aaatggactg ggcaacgcag ccggtttcgg gagtgtgcac 481 caggactatc cttcctaccc cggcttcccc cagagccagt acccccagta ttacggctca 541 tcctacaacc ctccctacgt cccggccagc agcatctgcc cttcgcccct ctccacgtcc 601 acctacgtcc tccaggaggc atctcacaac gtccccaacc agagttccga gtcacttgct 661 ggtgaataca acacacacaa tggaccttcc acaccagcga aagagggaga cacagacagg 721 ccgcaccggg cctccgacgg gaagctccga ggccggtcta agaggagcag tgacccgtcc 781 ccggcagggg acaatgagat tgagcgtgtg ttcgtgtggg acttggatga gacaataatt 841 atttttcact ccttactcac ggggacattt gcatccagat acgggaagga caccacgacg 901 tccgtgcgca ttggccttat gatggaagag atgatcttca accttgcaga tacacatctg 961 ttcttcaatg acctggagga ttgtgaccag atccacgttg atgacgtctc atcagatgac 1021 aatggccaag atttaagcac atacaacttc tccgctgacg gcttccacag ttcggcccca 1081 ggagccaacc tgtgcctggg ctctggcgtg cacggcggcg tggactggat gaggaagctg 1141 gccttccgct accggcgggt gaaggagatg tacaatacct acaagaacaa cgttggtggg 1201 ttgataggca ctcccaaaag ggagacctgg ctacagctcc gagctgagct ggaagctctc 1261 acagacctct ggctgaccca ctccctgaag gcactaaacc tcatcaactc ccggcccaac 1321 tgtgtcaatg tgctggtcac caccactcaa ctaattcctg ccctggccaa agtcctgcta 1381 tatggcctgg ggtctgtgtt tcctattgag aacatctaca gtgcaaccaa gacagggaag 1441 gagagctgct tcgagaggat aatgcagaga ttcggcagaa aagctgtcta cgtggtgatc 1501 ggtgatggtg tggaagagga gcaaggagcg aaaaagcaca acatgccttt ctggcggata 1561 tcctgccacg cagacctgga ggcactgagg cacgccctgg aactggagta tttatag // LOCUS HSEYA3 1722 bp DNA PRI 05-SEP-1997 DEFINITION H.sapiens EYA3 gene. ACCESSION Y10262 NID g1834490 KEYWORDS EYA3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1722) AUTHORS Abdekhak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Weil,D., Cruaud,C., Sahly,I., Leibovici,M., Bitner-Glindzicz,M., Francis,M., Lacomde,D., Vigneron,J., Charachon,R., Boven,K., Bedbeder,P., Van Regemorter,N., Weissenbach,J. and Petit,C. TITLE A human homologue of the Drosophila eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel gene family JOURNAL Nature Genet. 15 (2), 157-164 (1997) MEDLINE 97172972 REFERENCE 2 (bases 1 to 1722) AUTHORS Abdelhak,S. TITLE Direct Submission JOURNAL Submitted (23-DEC-1996) S. Abdelhak, Genetique Moleculaire Humaine, URA CNRS 1968, Institut Pasteur, 25 Rue du Dr Roux, 75724 Paris Cedex 15, FRANCE REMARK revised by submitter 31-JAN-1997 FEATURES Location/Qualifiers source 1..1722 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="9 week embryo" /chromosome="1" gene 1..1722 /gene="EYA3" CDS 1..1722 /gene="EYA3" /codon_start=1 /db_xref="PID:e299512" /db_xref="PID:g1834491" /translation="MEEEQDLPEQPVKKAKMQESGEQTISQVSNPDVSDQKPETSSLA SNLPMSEEIMTCTDYIPRSSNDYTSQMYSAKPYAHILSVPVSETAYPGQTQYQTLQQT QPDAVYPQATQTYGLPPFGALWPGMKPESGLIQTPSPSQHSVLTCTTGLTTSQPSPAH YSYPIQASSTNASLISTSSTIANIPAAAVASISNQDYPTYTILGQNQYQACYPSSSFG VTGQTNSDAESTTLAATTYQSEKPSVMAPAPAAQKLSSGDPSTSPSLSQTTPSKDTDD QSRKNMNSKNRGKKKADATSSQDSELERVFLWDLDETIIIFHSLLTGSYAQKYGKDPT VVIGSGLTMEKMIFEVADTHLFSNDLKECDQVHVEDVAPNDKGQNLNNYSFSTNGFSG SGGSGSHGSSVGVQGGVDWMRKLAFRYRKVREIYDKHKSNVGGLLSPQRKEALQKLKA EIEVLTNSWLGTALKSLLLIQSKKNCVNVLITTTQLLPALAKVLLYGLGKIFPIENIY SATKIGKESCFERIVTSLGKKLTYVVIGDGRDEEIAAKQHNMPFWRITNHGDLVSLHQ ALELDFL" BASE COUNT 544 a 411 c 353 g 414 t ORIGIN 1 atggaagaag agcaagattt accagagcaa ccagtgaaaa aagccaagat gcaggaatca 61 ggagagcaaa ctataagtca agtaagcaat ccagatgtca gtgatcagaa gcctgaaaca 121 tcaagccttg cttcaaacct tcccatgtca gaggaaatta tgacatgcac cgattacatc 181 cctcgctcat ccaatgatta tacctcacaa atgtattctg caaaacctta tgcacatatt 241 ctctcagttc ctgtttcgga aactgcttac cctggacaga ctcaatacca gacactacag 301 cagactcaac cagatgctgt ctaccctcag gcaacccaaa cgtatggact acctcctttt 361 ggtgcattgt ggccaggtat gaaacctgaa agtggtttaa ttcagactcc atctccaagt 421 caacacagtg ttcttacctg cactacaggg ttaaccacaa gccagccaag cccagcacat 481 tattcttatc ccattcaagc ttcaagcaca aatgccagcc tgatatctac ttcttctaca 541 attgccaata ttccagcagc agcagtagcc agcatctcaa accaggatta tcccacctat 601 actattcttg gtcagaatca gtaccaggcc tgctacccca gctccagctt tggagtcaca 661 ggtcagacta acagtgatgc agagagcacc acattagcag caaccacata ccagtcggag 721 aagcctagtg tcatggcgcc tgcacctgca gcacagaaac tttcctctgg agacccttct 781 acaagtccat ctttgtccca gactacacca agtaaagata ctgatgatca gtccaggaaa 841 aacatgaata gcaagaaccg gggcaagaag aaagctgatg ccacttcttc ccaagacagt 901 gaattagaac gggtatttct gtgggacttg gatgaaacca tcatcatctt ccactcactt 961 cttactggat cctatgccca gaaatatgga aaggacccaa cagtagtgat tggctcaggt 1021 ttaacaatgg aaaaaatgat ttttgaagtg gctgataccc atctattttc caatgactta 1081 aaagagtgtg accaggtaca tgtggaagat gtggctccta atgacaaggg ccaaaacttg 1141 aacaactaca gtttctcaac aaatggtttc agtggctcag gaggtagtgg cagccatggt 1201 tcatctgtgg gtgttcaggg aggtgtggac tggatgagga aactagcttt ccgctaccgg 1261 aaagtgagag aaatctatga taagcataaa agcaacgtgg gtggtctcct cagtccccaa 1321 aggaaggaag cactgcaaaa attaaaagca gaaattgaag ttttaacaaa ttcctggtta 1381 ggaactgcat taaagtcctt acttctcatc cagtccaaaa agaattgtgt gaatgttctg 1441 atcactacca cccagctgct tccagccctg gccaaggttc tcctatatgg actaggaaaa 1501 atatttccta ttgagaacat ctatagtgct accaaaattg gtaaggagag ctgctttgag 1561 agaattgtca caagccttgg aaagaaactc acatatgttg tgattggaga tggacgagat 1621 gaagaaattg cagccaaaca gcacaacatg cctttctgga ggatcacaaa ccatggagac 1681 ctagtatccc ttcaccaggc tttagagctt gattttctct aa // LOCUS HSH4BHIS 814 bp DNA PRI 09-NOV-1992 DEFINITION H.sapiens H4/b gene for H4 histone. ACCESSION X60482 NID g31996 KEYWORDS H4/b gene; histone H4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 814) AUTHORS Doenecke,D. TITLE Direct Submission JOURNAL Submitted (08-JUL-1991) D. Doenecke, Georg-August Univer, Inst fuer Biochemie, Zentrum 3 des Fachbereichs Medizin Bioce, Humboldtallee 23, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 814) AUTHORS Doenecke,D. and Kardalinou,E. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..814 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Placenta" /clone="21" TATA_signal 203..218 gene 257..568 /gene="H4/b" CDS 257..568 /gene="H4/b" /codon_start=1 /product="H4 histone" /db_xref="PID:g31997" /db_xref="SWISS-PROT:P02304" /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV KRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG G" terminator 603..619 /note="Histone mRNA" BASE COUNT 221 a 187 c 202 g 204 t ORIGIN 1 gatcgcgcca ctgcattcca gcctgggcaa cagagcaaga acccgtctca aaaaaaaaaa 61 aaaaaaaaaa aaaaagcacc ctgtgaggaa acagtactaa ttatttgata ttctgggaaa 121 agtgggggac aactgtcagg cttctttgtc gaaagtttat gaactgatgg ctcagttaat 181 ggctgcaagt atagtgtgtg tgtatatata tatatatacc tagcagtatt tattaaatcc 241 cagctgtggt ttcaagatgt ctggccgcgg taagggcgga aagggtctag gtaagggtgg 301 cgccaagcgt caccgtaagg tattgcgtga caatatccaa ggaatcacca agcccgctat 361 ccgccgcctg gctcgccgcg gcggcgtcaa gcgtatttct ggcctcattt atgaggaaac 421 tcgcggagtg ctgaaagttt tcctggaaaa tgtaatccgc gatgctgtca cctacacgga 481 acacgccaaa cgcaagacag tcacagccat ggacgtggtg tacgcgctca agcgccaggg 541 acgcactctt tatggcttcg gcggctgagc ttacctctac agtacactac cgcaaaacca 601 acggcccttt tcagggccac ctatccactc aggagaaaga gtagtagtca ctgctaaaag 661 tgtagtttca cgtgtttagt agctccggtt ttcaagttaa atggtcttat tacgccttgg 721 cttcatatct tactggccgg tgaggcatta gtgtattaaa gtttattttc actcttgctg 781 tgtcgcccat gctggagtaa tcaatggcgc gatc // LOCUS HSH4GHIS 738 bp DNA PRI 08-DEC-1995 DEFINITION H.sapiens H4/g gene for H4 histone. ACCESSION X60486 NID g32003 KEYWORDS H4/g gene; histone H4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 738) AUTHORS Doenecke,D. TITLE Direct Submission JOURNAL Submitted (08-JUL-1991) D. Doenecke, Georg-August Univer, Inst fuer Biochemie, Zentrum 3 des Fachbereichs Medizin Bioce, Humboldtallee 23, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 738) AUTHORS Drabent,B., Kardalinou,E., Bode,C. and Doenecke,D. TITLE Association of histone H4 genes with the mammalian testis-specific H1t histone gene JOURNAL DNA Cell Biol. 14 (7), 591-597 (1995) MEDLINE 95352203 FEATURES Location/Qualifiers source 1..738 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Leucocyte" /clone="D4.1" TATA_signal 185..191 gene 232..543 /gene="H4/g" CDS 232..543 /gene="H4/g" /codon_start=1 /product="H4 histone" /db_xref="PID:g32004" /db_xref="SWISS-PROT:P02304" /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV KRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG G" terminator 595..610 /note="Histone mRNA" BASE COUNT 217 a 178 c 174 g 169 t ORIGIN 1 aatacagcgc attcaacttg caaacaccct tccactccca caaagagcaa gctgtcactg 61 gccaatcaaa acaatgaacc ataatgaaac agtttttctt gctccaccca ctcggtgacc 121 aaatttgaaa aaaaaaaaaa accgcgccaa ctcatgttgt tttcaatcag gtccgccaag 181 tttgtattta aggaactgtt tcagttcata ccttccactg cgataggaat catgtctggt 241 cgcggcaaag gcggaaaagg cttggggaag ggtggtgcta agcgccatcg taaggtgctc 301 cgggataaca tccagggcat tacaaaaccg gctatccgcc gtttggctcg gcgcggtggg 361 gtcaagcgca tttccggtct tatctatgag gagactcgag gtgtgcttaa ggttttctta 421 gagaacgtta ttcgagacgc cgtcacctat acggagcacg ccaagcgcaa aactgtcaca 481 gccatggatg tagtatatgc cctaaaacgt caggggcgca ctctgtatgg cttcggcggc 541 tgaatctaag aatacgcggt ctcctgagaa cttcaaaaaa caaaaaaacc caaaggccct 601 tttcagggcc gctcacaaag tcgtttaaag agctgaaatg cgttgcgaga atgagtttgg 661 atgacagaaa taaccgtgac agcctgcata agaatgaatt gtgtttgcca tgaccggcca 721 cactgtgaca aaatttca // LOCUS HSHB2B 734 bp DNA PRI 28-FEB-1995 DEFINITION H.sapiens HB2B gene for high sulfur keratin. ACCESSION X63338 S47244 NID g311881 KEYWORDS hair microfibrill matrix protein; HB2B gene; high sulphur keratin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 734) AUTHORS Zhumabayeva,B.D. TITLE Direct Submission JOURNAL Submitted (25-NOV-1991) B.D. Zhumabayeva, Institute of Molecular Genetics, USSR Academy of Science, Kurchatov sq. 46, Moscow, 123182, USSR REFERENCE 2 (bases 1 to 734) AUTHORS Zhumabaeva,B.D., Gening,L.V. and Gazaryan,K.G. TITLE Cloning and structural characterization of human hair sulfur-rich keratin genes JOURNAL Mol. Biol. 26, 550-555 (1992) REFERENCE 3 (bases 1 to 734) AUTHORS Zhumabaeva,B.D., Gening,L.V. and Gazarian,K.G. TITLE [Cloning and structural characteristics of human hair keratin genes rich in sulfur] JOURNAL Mol. Biol. 26, 813-820 (1992) FEATURES Location/Qualifiers source 1..734 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 13..17 TATA_signal 59..66 gene 142..669 /gene="HB2B" CDS 142..669 /gene="HB2B" /codon_start=1 /product="high sulfur keratin" /db_xref="PID:g311882" /translation="MTCCQTSFCGYPSCSTSGTCGSSCCQPSCCETSCCQPSCCETSC CQPSCCQTSFCDFLASQLVDLQLSCCQPSCCETSCCQPSCCQTSSCGTGCGIGGGIGY GQEGSSGAVSTRIRWCRPDCRVEGTCLPPCCVVSCHTPTCCQLHHAEASCCRPSYCGQ SCCRPVCCCYSCSHC" BASE COUNT 161 a 234 c 183 g 156 t ORIGIN 1 aattatccaa cacaataagg cagagcttct gaattatgta aacagtagct ggccaggctt 61 ataaaaggcc aatgtggcag ccatcaccaa aactcagaaa ctcctccaag caacccagac 121 ttcataccag ctcccaacac catgacctgc tgccagacca gcttctgtgg atatcccagc 181 tgctccacca gtgggacatg cggctccagc tgctgccagc caagctgctg tgagaccagc 241 tgctgccagc caagctgctg tgagaccagc tgctgccagc caagctgctg ccagaccagc 301 ttctgcgatt tcctagcttc tcaactagtg gacctgcagc tcagttgctg ccagccaagc 361 tgctgtgaga ccagctgctg ccagccaagc tgctgccaga ccagctcctg cggaactggc 421 tgtggcattg gtggtggcat tggctatggc caggagggca gcagtggagc tgtgagcacc 481 cgtatcaggt ggtgccgccc agactgccgt gtggagggta cctgcctgcc cccctgctgt 541 gtggtgagct gccacacccc aacctgctgc cagctgcacc acgccgaggc ctcctgctgc 601 cgcccatcct actgtggaca gtcctgctgc cgcccagtct gctgctgcta ctcctgtagc 661 cactgctaaa gcagtttgct gatttaactg aaattccatt tcagttccat tcagttaagc 721 aataattcta agaa // LOCUS HSHGM07EG 3459 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens HGMP07E gene for olfactory receptor. ACCESSION X65857 S59676 NID g425220 KEYWORDS G protein-coupled receptor; HGMP07E gene; olfactory receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3459) AUTHORS Parmentier,M. TITLE Direct Submission JOURNAL Submitted (27-APR-1992) M. Parmentier, Universite Libre de Bruxelles, I.R.I.B.H.N. ULB Campus Erasme, 808 route de Lennik, 1070 Bruxelles, BELGIUM REFERENCE 2 (bases 1 to 3459) AUTHORS Schurmans,S., Muscatelli,F., Miot,F., Mattei,M.G., Vassart,G. and Parmentier,M. TITLE The OLFR1 gene encoding the HGMP07E putative olfactory receptor maps to the 17p13-->p12 region of the human genome and reveals an MspI restriction fragment length polymorphism JOURNAL Cytogenet. Cell Genet. 63 (3), 200-204 (1993) MEDLINE 93251832 FEATURES Location/Qualifiers source 1..3459 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda charon 4a" /chromosome="17" /map="p12-13" gene 883..1821 /gene="HGMP07E" CDS 883..1821 /gene="HGMP07E" /note="putative olfactory receptor" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g425221" /db_xref="SWISS-PROT:P34982" /translation="MDGGNQSEGSEFLLLGMSESPEQQQILFWMFLSMYLVTVVGNVL IILAISSDSRLHTPVYFFLANLSFTDLFFVTNTIPKMLVNLQSHNKAISYAGCLTQLY FLVSLVALDNLILAVMAYDRYVAICCPLHYTTAMSPKLCILLLSLCWVLSVLYGLIHT LLMTRVTFCGSRKIHYIFCEMYVLLRMACSNIQINHTVLIATGCFIFLIPFGFVIISY VLIIRAILRIPSVSKKYKAFSTCASHLGAVSLFYGTLCMVYLKPLHTYSVKDSVATVM YAVVTPMMNPFIYSLRNKDMHGALGRLLDKHFKRLT" BASE COUNT 1074 a 768 c 633 g 984 t ORIGIN 1 gaattcatgc ttggtgacac ttctcttcat attagaaacc ttgatatgtt tatcaattta 61 tttaattttt actttacccc tcactggaga atttgaaggt ttcgcctgat tcttgttaac 121 atctgagtgc catgatttat ttgttgacag tggggatgag atagtttttc attataacat 181 ttttcctgat tctcgttaaa tataacaata ataggcaaca tttctgagtg ctaattgcct 241 accaactcct tgtgttaagc actttactac ttaattttcc cccaccctat aactcatgat 301 tcgcatgaag aaactgagtc ttaaggaact ctggaaccct acctgaggtt tcacagcaag 361 taagtgcagt cgtactatct tactagcagg gaaagacttg atttcaagta ttctaaactc 421 tatttctagt agtctttcag tggatagtcc cacttttttc agatagatca tcatatcaac 481 aaataataat tggtcctact cctttcaaat acttttattt tcttcctttt ttgaactact 541 tatactgcat gtatctatat taattatttt aacacatata catatcataa cacactaatc 601 accaaacact tcaagaatag tgtaagatga caggattgaa aataagaatt acacattatt 661 cctttaacat tgagtttccc agctttgaag tagctgaaat aattatatcg cataaaaact 721 ttgttatatt tttcactttc ttattttcaa aaattataaa attgggtgta agacattctt 781 aattctaaga aaatgttgat tttgcttatc ttcatgtttt tattcaatta aggacttttg 841 gtaaacattt gctggtgtta atgttaaaag agagttgggg aaatggatgg aggcaaccag 901 agtgaaggtt cagagttcct tctcctgggg atgtcagaga gtcctgagca gcagcagatc 961 ctgttttgga tgttcctgtc catgtacctg gtcacggtgg tgggaaatgt gctcatcatc 1021 ctggccatca gctctgattc ccgcctgcac acccccgtgt acttcttcct ggccaacctc 1081 tccttcactg acctcttctt tgtcaccaac acaatcccca agatgctggt gaacctccag 1141 tcccataaca aagccatctc ctatgcaggg tgtctgacgc agctctactt cctggtctcc 1201 ttggtggccc tggacaacct catcctggct gtgatggcat atgaccgcta tgtggccatc 1261 tgctgccccc tccactacac cacagccatg agccctaagc tctgtatctt actcctttcc 1321 ttgtgttggg tcctatccgt cctctatggc ctcatacaca ccctcctcat gaccagagtg 1381 accttctgtg ggtcacgaaa aatccactac atcttctgtg agatgtatgt attgctgagg 1441 atggcatgtt ccaacattca gattaatcac acagtgctga ttgccacagg ctgcttcatc 1501 ttcctcattc cctttggatt cgtgatcatt tcctatgtgc tgattatcag agccatcctc 1561 agaataccct cagtctctaa gaaatacaaa gccttctcca cctgtgcctc ccatttgggt 1621 gcagtctccc tcttctatgg gacactttgt atggtatacc taaagcccct ccatacctac 1681 tctgtgaagg actcagtagc cacagtgatg tatgctgtgg tgacacccat gatgaatccc 1741 ttcatctaca gcctgaggaa caaggacatg catggggctc tgggaagact cctagataaa 1801 cactttaaga ggctgacatg agggcaattt ggaaagacag cattaaagtg gagactagga 1861 atatccttca ccctatgtaa gggattgtcc tgtgtgttat acagcagtga ttgggacatg 1921 gctccagctc agagacagca tatagatatg tggtgataaa aaagacatat ttgtaacctg 1981 gtgtccccca ggtctcatca gccttggccg taaataaggt cacactaaca ccaacactag 2041 aatgttgcag ggtcaaattc ttcaatgtac ttgactacag ggccacattc ttggccttat 2101 ctgactatat ccagtttaaa cctagaagtg tctctcatct agcacacatc caaagtacag 2161 aaagtaaata gtagctgata agaaagttag tcacatggct gtggaggttt gaaagagatt 2221 gcaatcatac atatttgtat cagctgatcc agcacgtgat atagacctcg acaggtggtg 2281 ttcaattcat ttgacattca tgcagtcatt catcaactca ttctattcat aatgacagtg 2341 tacagaggcc tcaaactggg ttacaaatgt gaggtcacag tctactcggg gaagtacata 2401 aatttacatt aaacataaat ggacctaact catcaattaa aaagtaatgt tcaacatact 2461 gattaaaaaa taaaatatag atagatgtat ttaaagagac acgatgaaag cacaggaata 2521 tataaagctt gaaattaaaa agagaaacag acatatttgg aaaatactaa atattttaag 2581 tgaaatattg ctgcaatgac atcaggtaaa agatatttca aaacgggtga aaggagtgat 2641 gtcaacaaga tggtggaatc gacagttgta tctctcatcc accaacatac agactaattt 2701 atcaaccatc cacagatgaa aatacctttg tgagagctcc agaatccaag tgaaagttta 2761 tagcaccctg gtagtacaca gaagtagaaa aaacaccata ttgaacattg tagaaaaaac 2821 tctgtcacat tacctgaatc acccctcacc caagccagca cagagtagca caaagagaga 2881 tcccctcatc tcatgagttc ttccataagt aaaaaagaaa ataaaacata tgtacaactt 2941 cccctgactt tcaggatgct aaccaagagg accacttctg tcatgcctca ccaagaatac 3001 tgaggcaatt cacatggcca gacccctctg agtagctaag aacaagaaaa aaaaaaaaat 3061 aagaaaatgg ttgggagctc ttaatagtca gtatgtgtat tttaacaact ggggccttgc 3121 accctacagt gggcctgtgc atggtgccca gaagctggcc catccaccca catccccaag 3181 cactaggcct gcctgcccat agaccctgcc aactggccag cccagaatat ctggctaagc 3241 tgactggtga aaaacagttc ctatcaaatt ggactttccc tatcaaaacc agcctgtaaa 3301 gactaaaaga gatgactgct tcttcaaatg tgcaaacaca aatgtgaagc tacaggaaac 3361 acaaggaatc aaggaaactt agcaaagcca aaggaacaaa ataaagcttc agaaaccatc 3421 aaagaaatgg agaaatatga actgcctgac aaagaattc // LOCUS HSHISH1 1464 bp DNA PRI 19-SEP-1997 DEFINITION H.sapiens histamine H1 receptor gene. ACCESSION X76786 NID g442517 KEYWORDS histamine H1 receptor; membrane associated protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1464) AUTHORS De Backer,M.D., Gommeren,W., Moereels,H., Nobels,G., Van Gompel,P., Leysen,J.E. and Luyten,W.H. TITLE Genomic cloning, heterologous expression and pharmacological characterization of a human histamine H1 receptor JOURNAL Biochem. Biophys. Res. Commun. 197 (3), 1601-1608 (1993) MEDLINE 94107375 REFERENCE 2 (bases 1 to 1464) AUTHORS De Backer,M.D. TITLE Direct Submission JOURNAL Submitted (16-DEC-1993) M.D. De Backer, Janssen Pharmaceutica n.v., Turnhoutseweg 30, 2340 Beerse, BELGIUM FEATURES Location/Qualifiers source 1..1464 /organism="Homo sapiens" /isolate="healthy volunteer R.W." /db_xref="taxon:9606" /haplotype="diploid" /tissue_type="white blood cells" /clone_lib="lambda EMBL3/ genomic cDNA" /clone="17.2" /sub_clone="4kbBam/pUC18, 0.5kbBam /pUC18" /dev_stage="adult" CDS 1..1464 /codon_start=1 /product="histamine H1 receptor" /db_xref="PID:g442518" /db_xref="SWISS-PROT:P35367" /translation="MSLPNSSCLLEDKMCEGNKTTMASPQLMPLVVVLSTICLVTVGL NLLVLYAVRSERKLHTVGNLYIVSLSVADLIVGAVVMPMNILYLLMSKWSLGRPLCLF WLSMDYVASTASIFSVFILCIDRYRSVQQPLRYLKYRTKTRASATILGAWFLSFLWVI PILGWNHFMQQTSVRREDKCETDFYDVTWFKVMTAIINFYLPTLLMLWFYAKIYKAVR QHCQHRELINRSLPSFSEIKLRPENPKGDAKKPGKESPWEVLKRKPKDAGGGSVLKSP SQTPKEMKSPVVFSQEDDREVDKLYCFPLDIVHMQAAAEGSSRDYVAVNRSHGQLKTD EQGLNTHGASEISEDQMLGDSQSFSRTDSDTTTETAPGKGKLRSGSNTGLDYIKFTWK RLRSHSRQYVSGLHMNRERKAAKQLGFIMAAFILCWIPYFIFFMVIAFCKNCCNEHLH MFTIWLGYINSTLNPLIYPLCNENFKKTFKRILHIRS" BASE COUNT 352 a 406 c 359 g 347 t ORIGIN 1 atgagcctcc ccaattcctc ctgcctctta gaagacaaga tgtgtgaggg caacaagacc 61 actatggcca gcccccagct gatgcccctg gtggtggtcc tgagcactat ctgcttggtc 121 acagtagggc tcaacctgct ggtgctgtat gccgtacgga gtgagcggaa gctccacact 181 gtggggaacc tgtacatcgt cagcctctcg gtggcggact tgatcgtggg tgccgtcgtc 241 atgcctatga acatcctcta cctgctcatg tccaagtggt cactgggccg tcctctctgc 301 ctcttttggc tttccatgga ctatgtggcc agcacagcgt ccattttcag tgtcttcatc 361 ctgtgcattg atcgctaccg ctctgtccag cagcccctca ggtaccttaa gtatcgtacc 421 aagacccgag cctcggccac cattctgggg gcctggtttc tctcttttct gtgggttatt 481 cccattctag gctggaatca cttcatgcag cagacctcgg tgcgccgaga ggacaagtgt 541 gagacagact tctatgatgt cacctggttc aaggtcatga ctgccatcat caacttctac 601 ctgcccacct tgctcatgct ctggttctat gccaagatct acaaggccgt acgacaacac 661 tgccagcacc gggagctcat caataggtcc ctcccttcct tctcagaaat taagctgagg 721 ccagagaacc ccaaggggga tgccaagaaa ccagggaagg agtctccctg ggaggttctg 781 aaaaggaagc caaaagatgc tggtggtgga tctgtcttga agtcaccatc ccaaaccccc 841 aaggagatga aatccccagt tgtcttcagc caagaggatg atagagaagt agacaaactc 901 tactgctttc cacttgatat tgtgcacatg caggctgcgg cagaggggag tagcagggac 961 tatgtagccg tcaaccggag ccatggccag ctcaagacag atgagcaggg cctgaacaca 1021 catggggcca gcgagatatc agaggatcag atgttaggtg atagccaatc cttctctcga 1081 acggactcag ataccaccac agagacagca ccaggcaaag gcaaattgag gagtgggtct 1141 aacacaggcc tggattacat caagtttact tggaagaggc tccgctcgca ttcaagacag 1201 tatgtatctg ggttgcacat gaaccgcgaa aggaaggccg ccaaacagtt gggttttatc 1261 atggcagcct tcatcctctg ctggatccct tatttcatct tcttcatggt cattgccttc 1321 tgcaagaact gttgcaatga acatttgcac atgttcacca tctggctggg ctacatcaac 1381 tccacactga accccctcat ctaccccttg tgcaatgaga acttcaagaa gacattcaag 1441 agaattctgc atattcgctc ctaa // LOCUS HSHISH2B 843 bp DNA PRI 12-SEP-1993 DEFINITION Human histone H2b gene. ACCESSION X00088 NID g32112 KEYWORDS histone; histone H2B. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 843) AUTHORS Zhong,R., Roeder,R.G. and Heintz,N. TITLE The primary structure and expression of four cloned human histone genes JOURNAL Nucleic Acids Res. 11 (21), 7409-7425 (1983) MEDLINE 84069776 FEATURES Location/Qualifiers source 1..843 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 297..303 /note="Hogness Box" precursor_RNA 324..805 /note="primary transcript" CDS 370..747 /note="histone H2b" /codon_start=1 /db_xref="PID:g32113" /db_xref="SWISS-PROT:P06899" /translation="MPEPAKSAPAPKKGSKKAVTKAQKKDGKSAAHRKESYSIYVYKV LKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLL PGELAKHAVSEGTKAVTKYTSAK" misc_signal 781..796 /note="dyad symmetry pot. transcription termination" BASE COUNT 208 a 233 c 207 g 195 t ORIGIN 1 cttggcctta gcgcgggctt tgcctccctg cttgccacgt ccagacatag cgagcgcaac 61 tcactacgag caaccacaaa gtgaacggga aaggcggcgc tttttataaa cactattggg 121 cgcgaaaaag aagacgtgtt gttggttggg actgcagttt aatttcaacc aatagtagtg 181 cgtcttctgg atttgcgaat cctgattggg cagacctgac ctctgacgtt accctgaata 241 actaccaatc agacacaaga cttcaactct tcaccttatt tgcataagcg attctatata 301 aaagcgcctt gtcataccct gctcacgctg tttttccttt tcgttggcgc tttatagcta 361 cacagtgcta tgccagagcc agcgaagtct gctcccgccc cgaaaaaggg ctccaagaag 421 gcggtgacta aggcgcagaa gaaagacggc aagagcgcag cgcaccgcaa ggagagctat 481 tccatctatg tgtacaaggt tctgaagcag gtccaccctg acaccggcat ttcgtccaag 541 gccatgggca tcatgaattc gtttgtgaac gacattttcg agcgcatcgc aggtgaggct 601 tcccgccttg cgcattacaa caagcgctcg accatcacct ccagggagat ccagacggcc 661 gtgcgcctgc tgctgcctgg ggagttggcc aagcacgccg tgtccgaggg tactaaggcc 721 gtcaccaagt acaccagcgc taagtaaaca gtgagttggt tgcaaactct caaccctaac 781 ggctctttta agagccaccc atgttctcaa agaaagagct ggtgcttgta tttcctcctc 841 gct // LOCUS HSHISH3 698 bp DNA PRI 12-SEP-1993 DEFINITION Human histone H3 gene. ACCESSION X00090 NID g32114 KEYWORDS histone; histone H3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 698) AUTHORS Zhong,R., Roeder,R.G. and Heintz,N. TITLE The primary structure and expression of four cloned human histone genes JOURNAL Nucleic Acids Res. 11 (21), 7409-7425 (1983) MEDLINE 84069776 FEATURES Location/Qualifiers source 1..698 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 93..97 /note="CAAT box" promoter 114..120 /note="TATA box" precursor_RNA 148..659 /note="put. primary transcript" CDS 186..596 /note="histone H3" /codon_start=1 /db_xref="PID:g32115" /db_xref="SWISS-PROT:P16106" /translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRP GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEACEAYLV GLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA" misc_signal 634..649 /note="dyad symmetry" BASE COUNT 169 a 183 c 177 g 169 t ORIGIN 1 acggtaatga caggaatctc tcttaatctg caactaggca cagagatggg ccaatccaag 61 aagggcgcgg ggatttttga attttcttgg gtccaatagt tggtggtctg actctataaa 121 agaagagtag ctctttcctt tcctccacag acgtctctgc aggcaagctt ttctgtggtt 181 ttgccatggc tcgtactaaa cagacagctc ggaaatccac cggcggtaaa gcgccacgca 241 agcagctggc taccaaggct gctcgcaaga gcgcgccggc taccggcggt gtgaaaaagc 301 ctcaccgtta ccgtccgggt actgtggctc tgcgtgagat ccgccgctac caaaagtcga 361 ccgagttgct gattcggaag ctgccgttcc agcgcctggt gcgagaaatc gcccaagact 421 tcaagaccga tcttcgcttc cagagctctg cggtaatggc gctgcaggag gcttgtgagg 481 cctacttggt agggctcttt gaggacacaa acctttgcgc catccatgct aagcgagtga 541 ctattatgcc caaagacatc cagctcgctc gccgcattcg cggagaaaga gcgtaaatgt 601 aaagttactt tttcatcagt cttaaaaccc aaaggctctt ttcagagcca cccacttatt 661 ccaacgaaag tagctgtgat aattttttgt tgtctcaa // LOCUS HSICAAR 3598 bp DNA PRI 18-APR-1997 DEFINITION H.sapiens ICAAR gene. ACCESSION Y08569 NID g1644377 KEYWORDS ICAAR gene; islet cell autoantigen releted. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3598) AUTHORS Smith,P.D., Barker,K.T., Wang,J., Lu,Y.J., Shipley,J. and Crompton,M.R. TITLE ICAAR, a novel member of a new family of transmembrane, tyrosine phosphatase-like proteins JOURNAL Biochem. Biophys. Res. Commun. 229 (2), 402-411 (1996) MEDLINE 97127415 REFERENCE 2 (bases 1 to 3598) AUTHORS Smith,P.D. TITLE Direct Submission JOURNAL Submitted (06-SEP-1996) P.D. Smith, Institute of Cancer Research, Cell Biology Experimental Pathology, 15 Cotswold Road, Sutton, Surrey SM2 5NG, UK FEATURES Location/Qualifiers source 1..3598 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" gene 293..3340 /gene="ICAAR" CDS 293..3340 /gene="ICAAR" /codon_start=1 /product="Islet Cell Autoantigen Releted" /db_xref="PID:e273864" /db_xref="PID:g1644378" /translation="MGPPLPLLLLLLLLLPPRVLPAAPSSVPRGRQLPGRLGCLLEEG LCGASEACVNDGVFGRCQKVPAMDFYRYEVSPVALQRLRVALQKLSGTGFTWQDDYTQ YVMDQELADLPKTYLRRPEASSPARPSKHSVGSERRYSREGGAALANALRRHLPFLEA LSQAPASDVLARTHTAQDRPPAEGDDRFSESILTYVAHTSALTYPPGSRTQLREDLLP RTLGQLQPDELSPKVDSGVDRHHLMAALGAYAAQRPPAPPGEGSLEPQYLLRAPSRMP RPLLAPAAPQKWPSPLGDSEDPSSTGDGARIHTLLKDLQRQPAEVRRLNGLELDGMAE LMAGLMQGVDHGVARGSPGRAALGESGEQADGPKATLRGDSFPDDGVQDDDDRLYQEV HRLSATLGGLLQDHGSRLLPGALPFARPLDMERKKSEHPESSLSSEEETAGVENVKSQ TYSKDLLGQQPHSEPGAAAFGELQNQMPGPSKEEQSLPAGAQEALSDGLQLEVQPSEE EARGYIVTDRDPLRPEEGRRLVEDVARLLQVPSSAFADVEVLGPAVTFKVSANVQNVT TEDVEKATVDNKDKLEETSGLKILQTGVGSKSKLKFLPPQAEQEDSTKFIALTLVSLA CILGVLLASGLIYCLRHSSQHRLKEKLSGLGGDPGADATAAYQELCRQRMATRPPDRP EGPHTSRISSVSSQFSDGPIPSPSARSSASSWSEEPVQSNMDISTGHMILSYMEDHLK NKNRLEKEWEALCAYQAEPNSSFVAQREENVPKNRSLAVLTYDHSRVLLKAENSHSHS DYINASPIMDHDPRNPAYIATQGPLPATVADFWQMVWESGCVVIVMLTPLAENGVRQC YHYWPDEGSNLYHIYEVNLVSEHIWCEDFLVRSFYLKNLQTNETRTVTQFHFLSWYDR GVPSSSRSLLDFRRKVNKCYRGRSCPIIVHCSDGAGRSGTYVLIDMVLNKMAKGAKEI DIAATLEHLRDQRPGMVQTKEQFEFALTAVAEEVNAILKALPQ" BASE COUNT 803 a 1096 c 1043 g 655 t 1 others ORIGIN 1 ttcagatact gctagtgaga ctatacattg gtacatacaa ctgttttgga aaagtatttg 61 gcagtatcta ctgaagctga ataatatgac ccagcaatcc tgcccctggg tatatactca 121 acagaaatgc ataaatatgt tccctaaaag acatatacta gaatgttcat agcagccggt 181 cagcagtaga atgggtaaat aaatatcggt atattcacat aatcaagtat acacagcaat 241 gaaaataaac aactacacac atggactgag cgccgccggc caggccgcgg ggatggggcc 301 gccgctcccg ctgctgctgc tgctactgct gctgctgccg ccacgcgtcc tgcctgccgc 361 cccttcgtcc gtcccccgcg gccggcagct cccggggcgt ctgggctgcc tgctcgagga 421 gggcctctgc ggagcgtccg aggcctgtgt gaacgatgga gtgtttggaa ggtgccagaa 481 ggttccggca atggactttt accgctacga ggtgtcgccc gtggccctgc agcgcctgcg 541 cgtggcgttg cagaagcttt ccggcacagg tttcacgtgg caggatgact atactcagta 601 tgtgatggac caggaacttg cagacctccc gaaaacctac ctgaggcgtc ctgaagcatc 661 cagcccagcc aggccctcaa aacacagcgt tggcagcgag aggaggtaca gtcgggaggg 721 cggtgctgcc ctggccaacg ccctccgacg ccacctgccc ttcctggagg ccctgtccca 781 ggccccagcc tcagacgtgc tcgccaggac ccatacggcg caggacagac cccccgctga 841 gggtgatgac cgcttctccg agagcatcct gacctatgtg gcccacacgt ctgcgctgac 901 ctaccctccc gggtcccgga cccagctccg cgaggatctc ctgccgcgga ccctaggcca 961 gctccagcca gatgagctca gccctaaggt ggacagtggt gtggacagac accatctgat 1021 ggcggccctc ggtgcctatg ctgcccagag gcccccagct ccccccgggg agggcagcct 1081 ggagccacag taccttctgc gtgcaccctc aagaatgccc aggcctttgc tggcaccagc 1141 cgccccccag aagtggcctt cacctctggg agattccgaa gacccctcca gcacaggcga 1201 tggagcacgg attcataccc tcctgaagga cctgcagagg cagccggctg aggtgaggcg 1261 cctgaatggc ctggagctgg acggcatggc cgagctgatg gctggcctga tgcaaggcgt 1321 ggaccatgga gtagctcgag gcagccctgg gagagcggcc ctgggagagt ctggagaaca 1381 ggcggatggc cccaaggcca ccctccgtgg agacagcttt ccagatgacg gagtgcagga 1441 cgacgatgat agactttacc aagaggtcca tcgtctgagt gccacactcg ggggcctcct 1501 gcaggaccac gggtctcgac tcttacctgg agccctcccc tttgcaaggc ccctcgacat 1561 ggagaggaag aagtccgagc accctgagtc ttccctgtct tcagaagagg agactgccgg 1621 agtggagaac gtcaagagcc agacgtattc caaagatctg ctggggcagc agccgcattc 1681 ggagcccggg gccgctgcgt ttggggagct ccaaaaccag atgcctgggc cctcgaagga 1741 ggagcagagc cttccagcgg gtgctcagga ggccctcagc gacggcctgc aattggaggt 1801 ccagccttcc gaggaagagg cgcggggcta catcgtgaca gacagagacc ccctgcgccc 1861 cgaggaagga aggcggctgg tggaggacgt cgcccgcctc ctgcaggtgc ccagcagtgc 1921 gttcgctgac gtggaggttc tcggaccagc agtgaccttc aaagtgagcg ccaatgtcca 1981 aaacgtgacc actgaggatg tggagaaggc cacagttgac aacaaagaca aactggagga 2041 aacctctgga ctgaaaattc ttcaaaccgg agtcgggtcg aaaagcaaac tcaagttcct 2101 gcctcctcag gcggagcaag aagactccac caagttcatc gcgctcaccc tggtctccct 2161 cgcctgcatc ctgggcgtcc tcctggcctc tggcctcatc tactgcctcc gccatagctc 2221 tcagcacagg ctgaaggaga agctctcggg actagggggc gacccaggtg cagatgccac 2281 tgccgcctac caggagctgt gccgccagcg tatggccacg cggccaccag accgacctga 2341 gggcccgcac acgtcacgca tcagcagcgt ctcatcccag ttcagcgacg ggccgatccc 2401 cagcccctcc gcacgcagca gcgcctcatc ctggtccgag gagcctgtgc agtccaacat 2461 ggacatctcc accggccaca tgatcctgtc ctacatggag gaccacctga agaacaagaa 2521 ccggctggag aaggagtggg aagcgctgtg cgcctaccag gcggagccca acagctcgtt 2581 cgtggcccag agggaggaga acgtgcccaa gaaccgctcc ctggctgtgc tgacctatga 2641 ccactcccgg gtcctgctga aggcggagaa cagccacagc cactcagact acatcaacgc 2701 tagccccatc atggatcacg acccgaggaa ccccgcgtac atcgccaccc agggaccgct 2761 gcccgccacc gtggctgact tttggcagat ggtgtgggag agcggctgcg tggtgatcgt 2821 catgctgaca cccctcgcgg agaacggcgt ccggcagtgc taccactact ggccagatga 2881 aggctccaat ctctaccaca tctatgaggt gaacctggtc tccgagcaca tctggtgtga 2941 ggacttcctg gtgaggagct tctatctgaa gaacctgcag accaacgaga cgcgcaccgt 3001 gacgcagttc cacttcctga gttggtatga ccgaggagtc ccttcctcct caaggtccct 3061 cctggacttc cgcagaaaag taaacaagtg ctacaggggc cgttcttgtc caataattgt 3121 tcattgcagt gacggtgcag gccggagcgg cacctacgtc ctgatcgaca tggttctcaa 3181 caagatggcc aaaggtgcta aagagattga tatcgcagcg accctggagc acttgaggga 3241 ccagagaccc ggcatggtcc agacgaagga gcagtttgag ttcgcgctga cagccgtggc 3301 tgaggaggtg aacgccatcc tcaaggccct tccccagtga gcggcagcct caggggcctc 3361 aggggagccc ccaccccacg gatgttgtca ggaatcatga tctgacttta attgtgtgtc 3421 ttctattata actgcatagt aatagggccc ttagctctcc cgtagtcagc gcagtttagc 3481 agttaaaagt gtatttttgt ttaatcaaac aataataaag anagatttgt ggaaaaatcc 3541 agttacgggt ggaagggaat cggttcatca attttcactt gcttaaaaaa aggaatcc // LOCUS HSIFD3 1626 bp DNA PRI 17-DEC-1994 DEFINITION Human gene for leukocyte (alpha) interferon H. ACCESSION V00533 J00215 NID g32635 KEYWORDS interferon. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1626) AUTHORS Lawn,R.M., Adelman,J., Dull,T.J., Gross,M., Goeddel,D. and Ullrich,A. TITLE DNA sequence of two closely linked human leukocyte interferon genes JOURNAL Science 212 (4499), 1159-1162 (1981) MEDLINE 81201124 FEATURES Location/Qualifiers source 1..1626 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 346..1348 /note="messenger RNA LeIF H" CDS 415..984 /note="reading frame LeIF H" /codon_start=1 /db_xref="PID:g32636" /db_xref="SWISS-PROT:P01570" /translation="MALPFALMMALVVLSCKSSCSLGCNLSQTHSLNNRRTLMLMAQM RRISPFSCLKDRHDFEFPQEEFDGNQFQKAQAISVLHEMMQQTFNLFSTKNSSAAWDE TLLEKFYIELFQQMNDLEACVIQEVGVEETPLMNEDSILAVKKYFQRITLYLMEKKYS PCAWEVVRAEIMRSLSFSTNLQKRLRRKD" BASE COUNT 534 a 305 c 282 g 504 t 1 others ORIGIN 1 ggtcccagtc ccatttggtc attcttaaca ctatgttaaa taaaaagatt aaactttaca 61 ctccttataa atagatatgt acagtatatc aacaaatata tggtatgtct gtgttattaa 121 aatttaatgg gactttgaat ttagaaagaa atttctaaaa agcccatggg gngggaaaga 181 tgaggtaata ctgaaaataa aagtggttga gaaactgctc tacacccatg tagacaggac 241 ataaaggaaa gccaaaagag aagtagaaaa aaacatgaag acgcttcaga aaatggaagc 301 tagtatgttc cttatttaag acctatgcac agagcaaggt cttcagaaaa cctacaaccc 361 aaggttcagt gttacccctc atcaaccagc ccagcagcat cttcagggtt cccaatggca 421 ttgccctttg ctttaatgat ggccctggtg gtgctcagct gcaagtcaag ctgctctctg 481 ggctgtaatc tgtctcaaac ccacagcctg aataacagga ggactttgat gctcatggca 541 caaatgagga gaatctctcc tttctcctgc ctgaaggaca gacatgactt tgaatttccc 601 caggaggaat ttgatggcaa ccagttccag aaagctcaag ccatctctgt cctccatgag 661 atgatgcagc agaccttcaa tctcttcagc acaaagaact catctgctgc ttgggatgag 721 accctcctag aaaaattcta cattgaactt ttccagcaaa tgaatgacct ggaagcctgt 781 gtgatacagg aggttggggt ggaagagact cccctgatga atgaggactc catcctggct 841 gtgaagaaat acttccaaag aatcactctt tatctgatgg agaagaaata cagcccttgt 901 gcctgggagg ttgtcagagc agaaatcatg agatccctct ctttttcaac aaacttgcaa 961 aaaagattaa ggaggaagga ttgaaaagtg gttcatcatg gaaatgattc tcattgacta 1021 atacatcatc tcacactttc atgagttctt ccatttcaaa gactcacttc tcctataacc 1081 accacaagtt gaatcaaaat tttcaaatgt tttcaggagt gtaaagaagc atcatgtata 1141 cctgtgcagg cactagtcct ttacagatga ccatgctgat gtctcctttc atctatttat 1201 ttaaatattt atttatttaa ctatttttat tatttaaatt attttttatg taatatcatg 1261 tgtaccttta cattgtggtt aatataacaa atatgttctt catatttagc caatatatta 1321 atttcctttt tcattaaatt tttactatac aaaatttctt gtgtttggtt attttttaag 1381 ataaaaagtc aagcctgact gtacaacctg atttcaaaat agatgattta atcaagttac 1441 ctatcataat tttattcaag ttatagaaaa atcaattttc tataccaggt tatatgttgc 1501 cttaaggatg taaacatgaa tataaaaaat acagctcctg ttctcttgta tctttgattt 1561 ttgtcaggaa ataaatctaa aaacaataat aatgctgaat taatatcagt tatacaaact 1621 gctgta // LOCUS HSIFD6 997 bp DNA PRI 17-DEC-1994 DEFINITION Gene for human fibroblast interferon beta 1. ACCESSION V00535 NID g32639 KEYWORDS interferon; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 997) AUTHORS Lawn,R.M., Adelman,J., Franke,A.E., Houck,C.M., Gross,M., Najarian,R. and Goeddel,D.V. TITLE Human fibroblast interferon gene lacks introns JOURNAL Nucleic Acids Res. 9 (5), 1045-1052 (1981) MEDLINE 81198952 FEATURES Location/Qualifiers source 1..997 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 15..860 /note="messenger RNA" mRNA 15..853 /note="messenger RNA" CDS 97..660 /note="interferon beta 1" /codon_start=1 /db_xref="PID:g32640" /db_xref="SWISS-PROT:P01574" /translation="MTNKCLLQIALLLCFSTTALSMSYNLLGFLQRSSNFQCQKLLWQ LNGRLEYCLKDRMNFDIPEEIKQLQQFQKEDAALTIYEMLQNIFAIFRQDSSSTGWNE TIVENLLANVYHQINHLKTVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYLKAKEYS HCAWTIVRVEILRNFYFINRLTGYLRN" BASE COUNT 300 a 200 c 201 g 296 t ORIGIN 1 ggccataccc atggagaaag gacattctaa ctgcaacctt tcgaagcctt tgctctggca 61 caacaggtag taggcgacac tgttcgtgtt gtcaacatga ccaacaagtg tctcctccaa 121 attgctctcc tgttgtgctt ctccactaca gctctttcca tgagctacaa cttgcttgga 181 ttcctacaaa gaagcagcaa ttttcagtgt cagaagctcc tgtggcaatt gaatgggagg 241 cttgaatact gcctcaagga caggatgaac tttgacatcc ctgaggagat taagcagctg 301 cagcagttcc agaaggagga cgccgcattg accatctatg agatgctcca gaacatcttt 361 gctattttca gacaagattc atctagcact ggctggaatg agactattgt tgagaacctc 421 ctggctaatg tctatcatca gataaaccat ctgaagacag tcctggaaga aaaactggag 481 aaagaagatt tcaccagggg aaaactcatg agcagtctgc acctgaaaag atattatggg 541 aggattctgc attacctgaa ggccaaggag tacagtcact gtgcctggac catagtcaga 601 gtggaaatcc taaggaactt ttacttcatt aacagactta caggttacct ccgaaactga 661 agatctccta gcctgtgcct ctgggactgg acaattgctt caagcattct tcaaccagca 721 gatgctgttt aagtgactga tggctaatgt actgcatatg aaaggacact agaagatttt 781 gaaattttta ttaaattatg agttattttt atttatttaa attttatttt ggaaaataaa 841 ttatttttgg tgcaaaagtc aacatggcag ttttaatttc gatttgattt atataaccat 901 ccatattata aaattgccag tacctattag ttgttctttt taaaatatac ctgcaaagta 961 gtatactttg gttcctgcct taaggaattt aaaattc // LOCUS HSKI67 14041 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens DNA for Ki-67 antigen 5'-region (exon 1 & 2). ACCESSION X94762 NID g1944550 KEYWORDS Ki-67 gene; monoclonal antibody. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14041) AUTHORS Gerdes,J. TITLE Direct Submission JOURNAL Submitted (08-JAN-1996) J. Gerdes, Molecular Immunology, Forschungszentrum Borstel, Parkallee 22, D- 23845, Borstel, FRG REFERENCE 2 (bases 1 to 14041) AUTHORS Gerdes,J. TITLE Sequence of the human Ki-67 protein gene 5' and promoter region JOURNAL Unpublished FEATURES Location/Qualifiers source 1..14041 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa S3" /clone_lib="HeLa S3" /clone="EMBL3" /chromosome="10" /map="q25" mRNA join(11797..11903,12245..>12430) /gene="Ki-67" exon 11797..11903 /gene="Ki-67" /number=1 gene 11797..12430 /gene="Ki-67" intron 11904..12244 /gene="Ki-67" /number=1 exon 12245..>12430 /gene="Ki-67" /number=2 CDS 12335..12430 /gene="Ki-67" /codon_start=1 /product="monoclonal antibody Ki-67" /db_xref="PID:e220305" /db_xref="PID:g1869801" /translation="MWPTRRLVTIKRSGVDGPHFPLSLSTCLFGR" BASE COUNT 3477 a 3189 c 3327 g 4048 t ORIGIN 1 ctgcaggtcg actcaaaaca tgctgaatcc accattccct ttggcttgac ccaatcttgg 61 tatccaaata atttcagctt tagcttagtt gaggcttctt tgtattctgt ccttaaaaac 121 cttccatcct acaattaatg cacacttaac ggccaaggta cacagagtcc aactggcaac 181 tacagactaa attgccaaag cttcactgtt ttgagaggac ttctgacaac caactgtact 241 tatgataata ctcagaccgc aataatattt tatctacaca attaccagtt tggtgattta 301 tagaacgccg gatgctatta tcgagtttaa ttgactgtat ggaaccctag tctttacttg 361 atgtaattaa atgttaatat acttataaaa actgggtaca taaaccaata tctgtcagcc 421 agtcctaaat gttccttggg cttctgcatg catagcattt tgcagcagaa agatgatggc 481 ttttggagaa ggaaaacagt cagaatccac aactgtaggg aatatggatt ggattcttaa 541 cttgctggat tttcagttta tttatcttta aaatggtgat gattaattct tggattacaa 601 gatggttgtt aagattaagt aagattacat ggaatgtctg gcctgatgcc tagcaaagat 661 tagctgctca ataaggatcc atttccatac ccaccctagg cacatgttta ttgcatgcct 721 aacatcttca aggcattatg tcatgtcaag agccgagatg ggccccttac ctgctttact 781 tctctccctg ttttcattgt tgatctcccc tactagtatg tgagctccat ggagaaagag 841 gtcatagctt tcaatcactg ctttatccat gatagctaga ataatgtcta gcatacacca 901 gacacttgat ctagcattaa tgaatgaaat ccagatatgg gttgaataaa ttcagcgagg 961 gtaataagga ggagtgggaa tttcatgcaa gagacattat cataaagttg agcaggtttg 1021 tgctttggga tcaagttgaa agtaattcta atcccgactc tgtcctttat aagctgtgca 1081 gccttgcaga agtgacttaa cctctctgag cctcaggttc caattttgca aaatcttgtc 1141 ttgaatccta aagaacaact cacagggctt ctctaaggac aaaataaggc agtggttgga 1201 aagtacctgg catctaatgg ttccggatta ttgagtggcc accaaaggct gattttccca 1261 gggtagaact tgagaaagct ttgcatgaca tcaggatgtt ctcttctggg gattaccggt 1321 gtcttctcct ccctctgatc ctcactgggc caaggtggga gcacctggct tctgtctctc 1381 ctgagctgct ttaagccagg tcactaatgg agctgagatg tggagtgggg tcaaagggca 1441 ctccagagtt ccacagagag ccatgtgtgg accatgtgac tctactgacc ttgagccatc 1501 aggacagtgt actcactgtt ccaggaagtg gaacttgctg ggggaggcct gccaacagag 1561 gctctggacc ttagttcttt tccttttttt gtttcctctg aaaaaaattg tatgttaaaa 1621 ataagtcact ttcagaagaa gaggttttta gaaagccaga tgttcctgta gtggtagcaa 1681 cttttgggac tagcaaaaat aagctgccgt catccatctc ctggtttcct gcatgccctg 1741 ggtgacagct attagattgc cttcaggcct ttggaaggaa gaatgagtaa actttttcca 1801 tcaaaggcta gatagtaaat atttcagtct ttacagacca tgtggtctcc atcacagcta 1861 ctcaactctg ctgctgtagc tgggaagcag ccgtagatga tatgtaggca aataaacatg 1921 gctgtgtttc aataaaactt tatttacaca aacaggaggc aggctaggtt tctagttcat 1981 aggtcatact ttgctgtact ctgtgttaaa acttcctgat ttggagaaag acattggttt 2041 ctctatttat ttttctgctt cctttcctta atggggcagg aggagtgaca cccagagagg 2101 agggggcatg aagcagccca tgggggtctt gtggggttgg tgtttcttta cacagaactg 2161 tgctaccact gtggtcaagt cgatgtattc tagcaatcgt ggcaagccag aacagaagac 2221 acaaactggt tccctaaaca agacagagac cgtgggggac gtgccaggag cttccacaga 2281 gacaggtatg ggattgcggg tgttaagtat atgtcattga gtatctaagt gtagaaattt 2341 atcagtgtgg acatacacaa gtgtttcatt tcttctttta aatttatgtt tataaagaag 2401 ggtgtgtgtg tgtgtatgac attctgtgat gggagtaata attactaatg tttgttgaga 2461 gcttgctatg tgccaggcac tgtgctgtct tacatgttat ttaatcctca tgacagctta 2521 tcaggaatgt acttttgggg tggacagacc caacaccagg tcgtgagggt gatgaagtcc 2581 ggtggagtca gaggaatgag aaaagacagt ttgagagaga aagtgtgtcc agggggccaa 2641 cgcaagtatg gaggctgcaa agaccccaag ctctggaagc ccagactatt tattggtgat 2701 caaacagaga acaggtgttg agaatgtggg ggtcgaaagg gcaagtgcat gatctacagc 2761 tgtgacggct tagcatttcc tttgcagcat atggaacatg ttctgctgct tgagatagtg 2821 gagagcggtt cttttaactc aagatacaat cagtcctggg agagcaagga gccagcaagt 2881 ctagacacat tccagaggcc acgaggggtt ttatgccttg agccctggat tctttccaag 2941 ccacgagggg ttttatgccc tgggcttagg ttatggtgct tcagggtagg cttctaccct 3001 ttagcacaga gcttggtgtt ccaaaggcca cgaggggttt tagaccctgg accccggaca 3061 tgttccaaga ctcttttaca gtatgtcaga catgcaagcc ctgcctcagc ttctcccaac 3121 actcagcttt tctcccaaca atgtactatt attatccaca tttcatggat gaagaaactg 3181 aggagggccc accatgcaga tttcatctcc tgtcttctat gtgatatggg ctctgggtgg 3241 cattaccact tttgatgcta cgggaatgaa gtgtgatggg ggacatggtg aactgaagca 3301 agattttagc cagtcagaac tcaaggatgt ggctgtgatg aaaggaagtg ctggaaaggg 3361 gtagagagat gacaggactt gggcattcat accggggaca agccaggcat ccatgtttgc 3421 taggagtctc tgcctgctca ccatcaggga ccactcaaga gatagcacct gaagacagct 3481 gattataact tattactgag gaaggggcct cacacacacc tgactgaatg ggaaggacct 3541 gggtttagcc cctggctctg tgaatataac ttagcttgag ccaagctggg ctttgagatg 3601 aaccaatctc agaattttct tctaccacct gctagctctg tgactctcag aatgtgacct 3661 cacccccact ttgatggatt tacagggcaa tttgcctgtt gcaatgcttt gcggaatatc 3721 tctggccatt tcccagctgt gatatttggg tgatgttatt taactcttgg tttcatcttg 3781 cataaagtgt ggatagtgac agtacctaac tcatggggct cttttgagga ttgaatgagg 3841 taatccatgg aaatggacta gcacggtgca tagcacatag ttcccagtac atgcaagctg 3901 ctatcgttat tgttcccatt ttaaagatga ggaagtggag gctcagaaat tttcttctga 3961 ctcatccaaa gtcacaaaga aagtgccata accaaggcat gcatcatcct ctgataccaa 4021 atcttgtgct tctgacactt catcatattc tatttcattc attcattcat gagcatttaa 4081 agagtttccc tcaagctctc aagagttaaa gtactaggct ctttgatgag tctagtgaat 4141 aaaagggatg cacacttcct gccctggtag acttaggctg gtagtgggag acagaaaata 4201 agtaaatgag taataaacat tttagaatgt aactcaaagg gtttgagaaa cagcctcgtc 4261 aagcataaac aaggtggtat gatagagaat gactgaagaa gggaggaggc tgtttgcatt 4321 tgattttgat gtggaagatg cccagagaca gccacatcac gtgctagggc aagagtgttc 4381 catgggggaa cagcaagtgc caagaccctg agaccgtgaa gagcttggct tgggcttctg 4441 caggcttcag acgctgacac agctgccagg aaccaggttt aagcctgtcc ttctgtgtcg 4501 agaaaacctt tcctcacctt ttcccgactg atctgaagtt tctctgagac gggggatgac 4561 ccatggccat ctgtttcttt cgctgctcct caaataaaaa tggaaaagat agcactgttg 4621 cgtgagaaaa gatgaaagat gctattctgg catctgagtg ggcagagctc tgggctgggg 4681 ccccctttct ttgtagcctt gcccttttta tgcctgtttg tccacctgta aaattagcag 4741 ctgggccaga ccgtcccttc tctctttaaa tttgatagag cattggaatc cctgcatctg 4801 ctcccatgct gtttttggga aagaagactc actgggttgg gcgggaccac ttttcaggct 4861 ggttggactc tccagctggc ctccaacttg aagcatcttc actcgtttca actgaaggtg 4921 actgttctct ccccctgttc tctcagcctc tcctcggcag gttgaggctg gcgctgaccc 4981 aacagagggc ctccttcttt catcgcactt tctccttggt cacagtgcat ctcacggtgt 5041 ctgctcacaa actggtgcct gggaaggctg gggcccgtga gaagacagaa tctgcttgcc 5101 ataccccatg atgttctttt ttcataatgt acaaatgcca cccaggcaga ggcccatagc 5161 agaccatgta gagaagatgg tctgaacaga ggctcttatt ttcagggaac tcttggtact 5221 ttttgtgtcc cagcaaatgt agctaggtac catgaaggga ggacataatt gtagtcccat 5281 aggatgtcag aagtcaaagg attcttagaa ggtgggggac agtgcctata atcccagtgc 5341 tttggaaggc ccaggaggga ggatcactta gggccagcag tttgagacca gcctgagcac 5401 ataggaagaa cccatctcta caaaaaattt aaaaattagc caagtttggt ggtacatgcc 5461 tgtagtccta gctacttggg aggctggcac aggaggattg cttgagccca ggagttccag 5521 gcttcaagtg agctatgata tgacactgca ctccagcctg ggtgacagag caagaccccg 5581 tctcttaaca acagcaacaa aaaggttctc ggagagggtt caacttgctg cttctgtcac 5641 acatggagaa atgctggccc tagagtctac atgacttatt caaagtcaca cagtagtaag 5701 gggtgacaga agctgggatt tcccatgcag accaatagca taccacaatt agccctgatg 5761 atctataagg gatatgatgg gcagactcag atacagtttc tttaaaacat tttgtttgtt 5821 tggtgagatt atgccagctg accacatgta atacatctgc cttttccatt tctaatttat 5881 agattgggat taaatattaa tttaacacac aacttggcac tgactacgtg ccaggtacta 5941 ttccaagtgt ttacaaatat taaattaccc aatcctcata acagccttaa gagttaggca 6001 ctattatcct agttttacag atggagaaat gaagcacata gaagttacat aagttggcca 6061 aggtcatata actaagtaaa aaaaaatagc agagcttaga tgctggctct ggagttcata 6121 gctttaactg ctaccttgtt ctgtgtctcc ctgggcaaag aacatcatcc accacccaag 6181 gtgggcattt ctgggggctg cttaacctca ttgaactctt tatttaatat gtgagaaaac 6241 tggagaccca aaagggcatg aaggcatcag atttggttct agggcaccgc cgtttctgtt 6301 acccctgggc ttttcccact gtactgtgct gccttggaag tcgaaacacg tctctgatct 6361 agaagacacc cagcagcaag ttcattattc tcccttgaag ttccttggct aacttccacc 6421 tttgttgttt tcattcctgg atttaatata atgggccaca ttaacctgag ttgctgcaca 6481 aagtctcata gaaaaagaac caaatgaaca gatcagacac tgttttcttg gtccttgggg 6541 aatgctcgag ccatcagaga gacatcatct tatttataat aaattcactg ctcaaagagc 6601 tttaaaacat ttttttatat taggagtatt gtttgtatct gttgagtcca tgaatgtatg 6661 acattggatg actttataaa acattctact aattggattt gttaaagcaa ataaatcaaa 6721 cactaattgg aacgtgccct gccattattt ttgcttactg aattagatgt aatggtgtga 6781 aggccttctt accatatttg tgcacttatt taaacacccc cccggcacac acttctccaa 6841 attcaattat atgttcagaa aaggaattca atttatggaa ggctggttta cattcattcc 6901 gggggcaaga atcagctacc cacgataggg tgttcacaat taataagccc cacttagata 6961 ttaccctgcc ttcaaagcat ttgttggcac gagggtggaa tctccatgat ttgggggttg 7021 aggtatttta tctttggaat atgtgtccac agagcaggat tggcttcaag ggcactgagt 7081 catacaggcc ctggaactca gaagagaaga gcccgtgctt ggtttaatgc tctgctatca 7141 ccatcttgat ttctattttt tttttttttt tttttgagac agagttgcgc tcatgttgct 7201 tagattggag tgcaatggca tgatctcggc tcaccacaac ctccgcctcc caggttcaag 7261 cgattctcct gtgtcagcct cccgagtagc tggaattaca ggcatgcgcc accacgcctg 7321 gctaattttg tatttttagt gaagacggga tttcaccatg ttggccaggc tcggtctcaa 7381 actccagcct caggagatcc acccgcctag gcctcccaaa gtgctgggat tacaggcatg 7441 agccatcgcg cctggcacca tcttgatttc ttaattttga agaagagact ctgcatttta 7501 gtcttgcaag gggcgctcca aattttgtag gtggacctat cttggaggtc gccttggttg 7561 tcagaccttc tggccaagaa acaaacccat gattggagtc ttggctcagt ccctgatctg 7621 acgctgagtg catcatccac tcctgcctca gtttccttat ctataaagtg gaagtactac 7681 tccttaccaa tgaagaacac aagcttgtag tgtagatgaa gtgtttaatt aaatctaaag 7741 agttagtgtt ctttactcaa tcccttatcc gtagggctag cagacaatgg gagacgtgtt 7801 cctccctcca gttcccagag acgggaagaa acattctgca atcgagtctt tggaggagag 7861 ggtcagtagc acatggcata cattcaaatg gcaatacacg ggactgtatt aaaattcagt 7921 gactgctaaa actgggaatt tttatgtcgt taaagagaag tcagccagct atagcctacc 7981 agccaaatct gagtgttcac tggtttttat atagcccaca aataaggaat ggtttttaca 8041 tttttaaatg gttaggaatc aaaagaagaa tactattttg ggacatgtga aaatcatatg 8101 aaatgcaaca tgcagtgcta ataaataaag tcttattaga acacagtcat gcccattctt 8161 tttgtattgt ttctgtacaa caataacatt gttgagtact agtgacaggg aacatgtggc 8221 cccctgtgat gggtgggatt ctaagacagc ccccaagatt cctggcctct ggggtacaca 8281 tcctgagtac ttcccactcc ttgcgtggac tgtgcatgtg atggatttca ctgtcttgat 8341 tagcttatct tagtgaatga caaaggtgaa ggaatttgcg atgtctcccc catgtgactg 8401 tgtttggaga caaggcctat aagaaggtta aatgaggtta tatgcatgag accttagttc 8461 cattgggctg gggaagatgc atcagagtac acaggcacca aggaaaggct gtgtgaggac 8521 acagtgagaa ggcggccatc tgccagccag gaagcaagcc ctcacctgac cctgcccaac 8581 tttggtctga gattcctggc ctccagaact gtgtaaaaat aaatgtctat tgtttaagtc 8641 acacagtttg tggcattttt ttacagcagc cctagcaggc taatacaacc tgcaaagctg 8701 aaaatattta ctatctagcc ttttatagaa aaagtttaac aaccccccag ttaaaaaata 8761 actgctttga ccacattatt tttctgtgcc ctgtaaccat ttattcttgc tacatcttga 8821 atctgtctga acagctcaca tgtgtgagct cttcacatac cagcgattac tatgtgaggc 8881 ccacaccacc ccctgttctc tgcatgtgcc tgccttcctg actttaagcc ccctactcca 8941 tttcttctcc tccctataac actatggcct ccgttatgag gtgaaggggc agagcctacc 9001 ccctatgcct ggtcctccag aagtgcagat tggcatcctg ggcctgggca gtgaatgggg 9061 gcagaaggca tgcagactgg ctgaatgcca tttttacagg gaattgggga agaactgtgc 9121 agccacttat ttccctcaaa caggttcctt tcgaggtcaa ggttatcttt tcagttcatc 9181 accatggcaa cctagtataa tgagtaggta catagactga gctagtctgg gttcaaagcc 9241 acgctttgcc acctgttatt tgtgtgatct tgggatagtt ttttttttta acctttctat 9301 gccttcattt ctttgtctat aaaattgggg ttaatgatca tatccatctc gtggggtgat 9361 tgggataatt aaatgagtta ctactggtga acactgagaa cagtgctggg cactgaacat 9421 ggttatcttg actatctctc agatctagtt gggatctgtg tctgctcagc agctttctgt 9481 ccccgtctct ttgaccttgc attgtgatta cttggtcctt gcaaaccgtt atcaagctca 9541 gctcacagtt tactccttag cagtctcagc tgtcattctg gtcctctcgc aaatactctc 9601 atttctcctt gaatgacttc tccacctgtc cgtggtctca acttactctt ctccatttcc 9661 tggtggcctt atggcgcttc cttagatcaa ccttggactt cccttctttc tcggattcag 9721 aatctccatc tgttttggcc cttcatgtga tcagagcaaa tgttcttttt tttttaattt 9781 tttaattttt aattttttat tattattata ccttaagttt tagggtacat gtgcacaatg 9841 tgcaggtttg ttacatatgt atacatgtgc catgcaggtg tgtggcaccc attaactcgt 9901 catttagcat taggtatatc tcctaatgct atccctcccc actcccccca ccccacaaca 9961 gtccccagag tgtgatgttc cccttcctgt gtccatgtgt tctcattgtt caattcccat 10021 ctatgagtga gaacatgtgg tgtttggttt tttgtccttg cgatagttta ctgagaatga 10081 tgatttccaa tttcatccat gtccctacaa aggacatgaa ctcatcattt tttatggctg 10141 catagtattc catggtgtat atgtgaccaa cattttctta atccagtcta tcgttgttgg 10201 acatttgggt tggttccaag tctttgctat tgtgaatagt gctgcaataa acatacgtgt 10261 gcatgtgtct ttatagcagc atgatttata gtcctttggg tatataccca gtaatgggat 10321 ggctgggtca aatggtattt ctagttctag atccctgagg aatcgccaca ctgacttcca 10381 caatggttga actagtttac agtcccacca acagtgtaaa agtgttccta tttctccaca 10441 tcctctccag cacctgttat ttcctgactt tttaatgata gccattctaa ctggtatgag 10501 atggtatctc attgtggttt tgatttgcat ttctctgatg gccagtgatc atgagcattt 10561 tttcatgtgt cttttggctg cataaacgtc ttcttttgag aagtgtctgt tcatatcctt 10621 tggccacttc ttgatggggt tgtttttttc ttgtaaattt gtttgagttc attgtagatt 10681 ctggatatta gccctttgtc agatgagtag gttgcgaaaa tttctcccat tttgtaggtt 10741 gcctgttcac tctgatggta gtttcttaaa ggtcccagcc tgatttctga gttcttcgct 10801 cttactcccc tgcaaggcct cccctgggtt caccgtggcc cccaggagaa gcccaatgac 10861 agtggatgag agctttctct gtgggcccca gcctggctcc tctacatgaa tggaggtcct 10921 ttttaatata ttcaggcatt cactgaacaa taaagtgcaa gcatatacta tgggacagac 10981 tctgaggaaa gagacccttg tggcatgtcc ctgttgctat ggagctttag ttggagggac 11041 gcagtcctgt ggtctcagtc ccttccccca ccccagggtc cccagtgctg acgtctcccc 11101 caagaatcat gagaatgtat gtggagtgag ctactaggac tttcagcacc agctatggat 11161 cttcctgctt cagcttccca aagtgcgggg attaccggcg tgagccaccg cgtcccgcct 11221 gtttttaatt tttagcagtt ttaaatcttc tggcaatgag taatgttata gtgtcccagg 11281 tgtttggtcc tttaagaaaa ggatagcagg acgggttatc tcggtgggag cgtgttccac 11341 ctctgccctc cgccagccgc cctcgccggg gatgcaccca ggtattttcc tccggatgcg 11401 tgagtggctc gcccgcggac acgccagccc cgccccgcga gcccggcttc cccgccccct 11461 ccccgcccac gccctgcgct cccttcctat tggtcccatg ccgcgctttc ccgttcaatc 11521 gcagcgctta gcgccagaat ttgaatcttc gttttcgttt gaattgggcg ggcgcgccgg 11581 gctggaagaa ggaagtggag ggctgacgct gcgggcgggc gggaggactc gactcggtgg 11641 gagccgctag agccgggcgc ccggggacgt agcctgtagg gccaccgggt ccccgtcaga 11701 ggcggcggcg ggagcagcgg ggactgcagg ccggggtgca gcgaacgcta ccccgcgggc 11761 tgcggcccgg tgtgtgcgga gcgtggcggg cgcagcttac cgggcggagg tgagcgcggc 11821 gccggctcct cctgcggcgg actttgggtg cgacttgacg agcggtggtt cgacaagtgg 11881 ccttgcgggc cggatcgtcc caggtgagct gcggccggga ctcctgggag ctgtccgggg 11941 tcgagggctg agccgcgggg accccccgag ctctgcgggg acgggcaggg gacagacgcg 12001 cggcctgggc ccgactcctc ctgggctctg gcgagggcgt ctcggtggaa gctcccagga 12061 ggcgcaggcg ctggcgacaa caccccagct gccggacttt ggggcgccgg gggctgcggt 12121 cggatcgtcc tggggtcccc gctagctagc tgcggtgtgc gcttctgtgg ccggggatag 12181 gtgaattggg cacggggccc ttcttagctc ctttgctatc agagtaactc gcacctcttt 12241 tgcagtggaa gagttgtaaa tttgcttctg gccttcccct acggattata cctggccttc 12301 ccctacggat tatactcaac ttactgttta gaaaatgtgg cccacgagac gcctggttac 12361 tatcaaaagg agcggggtcg acggtcccca ctttcccctg agcctcagca cctgcttgtt 12421 tggaaggtga gccccgcggg cgcgcgcgga cgttttagct gagaaagagg aaaatacctt 12481 agtcacagaa taaagtccag aaacgcgctc taggattggg tcctgccgtc acttttcctt 12541 ggtgcttctc ccattcgtta ctaagttgac atagttgtgt ttttttgttt tgtaagtata 12601 aatttgatgc tagtttgtat gtttaagtgg ttttaaaaat caagccaatt aaaaaaatcg 12661 atttgctaat gttgcggtaa agaaagatgt agatgatctt catatgtcac tggctgcagg 12721 caggcgtctg aagacactgt gcgcccccgg gtgcctccac agtgggcatc cctggccact 12781 ggggacacag agaatgaagg aaggaagcca tacacttgcc tcttggctcc ttgtggcaat 12841 aggaaaatgg gacagaaagt cttcctgcct ggaattcgag aacgtttcct cttatattgc 12901 tgtcctgttt ggtggtggta ataaccctgt ccctgtataa caggtatagt tgctgacagt 12961 gccccatcag cctctgacat gtgtcccact agaggcctga gaaggggggt catttgcccg 13021 tgtggtcatt tcccagacca cacgggattc aggcctcacc ctgtcctccc aaccccatcc 13081 cggcctcacc ctgtcctccc aaccccatcc cctaccttgt tcctcccgcc taaggcattc 13141 caagcctttg ttcaccttgg tacttcttac acacttggaa gtttacaatg ggaaagttct 13201 ccccttcctc aagcttgtgt gatttccatc attcaggcat caggtgaaat gtcacctctg 13261 aggaagcctt gcatgaaaac ttgtatttcc aaccccacag cttcaggggt gagttgtggg 13321 tttgtctccc actaggctga gctccccaaa ggcaaagact gtcttgttac taatcacgtg 13381 tgtagggcca gggatggtgt ctggcatagg gggtgtgatc aatacctagc attctggcaa 13441 gggtcagtag aaacatgggc tggagccatg gatttgcaat ctcttgagaa cagtggttct 13501 cacaggaggg tcatctaacc ttccaggatt catttgaatt tccatgtcac aatataggac 13561 aggggcatgc gctgtcatcg cggaggagag gtcagggtgg gcatcctacc aggctcaggg 13621 cagccccaca atacagatgc atctggccca tgatgtcaac agtgccaaag ctcagaaaca 13681 cccatctata gtcatcctcc tgccaaacaa attctcagtt gtaagggata gtctttcctt 13741 tgctctgatg tcctgttacc ttaaaatcaa attattttat tggggttaag gaagcttttc 13801 actttccata gatacctttc tcttgaaaag gaaaaatata aacatttcat ctccagtggc 13861 agtccttttg cttttttata cagtactcct tgaatatatc ttcatgcaag attttataat 13921 ttagaaataa ttcctaagtg tttggtcaca tgacctgggt aggaagagag attcttagac 13981 tccaaaggtt cagatggaga acagacaggc cattgcatag tttattcaaa tttttaagat 14041 c // LOCUS HSMACHR 1386 bp DNA PRI 12-SEP-1993 DEFINITION Human gene for M1 muscarinic acetylcholine receptor. ACCESSION X52068 NID g34450 KEYWORDS acetylcholine receptor; G protein-coupled receptor; M1 muscarinic acetylcholine receptor; muscarinic acetylcholine receptor; receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1386) AUTHORS Chapman,C.G. TITLE Direct Submission JOURNAL Submitted (08-MAR-1990) Chapman C.G., Smithkline Beecham Pharmaceuticals, Biosciences Research Centre, Great Burgh, Yew Tree Bottom Road, Epsom Surrey KT18 5XQ, U K REFERENCE 2 (bases 1 to 1386) AUTHORS Chapman,C.G. and Browne,M.J. TITLE Isolation of the human ml (Hml) muscarinic acetylcholine receptor gene by PCR amplification JOURNAL Nucleic Acids Res. 18 (8), 2191 (1990) MEDLINE 90245684 COMMENT Data kindly reviewed (05-APR-1991) by Chapman C.G. FEATURES Location/Qualifiers source 1..1386 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" CDS 1..1383 /note="M1 muscarinic acetylcholine receptor (AA 1-460)" /codon_start=1 /db_xref="PID:g34451" /db_xref="SWISS-PROT:P11229" /translation="MNTSAPPAVSPNITVLAPGKGPWQVAFIGITTGLLSLATVTGNL LVLISFKVNTELKTVNNYFLLSLACADLIIGTFSMNLYTTYLLMGHWALGTLACDLWL ALDYVASNASVMNLLLISFDRYFSVTRPLSYRAKRTPRRAALMIGLAWLVSFVLWAPA ILFWQYLVGERTVLAGQCYIQFLSQPIITFGTAMAAFYLPVTVMCTLYWRIYRETENR ARELAALQGSETPGKGGGSSSSSERSQPGAEGSPETPPGRCCRCCRAPRLLQAYSWKE EEEEDEGSMESLTSSEGEEPGSEVVIKMPMVDPEAQAPTKQPPRSSPNTVKRPTKKGR DRAGKGQKPRGKEQLAKRKTFSLVKEKKAARTLSAILLAFILTWTPYNIMVLVSTFCK DCVPETLWELGYWLCYVNSTINPMCYALCNKAFRDTFRLLLLCRWDKRRWRKIPKRPG SVHRTPSRQC" BASE COUNT 280 a 462 c 384 g 260 t ORIGIN 1 atgaacactt cagccccacc tgctgtcagc cccaacatca ccgtcctggc accaggaaag 61 ggtccctggc aagtggcctt cattgggatc accacgggcc tcctgtcgct agccacagtg 121 acaggcaacc tgctggtact catctctttc aaggtcaaca cggagctcaa gacagtcaat 181 aactacttcc tgctgagcct ggcctgtgct gacctcatca tcggtacctt ctccatgaac 241 ctctatacca cgtacctgct catgggccac tgggctctgg gcacgctggc ttgtgacctc 301 tggctggccc tggactatgt ggccagcaat gcctccgtca tgaatctgct gctcatcagc 361 tttgaccgct acttctccgt gactcggccc ctgagctacc gtgccaagcg cacaccccgc 421 cgggcagctc tgatgatcgg cctggcctgg ctggtttcct ttgtgctctg ggccccagcc 481 atcctcttct ggcagtacct ggtaggggag cggacagtgc tagctgggca gtgctacatc 541 cagttcctct cccagcccat catcaccttt ggcacagcca tggctgcctt ctacctccct 601 gtcacagtca tgtgcacgct ctactggcgc atctaccggg agacagagaa ccgagcacgg 661 gagctggcag cccttcaggg ctccgagacg ccaggcaaag ggggtggcag cagcagcagc 721 tcagagaggt ctcagccagg ggctgagggc tcaccagaga ctcctccagg ccgctgctgt 781 cgctgctgcc gggcccccag gctgctgcag gcctacagct ggaaggaaga agaggaagag 841 gacgaaggct ccatggagtc cctcacatcc tcagagggag aggagcctgg ctccgaagtg 901 gtgatcaaga tgccaatggt ggaccccgag gcacaggccc ccaccaagca gcccccacgg 961 agctccccaa atacagtcaa gaggccgact aagaaagggc gtgatcgagc tggcaagggc 1021 cagaagcccc gtggaaagga gcagctggcc aagcggaaga ccttctcgct ggtcaaggag 1081 aagaaggcgg ctcggaccct gagtgccatc ctcctggcct tcatcctcac ctggacaccg 1141 tacaacatca tggtgctggt gtccaccttc tgcaaggact gtgttcccga gaccctgtgg 1201 gagctgggct actggctgtg ctacgtcaac agcaccatca accccatgtg ctacgcactc 1261 tgcaacaaag ccttccggga cacctttcgc ctgctgctgc tttgccgctg ggacaagaga 1321 cgctggcgca agatccccaa gcgccctggc tccgtgcacc gcactccctc ccgccaatgc 1381 tgatag // LOCUS HSMFH1 3289 bp DNA PRI 14-MAY-1997 DEFINITION H.sapiens MFH-1 gene. ACCESSION Y08223 NID g1869804 KEYWORDS mesenchyme fork head-1 protein; MFH-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3289) AUTHORS Miura,N. TITLE Direct Submission JOURNAL Submitted (18-SEP-1996) N. Miura, Akita University School of Medicine, Department of Biochemistry, 1-1-1 Hondo, Akita 010, JAPAN FEATURES Location/Qualifiers source 1..3289 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1197..2702 /gene="MFH-1" CDS 1197..2702 /gene="MFH-1" /codon_start=1 /product="Mesenchyme Fork Head-1" /db_xref="PID:e303016" /db_xref="PID:g1869805" /translation="MQARYSVSDPNALGVVPYLSEQNYYRAAGSYGGMASPMGVYSGH PEQYSAGMGRSYAPYHHHQPAAPKDLVKPPYSYIALITMAIQNAPEKKITLNGIYQFI MDRFPFYRENKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGS FLRRRRRFKKKDVSKEKEERAHLKEPPPAASKGAPATPHLADAPKEAEKKVVIKSEAA SPALPVITKVETLSPESALQGSPRSAASTPAGSPDGSLPEHHAAAPNGLPGFSVENIM TLRTSPPGGELSPGAGRAGLVVPPLALPYAAAPPAAYGQPCAQGLEAGAAGGYQCSMR AMSLYTGAERPAHMCVPPALDEALSDHPSGPTSPLSALNLAAGQEGALAATGHHHQHH GHHHPQAPPPPPAPQPQPTPQPGAAAAQAASWYLNHSGDLNHLPGHTFAAQQQTFPNV REMFNSHRLGIENSTLGESQVSGNASCQLPYRSTPPLYRHAAPYSYDCTKY" BASE COUNT 639 a 1125 c 925 g 600 t ORIGIN 1 gaattcggag gattaagttg tcagtcagca cgttgctacc ttcccctcta tgcactccgc 61 tgcctggctc ctcggcgggg agcgagggaa actcagtttg tagggtttac ctctaaaacc 121 tcgataggtt atccttgacg accccgagcc tggaaactcc ctgttgatga ttaattattt 181 gattaaataa gtataacatc caggagaggc cctgccattc caatccagcg cgtttgcttt 241 tgaatccatt acacctgggc ccccataatt aggaaatcta attattcgct tcatcactca 301 ttaataagaa aaatgtccca ggatcattgc tacttacaag gtctttggga gagatatttt 361 actctattaa tccattctat tttatatttc aaattgattt tttttaacag aggaaagtgg 421 ctatcttttt gttttgggca tgtgggccca ttcaccaaaa tgtgatcata aaataaattt 481 taataagata taacttttta aaaagttttc aagtgaagac ggagtcgccg cggaggccgg 541 ggcggcgggg tcttagagcc gacggattcc tgcgctcctc gccccgattg gcgccggact 601 cctctcagct gccgggtgat tggctcaaag ttccgggagg gggcgtggcc cgaggaaagt 661 aaaaactcgc tttcagcaag aagacttttg aaacttttcc caatccctaa aagggacttg 721 gcctcttttt ctgggctcag cggggcagcc gctcggaccc cggcgcgctg accctcgggg 781 ctgccgattc gctgggggct tggagagcct cctgcgcccc tcctcgcgcg ggccgagggt 841 ccaccttggt ccccaggccg cggcgtctcc gctgggtccg cggccgcccg cctgcccgcg 901 ctgccgccgc cgggtcctgg agccagcgag gagcggggcc ggcgctgcgc ttgcccgggg 961 cgcgccctcc aggatgccga tccgcccggt ccgctgaaag cgcgcgcccc tgctcggccc 1021 gagcgacgac gaccgcgcac cctcgccccg gaggctgcca ggagaccggg gccgcccctc 1081 ccgctcccct cctctccccc tctggctctc tcgcgctctc tcgctctcag ggcccccctc 1141 gctcccccgg ccgcagtccg tgcgcgaggg cgccggcgag ccgtctcgga agcagcatgc 1201 aggcgcgcta ctccgtgtcc gaccccaacg ccctgggagt ggtgccctac ctgagcgagc 1261 agaattacta ccgggctgcg ggcagctacg gcggcatggc cagccccatg ggcgtctatt 1321 ccggccaccc ggagcagtac agcgcgggga tgggccgctc ctacgcgccc taccaccacc 1381 accagcccgc ggcgcctaag gacctggtga agccgcccta cagctacatc gcgctcatca 1441 ccatggccat ccagaacgcg cccgagaaga agatcacctt gaacggcatc taccagttca 1501 tcatggaccg cttccccttc taccgggaga acaagcaggg ctggcagaac agcatccgcc 1561 acaacctctc gctcaacgag tgcttcgtca aggtgccccg cgacgacaag aagcccggca 1621 agggcagtta ctggaccctg gacccggact cctacaacat gttcgagaac ggcagcttcc 1681 tgcggcgccg gcggcgcttc aaaaagaagg acgtgtccaa ggagaaggag gagcgggccc 1741 acctcaagga gccgcccccg gcggcgtcca agggcgcccc ggccaccccc cacctagcgg 1801 acgcccccaa ggaggccgag aagaaggtgg tgatcaagag cgaggcggcg tccccggcgc 1861 tgccggtcat caccaaggtg gagacgctga gccccgagag cgcgctgcag ggcagcccgc 1921 gcagcgcggc ctccacgccc gccggctccc ccgacggttc gctgccggag caccacgccg 1981 cggcgcccaa cgggctgcct ggcttcagcg tggagaacat catgaccctg cgaacgtcgc 2041 cgccgggcgg agagctgagc ccgggggccg gacgcgcggg cctggtggtg ccgccgctgg 2101 cgctgccata cgccgccgcg ccgcccgccg cctacggcca gccgtgcgct cagggcctgg 2161 aggccggggc cgccgggggc taccagtgca gcatgcgagc gatgagcctg tacaccgggg 2221 ccgagcggcc ggcgcacatg tgcgtcccgc ccgccctgga cgaggccctc tcggaccacc 2281 cgagcggccc cacgtcgccc ctgagcgctc tcaacctcgc cgccggccag gagggcgcgc 2341 tcgccgccac gggccaccac caccagcacc acggccacca ccacccgcag gcgccgccgc 2401 ccccgccggc tccccagccc cagccgacgc cgcagcccgg ggccgccgcg gcgcaggcgg 2461 cctcctggta tctcaaccac agcggggacc tgaaccacct ccccggccac acgttcgcgg 2521 cccagcagca aactttcccc aacgtgcggg agatgttcaa ctcccaccgg ctggggattg 2581 agaactcgac cctcggggag tcccaggtga gtggcaatgc cagctgccag ctgccctaca 2641 gatccacgcc gcctctctat cgccacgcag ccccctactc ctacgactgc acgaaatact 2701 gacgtgtccc gggacctccc ctccccggcc cgctccggct tcgcttccca gccccgaccc 2761 aaccagacaa ttaaggggct gcagagacgc aaaaaagaaa caaaacatgt ccaccaacct 2821 tttctcagac ccgggagcag agagcgggca cgctagcccc cagccgtctg tgaagagcgc 2881 aggtaacttt aattcgccgc cccgtttctg ggatcccagg aaacccctcc aaagggacgc 2941 agcccaacaa aatgagtatt ggtcttaaaa tccccctccc ctaccaggac ggctgtgctg 3001 tgctcgacct gagctttcaa aagttaagtt atggacccaa atcccatagc gagcccctag 3061 tgactttctg taggggtccc cataggtgta tgggggtctc tatagataat atatgtgctg 3121 tgtgtaattt taaatttctc caaccgtgct gtacaaatgt gtggatttgt aatcaggcta 3181 ttttgttgtt gttgttgttg ttcagagcca ttaatataat atttaaagtt gagttcactg 3241 gataagtttt tcatcttgcc caaccatttc taactgccaa attgaattc // LOCUS HSMYELIN 757 bp DNA PRI 17-FEB-1997 DEFINITION H.sapiens gene for myelin protein zero. ACCESSION Z31718 NID g469516 KEYWORDS myelin; myelin protein zero. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 757) AUTHORS Rautenstrauss,B., Nelis,E., Grehl,H., Pfeiffer,R.A. and Van Broeckhoven,C. TITLE Identification of a de novo insertional mutation in P0 in a patient with a Dejerine-Sottas syndrome (DSS) phenotype JOURNAL Hum. Mol. Genet. 3 (9), 1701-1702 (1994) MEDLINE 95135435 REFERENCE 2 (bases 1 to 757) AUTHORS Rautenstrauss,B. TITLE Direct Submission JOURNAL Submitted (01-APR-1994) Rautenstrauss B., University of Erlangen, Institute of Human Genetics, Schwabachanlage 10, Erlangen, Bavaria, F.R.G., D-91045 FEATURES Location/Qualifiers source 1..757 /organism="Homo sapiens" /isolate="patient H7" /db_xref="taxon:9606" /dev_stage="infant" /tissue_type="muscle" /chromosome="1q22-23" /sex="Male" gene 1..756 /gene="P0" CDS 1..756 /gene="P0" /standard_name="Major Protein Zero MPZ, P0" /function="glycoprotein, transmembrane protein" /codon_start=1 /evidence=experimental /product="myelin protein zero" /db_xref="PID:g469517" /translation="MAPGAPSSSPSPILAVLLFSSLVLSPAQAIVVYTDREVHGAVGS RVTLHCSFWSSEWVSDDISFTWRYQPEGGRDAISIFHYAKGQPYIDEVGTFKERIQWV GDPRWKDGSIVIHNLDYSDNGTFTCDVKNPPDIVGKTSQVTLYVFEKVPTRYGVVLGA VIGGVLGVVLLLLLLFYVVRYCWLRRQAALQRRLSAMEKGKLHKPGKDASKRGRQTPV LYAQCWTTAEAPKLSVRRRPRGWGSLARIRNSG" mutation 663..664 /gene="P0" /note="cf. EMBL accession number D10537" /label=INS707GC /replace="" BASE COUNT 158 a 203 c 236 g 160 t ORIGIN 1 atggctcctg gggctccctc atccagcccc agccctatcc tggctgtgct gctcttctct 61 tctttggtgc tgtccccggc ccaggccatc gtggtttaca ccgacaggga ggtccatggt 121 gctgtgggct cccgggtgac cctgcactgc tccttctggt ccagtgagtg ggtctcagat 181 gacatctcct tcacctggcg ctaccagccc gaaggaggca gagatgccat ttcgatcttc 241 cactatgcca agggacaacc ctacattgac gaggtgggga ccttcaaaga gcgcatccag 301 tgggtagggg accctcgctg gaaggatggc tccattgtca tacacaacct agactacagt 361 gacaatggca cgttcacttg tgacgtcaaa aaccctccag acatagtggg caagacctct 421 caggtcacgc tgtatgtctt tgaaaaagtg ccaactaggt acggggtcgt tctgggagct 481 gtgatcgggg gtgtcctcgg ggtggtgctg ttgctgctgc tgcttttcta cgtggttcgg 541 tactgctggc tacgcaggca ggcggccctg cagaggaggc tcagtgctat ggagaagggg 601 aaattgcaca agccaggaaa ggacgcgtcg aagcgcgggc ggcagacgcc agtgctgtat 661 gcgcaatgct ggaccacagc agaagcacca aagctgtcag tgagaagaag gccaaggggc 721 tgggggagtc tcgcaaggat aagaaatagc ggttagc // LOCUS HSNEURVGF 3038 bp DNA PRI 14-NOV-1997 DEFINITION H.sapiens vgf gene. ACCESSION Y12661 NID g2244658 KEYWORDS neuroendocrine-specific protein; vgf gene; VGF protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3038) AUTHORS Canu,N., Possenti,R., Ricco,A.S., Rocchi,M. and Levi,A. TITLE Cloning, structural organization analysis, and chromosomal assignment of the human gene for the neurosecretory protein VGF JOURNAL Genomics 45 (2), 443-446 (1997) MEDLINE 98008940 REFERENCE 2 (bases 1 to 3038) AUTHORS Canu,N. TITLE Direct Submission JOURNAL Submitted (03-APR-1997) N. Canu, Istituto di Neurobiologia, C.N.R, Viale Marx 43, 00137 Roma, ITALY FEATURES Location/Qualifiers source 1..3038 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="EMBL3/SP6/T7" /tissue_type="placenta" mRNA join(1..196,708..3038) exon 1..196 intron 197..707 exon 708..3038 gene 724..2574 /gene="vgf" CDS 724..2574 /gene="vgf" /codon_start=1 /product="neuro-endocrine specific protein VGF" /db_xref="PID:e315149" /db_xref="PID:g2244659" /translation="MKALRLSASALFCLLLINGLGAAPPGRPEAQPPPLSSEHKEPVA GDAVPGPKDGSAPEVRGARNSEPQDEGELFQGVDPRALAAVLLQALDRPASPPAPSGS QQGPEEEAAEALLTETVRSQTHSLPAAGEPEPAAPPRPQTPENGPEASDPSEELEALA SLLQELRDFSPSSAKRQQETAAAETETRTHTLTRVNLESPGPERVWRASWGEFQARVP ERAPLPPPAPSQFQARMPDSGPLPETHKFGEGVSSPKTHLGEALAPLSKAYQGVAAPF PKARRAESALLGGSEAGERLLQQGLAQVEAGRRQAEATRQAAAQEERLADLASDLLLQ YLLQGGARQRGLGGRGLQEAAEERESAREEEEAEQERRGGEERVGEEDEEAAEAAEAE ADEAERARQNALLFAEEEDGEAGAEDKRSQEETPGHRRKEAEGTEEGGEEEDDEEMDP QTIDSLIELSTKLHLPADDVVSIIEEVEEKRNRKKKAPPEPVPPPRAAPAPTHVRSPQ PPPPPPSARDELPDWNEVLPPWDREEDEVYPPGPYHPFPNYIRPRTLQPPSALRRRHY HHALPPSRHYPGREAQARHAQQEEAEAEERRLQEQEELENYIEHVLLRRP" polyA_signal 3019..3027 BASE COUNT 524 a 1047 c 1022 g 445 t ORIGIN 1 ccagcgtgct gaagccggag cgagctagcc gcccggagcc gcgccgaccc agctgagccc 61 agcccacggg acgccagacc tcgaccgtcg ctcctacccc ggccaccgct cggagccgag 121 gcggacgcgt cccgatcttc ccctgtcccc accctgcccc gaccctcctc tccacctctc 181 gcgtcgtgac accagctggt aaatactccg ctgttcgtcc ctcaaaccct cggcagccag 241 ccgtgggcgt gagggagggt tctctctcct ctcgatgggg gtgttgcaaa cacagcgggg 301 agccccctgg taagggtccc cggtaaacgg gggagtcgca gctttttctc ttgctgctga 361 agtcgcccac gcaccatccg gggagtccta cggggaggga gcagagattt ttttttcccc 421 catattgctg ctgcttagta cgtgggcgat ggcagtgaga tggctcaggg aaggccgagg 481 aggccctggg taagcgaggg cttcgggggt tattttccca tttacacggc tccagagatc 541 ggcacaacat cttcctcctt tgctcctaaa cgttcctctt ctgggtaagg tttgggggat 601 cagggaagcc ccgggtttcc tgctgaaagg tgggggaagg gaacgtagac ctagagaggg 661 gggaattctt acagaaatcc tctttttttg gtcccttcta tttttcagtc tccggcagcc 721 tccatgaaag ccctcagatt gtcggcttcc gccctcttct gccttctgct gatcaacggg 781 ttaggggcag caccccctgg tcgccctgag gcgcagcctc ctcctctcag ctctgagcat 841 aaagagccgg tagccgggga cgcagtgccc gggccaaagg atggcagcgc cccagaggtc 901 cgaggcgctc ggaattccga gccgcaggac gagggagagc ttttccaggg cgtggatccc 961 cgggcgctgg ccgcggtgct gctgcaggca ctcgaccgtc ccgcctcacc cccggcacca 1021 agcggctccc agcaggggcc ggaggaagaa gcagctgaag ctctgctgac cgagaccgtg 1081 cgcagccaga cccacagcct cccggcggcc ggagagcccg agcccgcggc gccccctcgc 1141 cctcagactc cggagaatgg gcccgaggcg agcgatccct ccgaggagct cgaggcgcta 1201 gcgtccctgc tccaggaact gcgagatttc agtccaagta gcgccaagcg ccagcaggag 1261 acggcggcag cagagacgga aacccgcacg cacacgctga cccgagtgaa tctggagagc 1321 ccggggccag agcgcgtatg gcgcgcttcc tggggagagt tccaggcgcg tgtcccggag 1381 cgcgcgcccc tgccgccccc ggccccctct caattccagg cgcgtatgcc cgacagcggg 1441 ccccttcccg aaacccacaa gttcggggaa ggagtgtcct cccccaaaac acacctaggc 1501 gaggcattgg cacccctgtc caaggcgtac caaggcgtgg ccgccccgtt ccccaaggcg 1561 cgccgggccg agagcgcact cctgggcggc tccgaggcgg gcgagcgcct tctccagcaa 1621 gggctggcgc aggtggaggc cgggcggcgg caggcggagg ccacgcggca ggccgcggcg 1681 caggaagagc ggctggccga cctcgcctcg gacctgctgc tccagtattt gctgcagggc 1741 ggggcccggc agcgcggcct cgggggtcgg gggctgcagg aggcggcgga ggagcgagag 1801 agtgcaaggg aggaggagga ggcggagcag gagagacgcg gcggggagga gagggtgggg 1861 gaagaggatg aggaggcggc cgaggcggcg gaggcagagg cggacgaggc ggagagggcg 1921 cggcagaacg cgctcctgtt cgcggaggag gaggacgggg aagccggcgc cgaggacaag 1981 cgctcccagg aggagacgcc gggccaccgg cggaaggagg ccgaggggac agaggagggc 2041 ggggaggagg aggacgacga ggagatggat ccgcagacga tcgacagcct cattgagctg 2101 tccaccaaac tccacctgcc agcggacgac gtggtcagca tcatcgagga ggtggaggag 2161 aagcggaacc gaaagaagaa agcccctccc gagcccgtgc cgcccccccg tgccgccccc 2221 gcccccaccc acgtccgctc cccgcagccc ccgcccccgc ccccgtccgc acgagacgag 2281 ctgccggact ggaacgaggt gctcccgccc tgggatcggg aggaggacga ggtgtacccg 2341 ccagggccgt accacccttt ccccaactac atccggccgc ggacactgca gccgccctcg 2401 gccttgcgcc gccgccacta ccaccacgcc ttgccgcctt cgcgccacta tcccggccgg 2461 gaggcccagg cccggcacgc gcagcaggag gaggcggagg cggaggagcg ccggctgcag 2521 gagcaggagg agctggagaa ttacatcgag cacgtgctgc tccggcgccc gtgactgccc 2581 ttcccggtcc cgcccccgcg cgcccccgcc gcgcgcgcgc gccggcgccc ccctccgtgt 2641 tgctccccct cggtgtttgc atgcgccccg ccctgcccct tgcccctgtc cccgggctgc 2701 gtcgggacct gccagacccc cctcccgggt cctgagcccg aactcccaga gctcacccgc 2761 gggtgaccgg ggccagccca ggagggcggg tggtttgtgc gagttccctt gccacgcgcc 2821 ccggccccat caagtcctct ggggacgtcc ccgtcggaaa ccggaaaaag cagttccagt 2881 taattgtgtg aagtgtgtct gtctgtcctc ccagtcgggc ctcccacgag cccctccagc 2941 ctctccaagt cgctgtgaat gaccccttct ttcctttctc tgttgtaaat accctcacgg 3001 aggaaatagt tttgctaaga aataaaagtg actatttt // LOCUS HSNPYY2S2 3444 bp DNA PRI 01-MAR-1997 DEFINITION Human type 2 neuropeptide Y receptor (NPY Y2) gene, partial exon 2, and complete cds. ACCESSION U50146 NID g1853982 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3444) AUTHORS Ammar,D.A., Eadie,D.M., Wong,D.J., Ma,Y.Y., Kolakowski,L.F. Jr., Yang-Feng,T.L. and Thompson,D.A. TITLE Characterization of the human type 2 neuropeptide Y receptor gene (NPY2R) and localization to the chromosome 4q region containing the type 1 neuropeptide Y receptor gene JOURNAL Genomics 38 (3), 392-398 (1996) MEDLINE 97131518 REFERENCE 2 (bases 1 to 3444) AUTHORS Thompson,D.A. and Ammar,D.A. TITLE Direct Submission JOURNAL Submitted (28-FEB-1996) D.A. Thompson, Dept. of Ophthalmology and Biological Chemistry, Kellogg Eye Center, University of Michigan Medical School, 1000 Wall St., Ann Arbor, MI 48105-0714, USA FEATURES Location/Qualifiers source 1..3444 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q31" mRNA join(U50145:<584..1786,84..>3270) /gene="NPY Y2" /product="type 2 neuropeptide Y receptor" 5'UTR join(U50145:<584..1786,84..131) /gene="NPY Y2" gene join(U50145:<584..1906,1..3303) /gene="NPY Y2" intron 1..83 /gene="NPY Y2" exon 84..>3270 /gene="NPY Y2" /number=2 CDS 132..1277 /gene="NPY Y2" /note="NPY Y2 receptor; G protein-coupled receptor" /codon_start=1 /product="type 2 neuropeptide Y receptor" /db_xref="PID:g1853984" /translation="MGPIGAEADENQTVEEMKVEQYGPQTTPRGELVPDPEPELIDST KLIEVQVVLILAYCSIILLGVIGNSLVIHVVIKFKSMRTVTNFFIANLAVADLLVNTL CLPFTLTYTLMGEWKMGPVLCHLVPYAQGLAVQVSTITLTVIALDRHRCIVYHLESKI SKRISFLIIGLAWGISALLASPLAIFREYSLIEIIPDFEIVACTEKWPGEEKSIYGTV YSLSSLLILYVLPLGIISFSYTRIWSKLKNHVSPGAANDHYHQRRQKTTKMLVCVVVV FAVSWLPLHAFQLAVDIDSQVLDLKEYKLIFTVFHIIAMCSTFANPLLYGWMNSNYRK AFLSAFRCEQRLDAIHSEVSVTFKAKKNLEVRKNSGPNDSFTEATNV" polyA_signal 3248..3254 /gene="NPY Y2" terminator 3276..3283 /gene="NPY Y2" terminator 3296..3303 /gene="NPY Y2" BASE COUNT 858 a 777 c 858 g 951 t ORIGIN 1 tttctttgct tcacctttgt gttttcctcg ttccattggt ttttgttgtt gttgttttgt 61 tttattttgt tttttctttt taggttgtag actcttgtgc tggttgcagg ccaagtggac 121 ctgtactgaa aatgggtcca ataggtgcag aggctgatga gaaccagaca gtggaagaaa 181 tgaaggtgga acaatacggg ccacaaacaa ctcctagagg tgaactggtc cctgaccctg 241 agccagagct tatagatagt accaagctga ttgaggtaca agttgttctc atattggcct 301 actgctccat catcttgctt ggggtaattg gcaactcctt ggtgatccat gtggtgatca 361 aattcaagag catgcgcaca gtaaccaact ttttcattgc caatctggct gtggcagatc 421 ttttggtgaa cactctgtgt ctaccgttca ctcttaccta taccttaatg ggggagtgga 481 aaatgggtcc tgtcctgtgc cacctggtgc cctatgccca gggcctggca gtacaagtat 541 ccacaatcac cttgacagta attgccctgg accggcacag gtgcatcgtc taccacctag 601 agagcaagat ctccaagcga atcagcttcc tgattattgg cttggcctgg ggcatcagtg 661 ccctgctggc aagtcccctg gccatcttcc gggagtattc gctgattgag atcatcccgg 721 actttgagat tgtggcctgt actgaaaagt ggcctggcga ggagaagagc atctatggca 781 ctgtctatag tctttcttcc ttgttgatct tgtatgtttt gcctctgggc attatatcat 841 tttcctacac tcgcatttgg agtaaattga agaaccatgt cagtcctgga gctgcaaatg 901 accactacca tcagcgaagg caaaaaacca ccaaaatgct ggtgtgtgtg gtggtggtgt 961 ttgcggtcag ctggctgcct ctccatgcct tccagcttgc cgttgacatt gacagccagg 1021 tcctggacct gaaggagtac aaactcatct tcacagtgtt ccacatcatc gccatgtgct 1081 ccacttttgc caatcccctt ctctatggct ggatgaacag caactacaga aaggctttcc 1141 tctcggcctt ccgctgtgag cagcggttgg atgccattca ctctgaggtg tccgtgacat 1201 tcaaggctaa aaagaacctg gaggtcagaa agaacagtgg ccccaatgac tctttcacag 1261 aggctaccaa tgtctaagga agctgtggtg tgaaaatgta tggatgaatt ctgaccagag 1321 ctatgaatct ggttgatggc ggctcacaag tgaaaactga tttcccattt taaagaagaa 1381 gtggatctaa atggaagcat ctgctgttta attcctggaa aactggctgg gcagagcctg 1441 tgtgaaaata ctggaattca aagataaggc aacaaaatgg tttacttaac agttggttgg 1501 gtagtaggtt gcattatgag taaaagcaga gagaagtact tttgattatt ttcctggagt 1561 gaagaaaact tgaacaagaa attggtatta tcaaagcatt gctgagagac ggtgggaaaa 1621 taagttgact ttcaaatcac gttaggacct ggattgagga ggtgtgcagt tcgctgctcc 1681 ctgcttggct tatgaaaaca ccactgaaca gaaatttctc cagggagcca caggctctcc 1741 ttcatcgcat tttgattttt ttgttcattc tctagacaaa atccatcagg gaatgctgca 1801 ggaaacgatt gccaactata cgaatggctt cgaggagata aactgaaatt tgctatataa 1861 ttaatatttt ggcagatgat aggggaactc ctcaacactc agtgggccaa ttgttcttaa 1921 aaccaattgc acgtttggtg aaagtttctt caactctgaa tcaaaagctg aaattctcag 1981 aattacagga aatgcaaacc atcatttaat ttctaatttc aagttacatc cgctttatgg 2041 agatactatt tagataacaa gaatacaact tgatactttt attgttatac ctttttgaac 2101 atgtatgatt tctgttgtta tttacctttt taaacagata aatatttttt tttcatttta 2161 gagtagcgga atctaatctt aatctaatct tttaggagta tatttcagag aaattccaag 2221 cacaccagta tgaccatcct tatttcagaa atgacaatgc atagaggaaa agtaatatgt 2281 gcaaagcctc cgaagaggat ggttaagtaa agacttaggt taccagtatc aggctttcgt 2341 ttttgtatgt aggtagctct actgcctcct cttaaaacca acaaaggaaa gagagactgg 2401 ctgcaaactt ttagaaggaa tggcttcgaa tagggttcct gggaggaatc ccgaggaaat 2461 agacgctgct gctctgctga ttgtctccac tatcctgttt tgctcctacc cactaatcca 2521 gcctgggagg ctctgggcat tagcggaagg cttcaccaca aggagacagg agcgagtatt 2581 ccataggcat gcgctcctag tggcacgagt ggcttgggtc aggatcaaag agtgaaggat 2641 tcggaagtca gctatctgga gagagagaga gattgtgttt tattcgtgtc ccatagcttt 2701 cctatcctat ccctatccta gcttttaacc tgagccagag ctcactacac aggttcctgg 2761 ctatcgagtc tgaatctgca ctactcaact tataaactgt ctgcagacac ctgttaggga 2821 aattgctgat catgggcggc aggatctgaa ctcgctttac cttcttgttt ggagcacagg 2881 gaccgcccag ctagaggagc accagcgcac tgcgccccag ccctgggcga gggtgcggag 2941 gatttgttct cggtgcaatc ctgctggcgc ttttccgggg ttctgcgcgg atccagctcc 3001 ccatctctgc tcctacacac acaaaagaaa acaactctcg attggaagtt gtggaatttt 3061 ctcagcccct acgaggcgcg gggattctcc agccccggcc ctcctcccgc cagcctgagg 3121 tctccttcgc tcgcctgcct tgctagggac cgcagtccct cagccgcagc tgggtctgtc 3181 cgccccgcct ttgccctcgc cttttcccgg ggcggatttg gtgaagtcgg cctcaagtcc 3241 aggaggtctg tcttcgccgg gccagctctc gcggaactgg ggggtagaga gcaaagggag 3301 agattcgtgg aagggaaggg aggtaggggt ggcgcaaacg cccagagtat caaacttggg 3361 ggtggcacag taggtgacag cagcagctgc aggtggtggc tggggacccg cgagggggcg 3421 cccctctggg tagggtctgg ctga // LOCUS HSOTF3CG 1998 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens OTF3C gene encoding octamer binding protein 3-like sequence. ACCESSION Z11901 NID g297788 KEYWORDS octamer-binding protein; octamer-binding protein 3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1998) AUTHORS Bell,G.I. TITLE Direct Submission JOURNAL Submitted (03-APR-1992) Graeme I Bell, Howard Hughes Medical Institute, University of, Chicago, 5841 S. Maryland Ave., Chicago, IL, 60637, USA REFERENCE 2 (bases 1 to 1998) AUTHORS Takeda,J., Seino,S. and Bell,G.I. TITLE Human Oct3 gene family: cDNA sequences, alternative splicing, gene organization, chromosomal location, and expression at low levels in adult tissues JOURNAL Nucleic Acids Res. 20 (17), 4613-4620 (1992) MEDLINE 93027160 FEATURES Location/Qualifiers source 1..1998 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="Fetal" /tissue_type="Liver" /clone_lib="Charon 4A library of T. Maniatis" /clone="Lambda hoct3.17" /chromosome="8" gene 443..1522 /gene="OTF3C" CDS 443..1522 /gene="OTF3C" /codon_start=1 /label=retroposon /product="octamer binding protein 3_like sequence" /db_xref="PID:g297789" /db_xref="SWISS-PROT:Q06416" /translation="MAGHLASDFAFSPPPGGGGDGPWGAEPGWVDPLTWLSFQGPPGG PGIGPGVGPGSEVWGIPPCPPPYELCGGMAYCGPQVGVGLVPQGGLETSQPESEAGVG VESNSNGASPEPCTVPPGAVKLEKEKLEQNPEKSQDIKALQKELEQFAKLLKQKRITL GYTQADVGLILGVLFGKVFSQKTICRFEALQLSFKNICKLRPLLQKWVEEADNNENLQ EICKAETLMQARKRKRTSIENRVRGNLENLFLQCPKPTLQISHIAQQLGLEKDVVRVW FCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFPPAPGPHFGTPGYGSPHFTALYS SVPFPEGEVFPPVSVITLGSPMHSN" BASE COUNT 523 a 475 c 542 g 458 t ORIGIN 1 cgttcccaca atctgcagga aataagaatc agaaagaagg acacaccatt ccatagcaca 61 gaacacaagt agatggctcc catgaagaat ctagcatcat ggtatggcag aagcagaatg 121 gactcaggaa tcatggtgcc tacactggaa tttcagatcg gccactgaca caatggacac 181 attattcaac atttccaaat cttggcattc ttatccacaa agtgaagata ataattgtca 241 attcacaggt gattatgatt taaagagatt acttttgaag agttcctaac acattcagtc 301 aacatttaat gatgcttcag gcacagagtt cattgctagt gagcgtatga cacacacagc 361 catacggtca cagagctttc aatgaaaagt aacataattg ctcatttcac caggcccccg 421 gcttggggcg ccttccttcc ccatggcggg acacctggct tcggatttcg ccttctcgcc 481 ccctccaggc ggtgggggtg atgggccatg gggggcggag ccgggctggg ttgatcctct 541 gacctggcta agcttccaag gccctcctgg agggccagga atcgggccgg gggttgggcc 601 aggctctgag gtgtggggga ttcccccttg ccccccgccg tatgagttat gtggggggat 661 ggcgtactgt gggcctcagg ttggagtggg gctagtgccc caaggcggct tggagacctc 721 tcagcctgag agcgaagcag gagtcggggt ggagagcaac tccaatgggg cctccccgga 781 accctgcacc gtcccccctg gtgccgtgaa gctggagaag gagaagctag agcaaaaccc 841 ggagaagtcc caggacatca aagctctgca gaaagaactc gagcaatttg ccaagctcct 901 gaagcagaag aggatcaccc tgggatatac acaggccgat gtggggctca tcctgggggt 961 tctatttggg aaggtgttca gccaaaagac catctgccgc tttgaggctc tgcagcttag 1021 cttcaagaac atctgtaagc tgcggccctt gctgcagaag tgggtggagg aagctgacaa 1081 caatgaaaat cttcaggaga tatgcaaagc agaaaccctc atgcaggccc gaaagagaaa 1141 gcgaaccagt atcgagaacc gagtgagagg caacctggag aatttgttcc tgcagtgccc 1201 gaaacccaca ctgcagatca gccacatcgc ccagcagctt gggctcgaga aggatgtggt 1261 ccgagtgtgg ttctgtaacc ggcgccagaa gggcaagcga tcaagcagcg actatgcaca 1321 acgagaggat tttgaggctg ctgggtctcc tttctcaggg ggaccagtgt cctttcctcc 1381 ggccccaggg ccccattttg gtaccccagg ctatgggagc cctcacttca ctgcactgta 1441 ctcctcggtc cctttccctg agggggaagt ctttccccca gtctccgtca tcactctggg 1501 ctctcccatg cattcaaact gaggtgcctg cccttctagg aatggggaac aggggagggg 1561 aggagctagg gaaagagaac ctggagtttg tgccagggct tttgggatta agttcttcat 1621 tcactaagga aggaattggg aacacaaagg gtgggggcag gggagtttgg ggcaactggt 1681 tggagggaag gtgaagttca atgatgctct tgattttatt cccacatcat gtatcacttt 1741 tttcttaaat aaagaagcct gggacacagt aaaaaaaaaa aaaaaagaaa gaaaagaaaa 1801 gaaaagtaac ataattgagt aataattttt taagtgtggt aaaatatgtg gcacatagta 1861 ggtattcaat aaatacttgc ttttttcctc ctttttacct tttcctagag tcctacttga 1921 tggtagctgt agtttttctt caagacatcc tatttgtcct gatacaaaac ctcttttgac 1981 cgctaacggc atagtccg // LOCUS HSPEG1 1501 bp DNA PRI 29-JUL-1997 DEFINITION H.sapiens PEG1/MEST gene. ACCESSION Y10620 NID g2285949 KEYWORDS PEG1/MEST gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1501) AUTHORS Riesewijk,A.M., Hu,L., Schulz,U., Tariverdian,G., Hoglund,P., Kere,J., Ropers,H.H. and Kalscheuer,V.M. TITLE Monoallelic expression of human PEG1/MEST is paralleled by parent-specific methylation in fetuses JOURNAL Genomics 42 (2), 236-244 (1997) MEDLINE 97336048 REFERENCE 2 (bases 1 to 1501) AUTHORS Kalscheuer,V.M.M. TITLE Direct Submission JOURNAL Submitted (20-JAN-1997) V.M.M. Kalscheuer, Max-Planck Institut fuer Molekulare Genetik, Ihnestrasse 73, D- 14195 Berlin, FRG REMARK revised by submitter 29-JUL-1997 COMMENT Related sequences: D78611 & Y11534. FEATURES Location/Qualifiers source 1..1501 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cosmid library 113" /clone="ICRfc113G0353Q4" /sub_clone="4.3HpT7T3" /chromosome="7" /map="q32" exon 479..722 /gene="PEG1/MEST" /number=1 gene 479..1501 /gene="PEG1/MEST" CDS 697..726 /gene="PEG1/MEST" /codon_start=1 /db_xref="PID:e332036" /db_xref="PID:g2285950" /translation="MVRRDRLRR" intron 723..>1501 /gene="PEG1/MEST" /number=1 BASE COUNT 242 a 414 c 527 g 318 t ORIGIN 1 gagggatggg agcaggcgcc acggccggca ccccagagcc ctgctgcccc ttagttcgag 61 cggccatcct cctgtggggc ttgtgggcag cctgtggggt ttgtgggcgg cctgtggggt 121 ttgtgggtgg tctaaggaaa gagttggggc actcaggggt ctgctgtttt tgcccgtggc 181 cttaactcat caggggaggg tttctgcagc agaatctcgg gctcagggtt ggcggttaac 241 gagggagcag cggggtcttg gggagggggc tcgacacccc tgaaggtgcc ccctaaagga 301 gccactgtta gaggggcacc ccatctttgt ggccatggcg gtggtagagc ggctgggagg 361 ggctctgcgg cgagcaaggg agcaggcggt aggggttttg cggcgatggg cgggctaggg 421 gcggggcgcg ggtgggctct aaaagtcggt gcccactcgc tccgcgctgc cgcggcaacc 481 agcacacccc ggcacctcct ctgcggcagc tgcgcctcgc aagcgcagtg ccgcagcgca 541 cgccggagtg gctgtagctg cccggcgcgg cgccgccctg cgcgggctgt gggctgcggg 601 ctgcgccccc gctgctggcc agctctgcac ggctgcgggc tctgcggcgc ccggtgctct 661 gcaacgctgc ggcgggcggc atgggataac gcggccatgg tgcgccgaga tcgcctccgc 721 aggtgagtgt gcggtgggaa cgagggggtg tggctggcgg ccctgggact agggcgcagg 781 cgagcggagg actgtgtgcc cgtgtccgag ctggggctgc ctctgggccg aaactctacc 841 gacaggcggc acgcattccg cgcccgctct gcctacttga ggagggggtg tcactcctgc 901 ccgcaatgga atgttcagaa cgcgggacct ccttgggtta ggatttctag accccgggat 961 cgtcgtggtg agatttagga tttctggacc ccagcgtcat cttgatatga cttaggatcc 1021 ataatgaccc tggtctcacc ctgatgcgaa ttgggatttt tagatcctgg catcaccctg 1081 gtgcgattta ggatttttat actcagtcat tgctgcagca tgatttagga tttctaaccc 1141 ccagcatcgc cctggtttga tttaggatat ttagactccg gcttccctct ggtgcgattc 1201 aggattctta gactccgccg ttgccgtggc gcgatttagg atttatagat cccggcaaag 1261 ccctggtgcg atgtaggatt tttagaaccc cagcatcgct ctggtgcgac ttaaaggata 1321 ggccccagca tcgccctggt gcgatgtagg atttttagaa ccccggtatc tccgtggcgc 1381 accttaggat ttcaagaacg ggataatcgc agtgccgaga tcgccgcggt gcagcttagg 1441 atttcaagac ccaggtatca cggtggcggg agtcaccgca gtgactagaa ctcgcagtgc 1501 c // LOCUS HSRFXAP 2785 bp DNA PRI 18-SEP-1997 DEFINITION H.sapiens RFXAP mRNA. ACCESSION Y12812 NID g2073409 KEYWORDS 36kD subunit; RFX DNA-binding complex; RFXAP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2785) AUTHORS Durand,B., Sperisen,P., Emery,P., Barras,E., Zufferey,M., Mach,B. and Reith,W. TITLE RFXAP, a novel subunit of the RFX DNA binding complex is mutated in MHC class II deficiency JOURNAL EMBO J. 16 (5), 1045-1055 (1997) MEDLINE 97224131 REFERENCE 2 (bases 1 to 2785) AUTHORS Reith,W. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) W. Reith, Department of Genetics & Microbiology, University of Geneva Medical School, 1 rue Michel-Servet, CH-1211 Geneva 4, SWITZERLAND FEATURES Location/Qualifiers source 1..2785 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /cell_line="Namalwa" gene 117..935 /gene="RFXAP" CDS 117..935 /gene="RFXAP" /codon_start=1 /product="36kD subunit of RFX DNA-binding complex" /db_xref="PID:e314739" /db_xref="PID:g2073410" /translation="MEAQGVAEGAGPGAASGVPHPAALAPAAAPTLAPASVAAAASQF TLLVMQPCAGQDEAAAPGGSVGAGKPVRYLCEGAGDGEEEAGEDEADLLDTSDPPGGG ESAASLEDLEDEETHSGGEGSSGGARRRGSGGGSMSKTCTYEGCSETTSQVAKQRKPW MCKKHRNKMYKDKYKKKKSDQALNCGGTASTGSAGNVKLEESADNILSIVKQRTGSFG DRPARPTLLEQVLNQKRLSLLRSPEVVQFLQKQQQLLNQQVLEQRQQQFPGTSM" BASE COUNT 807 a 512 c 662 g 804 t ORIGIN 1 cccggtatag gcgcctttta ccccagcgtg tcctgagtct ttggttcgcg aagtgccgtt 61 aggccaagca ggtgctaaaa gcccggggtc gtggaccccg gccaggtctt agcagcatgg 121 aggcgcaggg tgtagcggag ggcgcggggc cgggcgccgc cagcggcgtg ccccaccccg 181 cggccctagc cccggctgcg gctcccacct tggcgccagc ctcggtggcg gccgcggcct 241 ctcaattcac cctgctagtg atgcaaccct gtgctgggca ggacgaggct gcggcccccg 301 ggggcagcgt tggggcgggc aagcccgtta ggtacctgtg cgaaggggcc ggggatggcg 361 aagaggaggc tggggaggac gaggcggacc tgttagacac ttcggaccct ccggggggag 421 gcgagagcgc ggctagtttg gaggatctag aggacgagga gactcactcg gggggcgagg 481 gcagcagcgg gggcgcccgg aggcggggca gcggtggggg cagcatgagc aagacctgca 541 cctacgaagg ctgcagcgag accacgagcc aggtggccaa gcagcgcaaa ccgtggatgt 601 gcaagaaaca ccgcaacaag atgtacaagg acaagtataa aaagaagaag agcgaccagg 661 ccctgaactg cggtgggact gcctcgactg gcagcgcggg aaacgtcaaa ctcgaggaaa 721 gtgcagataa catactctcc attgttaaac aaagaacagg atcttttggg gatcgtcctg 781 caagacctac tcttttagaa caagtgttaa atcaaaaaag actgtcgtta ctaagaagtc 841 cagaagtagt gcaattttta cagaaacagc aacagctatt aaatcagcaa gttttggagc 901 aaagacaaca gcagtttcca ggaacatcaa tgtgagggaa cttaccaaga acatctacat 961 ggtttttatc ttattgtaat agatgagcat atttttttac cagacataaa tggggtaata 1021 atctatgcct gtagaacata aacattttcc tgtaaatgta tgtgtgcatt tggggataag 1081 taagtattgc actttgtgca tctaatcttt cagattactg tgagtttgaa gaagtcagct 1141 tatctttcca aataacattt aattataatg ttttttaaaa aatatattcc tcttcagtca 1201 ttgttactga gggtaatgaa gcagttactt tctgtgggag tcataaagtt aatagatatt 1261 aatcttgact catctagctc agtggttctc atcaagggtc aatttgattg tcatagtgac 1321 cttgaaaacc actggctttt agtgagtggc caggaaatgc taaatgttct gcagtgtcag 1381 gggtagtccc acatactaaa gattgtctca cccgcagtgc caataacact cctaagaaat 1441 gttgatggct attttgtggt gctaacatgt agttggggca cctacaattg ggttctctta 1501 ataacctttc tttgcagtta agactgaagc tgtcaaagag gtaagcacat tttatataga 1561 cgtaaggaaa gtgattattg tttaatatct gtgaatttag gatgtgcatc tcttttcaga 1621 ggtgtgttag taaaacctga cggattaact aagcacactg ggatgtgtct cctacagttg 1681 gcttctctct ttgatgttac ctgttagtgc tgatctctta aagcagacat ttcttgtttg 1741 ttgaatttgt gaacagtata gatctcagcc caccaatgcc aagacaaaat tatttttctt 1801 atacttattt tttattaaac aaaatgaaaa agatcctttt caaaaaggtg atcctgaaaa 1861 taaaactaac actccagtat tttgtcattg tttttcgcaa ttgagctatc tgaaaactgt 1921 tattcctaag taatgttcaa aaatgataag taatctggat acctttttct tatactttct 1981 cctaggaaaa ctttaaaact ttaaaaaggc aaacctacca ataggaataa caaattaaat 2041 gtcaagagag tatatccaat attaggatat aaatgtatgt gtctcaagtt taactctaca 2101 aaaatttgtt acttgttttt taaactctat atataaagtt cgacttaatc atggctgttc 2161 taagaagtac ttatggagag caagaacatt tttgttcatt tcttaatgtg tgtgttttta 2221 cttgcatatc tgttcaaaac acttttaaca aaattaattc attaaagtcc agttgttgac 2281 ctttgagtta gccgatttct ttattctgtt ctttagttta ttcttactag atgcagagga 2341 attcatctac tgtctgttat taactgttag tttattctca tacttacgat gttgagagtt 2401 tttttgaagc ttaagttacc ctttatggtg gaaaacatta gcttatgctt ctttagatgg 2461 aataatggga aaggagggaa atgggaaatg gatggaaatg ggaaaggagg gaaaataata 2521 gcccagtgag agctgaatga aaagggactg aatttaaata tttgtaagaa ctttgtgatg 2581 atgagtaatt gtcagacgtg ggatagataa ctgagaggct cagaatcttt accaaggata 2641 ttttttagga taaggtagct gcctgttcat gaatttggat aagaatagta ggacaatatt 2701 caacacaatt taatttttgt ctgccacatt agacattttt ttaccttata aaatgatcaa 2761 taaagcaata aggtttattt tgggt // LOCUS HSSELPLG2 3409 bp DNA PRI 23-AUG-1995 DEFINITION Human P-selectin glycoprotein ligand (SELPLG) gene, exon 2, and complete cds. ACCESSION U25956 NID g902795 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3409) AUTHORS Veldman,G.M., Bean,K.M., Cumming,D.A., Eddy,R.L., Sait,N.J.S. and Shows,T.B. TITLE Genomic organization and chromosomal localization of the gene encoding human P-selectin glycoprotein ligand JOURNAL J. Biol. Chem. 270 (27), 16470-16475 (1995) MEDLINE 95332364 REFERENCE 2 (bases 1 to 3409) AUTHORS Veldman,G.M., Bean,K.M., Cumming,D.A., Eddy,R.L., Sait,N.J.S. and Shows,T.B. TITLE Direct Submission JOURNAL Submitted (28-APR-1995) David Merberg, Research Computing, Genetics Institute, 87 CambridgePark Drive, Cambridge, MA 02140, USA FEATURES Location/Qualifiers source 1..3409 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="12q24" /chromosome="12" mRNA join(U25955:444..497,239..2255) /gene="SELPLG" gene join(U25955:444..703,1..2255) /gene="SELPLG" exon 239..2255 /gene="SELPLG" /number=2 CDS 244..1482 /gene="SELPLG" /codon_start=1 /product="P-selectin glycoprotein ligand" /db_xref="PID:g902797" /translation="MPLQLLLLLILLGPGNSLQLWDTWADEAEKALGPLLARDRRQAT EYEYLDYDFLPETEPPEMLRNSTDTTPLTGPGTPESTTVEPAARRSTGLDAGGAVTEL TTELANMGNLSTDSAAMEIQTTQPAATEAQTTQPVPTEAQTTPLAATEAQTTRLTATE AQTTPLAATEAQTTPPAATEAQTTQPTGLEAQTTAPAAMEAQTTAPAAMEAQTTPPAA MEAQTTQTTAMEAQTTAPEATEAQTTQPTATEAQTTPLAAMEALSTEPSATEALSMEP TTKRGLFIPFSVSSVTHKGIPMAASNLSVNYPVGAPDHISVKQCLLAILILALVATIF FVCTVVLAVRLSRKGHMYPVRNYSPTEMVCISSLLPDGGEGPSATANGGLSKAKSPGL TPEPREDREGDDLTLHSFLP" BASE COUNT 793 a 993 c 925 g 698 t ORIGIN 1 tagaagaagt taaagggccc tcctggatgg ctttattcat gttgatgagt aataataata 61 actgctactg gctgaggatc ttctccatcc caggcatgtc agggatgcct aagtccccag 121 tccctgctcc agaccagaca tcttccagct gtggcagtag agggtggtgg tctagggtgc 181 ttgctaagcc caagggtgaa actgtcttga catccctccg cccattgtct cctcctaggt 241 gccatgcctc tgcaactcct cctgttgctg atcctactgg gccctggcaa cagcttgcag 301 ctgtgggaca cctgggcaga tgaagccgag aaagccttgg gtcccctgct tgcccgggac 361 cggagacagg ccaccgaata tgagtaccta gattatgatt tcctgccaga aacggagcct 421 ccagaaatgc tgaggaacag cactgacacc actcctctga ctgggcctgg aacccctgag 481 tctaccactg tggagcctgc tgcaaggcgt tctactggcc tggatgcagg aggggcagtc 541 acagagctga ccacggagct ggccaacatg gggaacctgt ccacggattc agcagctatg 601 gagatacaga ccactcaacc agcagccacg gaggcacaga ccactcaacc agtgcccacg 661 gaggcacaga ccactccact ggcagccaca gaggcacaga caactcgact gacggccacg 721 gaggcacaga ccactccact ggcagccaca gaggcacaga ccactccacc agcagccacg 781 gaagcacaga ccactcaacc cacaggcctg gaggcacaga ccactgcacc agcagccatg 841 gaggcacaga ccactgcacc agcagccatg gaagcacaga ccactccacc agcagccatg 901 gaggcacaga ccactcaaac cacagccatg gaggcacaga ccactgcacc agaagccacg 961 gaggcacaga ccactcaacc cacagccacg gaggcacaga ccactccact ggcagccatg 1021 gaggccctgt ccacagaacc cagtgccaca gaggccctgt ccatggaacc tactaccaaa 1081 agaggtctgt tcataccctt ttctgtgtcc tctgttactc acaagggcat tcccatggca 1141 gccagcaatt tgtccgtcaa ctacccagtg ggggccccag accacatctc tgtgaagcag 1201 tgcctgctgg ccatcctaat cttggcgctg gtggccacta tcttcttcgt gtgcactgtg 1261 gtgctggcgg tccgcctctc ccgcaagggc cacatgtacc ccgtgcgtaa ttactccccc 1321 accgagatgg tctgcatctc atccctgttg cctgatgggg gtgaggggcc ctctgccaca 1381 gccaatgggg gcctgtccaa ggccaagagc ccgggcctga cgccagagcc cagggaggac 1441 cgtgaggggg atgacctcac cctgcacagc ttcctccctt agctcactct gccatctgtt 1501 ttggcaagac cccacctcca cgggctctcc tgggccaccc ctgagtgccc agaccccaat 1561 ccacagctct gggcttcctc ggagacccct ggggatgggg atcttcaggg aaggaactct 1621 ggccacccaa acaggacaag agcagcctgg ggccaagcag acgggcaagt ggagccacct 1681 ctttcctccc tccgcggatg aagcccagcc acatttcagc cgaggtccaa ggcaggaggc 1741 catttacttg agacagattc tctccttttt cctgtccccc atcttctctg ggtccctcta 1801 acatctccca tggctctccc cgcttctcct ggtcactgga gtctcctccc catgtaccca 1861 aggaagatgg agctccccca tcccacacgc actgcactgc cattgtcttt tggttgccat 1921 ggtcaccaaa caggaagtgg acattctaag ggaggagtac tgaagagtga cggacttctg 1981 aggctgtttc ctgctgctcc tctgacttgg ggcagcttgg gtcttcttgg gcacctctct 2041 gggaaaaccc agggtgaggt tcagcctgtg agggctggga tgggtttcgt gggcccaaag 2101 ggcagacctt tctttgggac tgtgtggacc aaggagcttc catctagtga caagtgaccc 2161 ccagctatcg cctcttgcct tcccctgtgg ccactttcca gggtggactc tgtcttgttc 2221 actgcagtat cccaactgca ggtccagtgc aggcaataaa tatgtgatgg acaaaacgat 2281 agcggaatcc ttcaaggttt caaggctgtc tccttcaggc agccttcccg gaattctcca 2341 tccctcagtg caggatgggg gctggtcctc agctgtctgc cctcagcccc tggcccccca 2401 ggaagcctct ttcatgggct gttaggttga cttcagtttt gcctcttgga caacaggggg 2461 tcttgtacat ccttgggtga ccaggaaaag ttcaggctat ggggggccaa agggagggct 2521 gccccttccc caccagtgac cactttattc cacttcctcc attacccagt tttggcccac 2581 agagtttggt cccccccaaa cctcggacca atatccctct aaacatcaat ctatcctcct 2641 gttaaagaaa aaaaaaaatg ggactgggag cagtggctca tgcctgtaat cccagcactt 2701 tgggaggccg aggcaggtac atcacctgag gtcaggagtt caagactagc ctggccaaca 2761 tagtgaaacc ctgtctctac taaaaataca aagattagtc aggtgtggtg gcacatgcct 2821 gtagtcccag ctactgggga ggctgaggca ggagaattgc ttgaacccgg gaagcggagg 2881 gaggttgcag tgagctgaga tcacgctact gcactccagc ctgggtgaca gagtaagact 2941 ccgtctcaaa aaaaaaaaaa aagattcaat gacccttgtt aaagcatggt aaggaagact 3001 ttgttcaagg ggagtgggac tctctcaatc actgcaggga ctgcagctat gggattttgc 3061 agtgggggca tttgggctca actatgagta cagcaggggc aagtgggagc tgatagccag 3121 ggaacagggt tggatatctg cagctggaaa attaccaaga ggaaacatca ggggaagggg 3181 aattctggct aaactgactg ctggggatgg gttctcggtc attttctaca ctgacctaac 3241 aggattcata ctggaggcag gccagggtgc tcagacatca ccggggggat ggtggcagat 3301 gaggaacgtg atcagatata ggaggtgatc agatatggga ggtgatcaga tatggagtgg 3361 tggggggagg gttgttgcta agctgactta gcagagttct tgttagaac // LOCUS HSSHC7 1879 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens shc gene, p66 isoform. ACCESSION Y09847 NID g1834514 KEYWORDS p66 isoform; psi shc gene; shc gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1879) AUTHORS Harun,R.B., Smith,K.K., Leek,J.P., Markham,A.F., Norris,A. and Morrison,J.F. TITLE Characterization of human SHC p66 cDNA and its processed pseudogene mapping to Xq12-q13.1 JOURNAL Genomics 42 (2), 349-352 (1997) MEDLINE 97336064 REFERENCE 2 (bases 1 to 1879) AUTHORS Harun,R. TITLE Direct Submission JOURNAL Submitted (05-DEC-1996) R. Harun, Molecular Medicine Unit, Clinical Sciences Building, St James University Hospital, Beckett Street, Leeds. LS9 7TF, UK COMMENT alternatively spliced gene for p52 and p46 in X68148. FEATURES Location/Qualifiers source 1..1879 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Fibroblast" gene 5..1756 /gene="shc p66" CDS 5..1756 /gene="shc p66" /codon_start=1 /db_xref="PID:e293323" /db_xref="PID:g1834515" /translation="MNLLPPKPKYNPLRNESLSSMEEGASGSTPPEELPSPPASSLGP ILPPLPGDDSPTTLCSFFPRMSNLRLANPAGGRPGSKGEPGRAADDGEGIVGAAMPDS GPLPLLQDMNKLSGGGGRRTRVEGGQLGGEEWTRHGSFVNKPTRGWLHPNDKVMGPGV SYLVRYMGCVEVLQSMRALDFNTRTQVTREAISLVCEAVPGAKGATRRRKPCSRPLSS ILGRSNLKFAGMPITLTVSTSSLNLMAADCKQIIANHHMQSISFASGGDPDTAEYVAY VAKDPVNQRACHILECPEGLAQDVISTIGQAFELRFKQYLRNPPKLVTPHDRMAGFDG SAWDEEEEEPPDHQYYNDFPGKEPPLGGVVDMRLREGAAPGAARPTAPNAQTPSHLGA TLPVGQPVGGDPEVRKQMPPPPPCPGRELFDDPSYVNVQNLDKARQAVGGAGPPNPAI NGSAPRDLFDMKPFEDALRVPPPPQSVSMAEQLRGEPWFHGKLSRREAEALLQLNGDF LVRESTTTPGQYVLTGLQSGQPKHLLLVDPEGVVRTKDHRFESVSHLISYHMDNHLPI ISAGSELCLQQPVERKL" BASE COUNT 388 a 567 c 555 g 369 t ORIGIN 1 agctatgaat ctcctgcccc ccaagcccaa gtacaatcca ctccggaatg agtctctgtc 61 atcgatggag gaaggggctt ctgggtccac ccccccggag gagctgcctt ccccaccagc 121 ttcatccctg gggcccatcc tgcctcctct gcctggggac gatagtccca ctaccctgtg 181 ctccttcttc ccccggatga gcaacctgag gctggccaac ccggctgggg ggcgcccagg 241 gtctaagggg gagccaggaa gggcagctga tgatggggag gggatcgtag gggcagccat 301 gccagactca ggccccctac ccctcctcca ggacatgaac aagctgagtg gaggcggcgg 361 gcgcaggact cgggtggaag ggggccagct tgggggcgag gagtggaccc gccacgggag 421 ctttgtcaat aagcccacgc ggggctggct gcatcccaac gacaaagtca tgggacccgg 481 ggtttcctac ttggttcggt acatgggttg tgtggaggtc ctccagtcaa tgcgtgccct 541 ggacttcaac acccggactc aggtcaccag ggaggccatc agtctggtgt gtgaggctgt 601 gccgggtgct aagggggcga caaggaggag aaagccctgt agccgcccgc tcagctctat 661 cctggggagg agtaacctga aatttgctgg aatgccaatc actctcaccg tctccaccag 721 cagcctcaac ctcatggccg cagactgcaa acagatcatc gccaaccacc acatgcaatc 781 tatctcattt gcatccggcg gggatccgga cacagccgag tatgtcgcct atgttgccaa 841 agaccctgtg aatcagagag cctgccacat tctggagtgt cccgaagggc ttgcccagga 901 tgtcatcagc accattggcc aggccttcga gttgcgcttc aaacaatacc tcaggaaccc 961 acccaaactg gtcacccctc atgacaggat ggctggcttt gatggctcag catgggatga 1021 ggaggaggaa gagccacctg accatcagta ctataatgac ttcccgggga aggaaccccc 1081 cttggggggg gtggtagaca tgaggcttcg ggaaggagcc gctccagggg ctgctcgacc 1141 cactgcaccc aatgcccaga cccccagcca cttgggagct acattgcctg taggacagcc 1201 tgttggggga gatccagaag tccgcaaaca gatgccacct ccaccaccct gtccaggcag 1261 agagcttttt gatgatccct cctatgtcaa cgtccagaac ctagacaagg cccggcaagc 1321 agtgggtggt gctgggcccc ccaatcctgc tatcaatggc agtgcacccc gggacctgtt 1381 tgacatgaag cccttcgaag atgctcttcg ggtgcctcca cctccccagt cggtgtccat 1441 ggctgagcag ctccgagggg agccctggtt ccatgggaag ctgagccggc gggaggctga 1501 ggcactgctg cagctcaatg gggacttctt ggtacgggag agcacgacca cacctggcca 1561 gtatgtgctc actggcttgc agagtgggca gcctaagcat ttgctactgg tggaccctga 1621 gggtgtggtt cggactaagg atcaccgctt tgaaagtgtc agtcacctta tcagctacca 1681 catggacaat cacttgccca tcatctctgc gggcagcgaa ctgtgtctac agcaacctgt 1741 ggagcggaaa ctgtgatctg ccctagcgct ctcttccaga agatgccctc caatcctttc 1801 caccctattc cctaactctc gggacctcgt ttgggagtgt tctgtgggct tggccttgtg 1861 tcagagctgg gagtagcat // LOCUS HSSKI2WGN 3796 bp DNA PRI 17-FEB-1997 DEFINITION H.sapiens SKI2W gene. ACCESSION X98378 NID g1403335 KEYWORDS SKI2W gene; SKI2W protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3796) AUTHORS Albertella,M.R., Jones,H., Thomson,W., Olavesen,M.G. and Campbell,R.D. TITLE Localization of eight additional genes in the human major histocompatibility complex, including the gene encoding the casein kinase II beta subunit (CSNK2B) JOURNAL Genomics 36 (2), 240-251 (1996) MEDLINE 96411681 REFERENCE 2 (bases 1 to 3796) AUTHORS Campbell,R.D. TITLE Direct Submission JOURNAL Submitted (10-JUN-1996) R.D. Campbell, MRC Immunochemistry Unit, University of Oxford, Dept of Biochemistry, South Parks Road, Oxford OX1 3QU, UK REFERENCE 3 (bases 1 to 3796) AUTHORS Dangel,A.W., Shen,L., Mendoza,A.R., Wu,L.C. and Yu,C.Y. TITLE Human helicase gene SKI2W in the HLA class III region exhibits striking structural similarities to the yeast antiviral gene SKI2 and to the human gene KIAA0052: emergence of a new gene family JOURNAL Nucleic Acids Res. 23 (12), 2120-2126 (1995) MEDLINE 95334363 COMMENT Related sequence Z48796. FEATURES Location/Qualifiers source 1..3796 /organism="Homo sapiens" /note="MHC class III region" /db_xref="taxon:9606" /cell_line="ICE5" /clone="Bf23" /chromosome="6" /map="p21.3" gene 6..3746 /gene="SKI2W" CDS 6..3746 /gene="SKI2W" /codon_start=1 /product="SKI2W protein" /db_xref="PID:e248567" /db_xref="PID:g1403336" /translation="MMETERLVLPPPDPLDLPLRAVELGCTGHWELLNLPGAPESSLP HGLPPCAPDLQQEAEQLFLSSPAWLPLHGVEHSARKWQRKTDPWSLLAVLGAPVPSDL QAQRHPTTGQILGYKEVLLENTNLSATTSLSLRRPPGPASQSLWGNPTQYPFWPGGMD EPTITDLNTREEAEEEIDFEKDLLTIPPGFKKGMDFAPKDCPTPAPGLLSLSCLLEPL DLGGGDEDENEAVGQPGGPRGDTVSASPCSAPLARASSLEDLVLKEASTAVSTPEAPE PPSQEQWAIPVDATSPVGDFYRLIPQPAFQWAFEPDVFQKQAILHLERHDSVFVAAHT SAGKTVVAEYAIALAQKHMTRTIYTSPIKALSNQKFRDFRNTFGDVGLLTGDVQLHPE ASCLIMTTEILRSMLYSGSDVIRDLEWVIFDEVHYINDVERGVVWEEVLIMLPDHVSI ILLSATVPNALEFADWIGRLKRRQIYVISTVTRPVPLEHYLFTGNSSKTQGELFLLLD SRGAFHTKGYYAAVEAKKERMSKHAQTFGAKQPTHQGGPAQDRGVYLSLLASLRTRAQ LPVVVFTFSRGRCDEQASGLTSLDLTTSSEKSEIHLFLQRCLARLRGSDRQLPQVLHM SELLNRGLGVHHSGILPILKEIVEMLFSRGLVKVLFATETFAMGVNMPARTVVFDSMR KHDGSTFRDLLPGEYVQMAGRAGRRGLDPTGTVILLCKGRVPEMADLHRMMMGKPSQL QSQFRLTYTMILNLLRVDALRVEDMMKRSFSEFPSRKDSKAHEQALAELTKRLGALEE PDMTGQLVDLPEYYSWGEELTETQHMIQRRIMESVNGLKSLSAGRVVVVKNQEHHNAL GVILQVSSNSTSRVFTTLVLCDKPLSQDPQDRGPATAEVPYPDDLVGFKLFLPEGPCD HTVVKLQPGDMAAITTKVLRVNGEKILEDFSKRQQPKFKKDPPLAAVTTAVQELLRLA QAHPAGPPTLDPVNDLQLKDMSVVEGGLRARKLEELIQGAQCVHSPRFPAQYLKLRER MQIQKEMERLRFLLSDQSLLLLPEYHQRVEVLRTLGYVDEAGTVKLAGRVACAMSSHE LLLTELMFDNALSTLRPEEIAALLSGLVCQSPGDAGDQLPNTLKQGIERVRAVAKRIG EVQVACGLNQTVEEFVGELNFGLVEVVYEWARGMPFSELAGLSGTPEGLVVRCIQRLA EMCRSLRGAARLVGEPVLGAKMETAATLLRRDIVFAASLYTQ" conflict 457 /gene="SKI2W" /citation=[3] /replace="g" conflict 1874 /gene="SKI2W" /citation=[3] /replace="g" conflict 3159 /gene="SKI2W" /citation=[3] /replace="t" conflict 3206 /gene="SKI2W" /citation=[3] /replace="t" polyA_signal 3769..3774 polyA_site 3779 BASE COUNT 809 a 1101 c 1106 g 780 t ORIGIN 1 ccaggatgat ggagacagag cgacttgtgc taccccctcc agatcccctg gacctacccc 61 ttcgggccgt ggagctcgga tgcacggggc actgggagct gctgaacttg cctggagctc 121 cagagagtag ccttccccat ggcctccctc cttgtgcccc agatctgcag caagaagcag 181 aacagttgtt tctgtcatcc ccagcctggc tgcctctgca tggtgtggag cactcagccc 241 gaaaatggca gaggaagacg gatccctggt ctcttttggc tgtcctggga gccccagtcc 301 catccgacct acaggcccaa agacacccaa ccacaggcca gatactgggt tacaaagagg 361 tcttgctgga gaacacaaat ctctcggcta caacctcctt gtctcttcgc cggcctccag 421 ggccagcctc ccagtcctta tggggaaatc caactcagta tcccttctgg ccagggggga 481 tggatgaacc caccataaca gatctgaaca cacgggagga ggctgaggag gagatagact 541 ttgagaaaga tcttcttact attccacctg gtttcaagaa aggcatggac tttgcaccaa 601 aagattgtcc aactccagct cctggactac taagccttag ctgtctgttg gagcctctgg 661 atttgggtgg gggtgacgag gatgagaatg aggcagtggg acagccagga ggtcccagag 721 gggacactgt ttcagcctct ccctgcagtg ctcccctggc ccgagcaagc agcttggaag 781 acctagtgtt gaaggaagcg tccacagctg tatccacccc agaggcccca gagcctccat 841 ctcaggagca gtgggccatc cctgtggacg ccacctcccc tgttggtgat ttctatcgcc 901 tcattcccca gccagccttc cagtgggcat ttgagccaga tgtgtttcag aaacaggcca 961 tcctgcactt ggaacggcat gactctgtct ttgtcgcagc tcacacatct gcaggaaaaa 1021 cagttgtggc tgaatatgcc attgccctgg cccagaaaca catgacacgc accatctaca 1081 cttcgcccat caaggccctg agcaaccaga agttccggga cttccgaaac acattcgggg 1141 atgtggggct gctcaccggg gatgtacagc tgcatccgga ggcctcctgc ctcatcatga 1201 ccacagagat ccttcgctcc atgctgtaca gtggctcaga tgttattcgg gacctggagt 1261 gggtcatctt tgatgaggtt cactatatca acgatgtcga gcgtggggtc gtgtgggagg 1321 aggtgcttat catgctacct gaccacgttt ctatcatcct tctgagtgcc accgtcccca 1381 acgcccttga gtttgctgac tggattgggc ggctgaagcg tcgtcagatc tatgtgatta 1441 gcactgtaac ccgccccgtg cccctggagc actatctttt cacagggaac agctccaaga 1501 cccaggggga gctctttttg ttgctggact cccgaggagc cttccataca aaagggtact 1561 atgcagctgt ggaggccaag aaggagagaa tgagcaaaca cgcccagacc tttggggcca 1621 agcagcccac acatcagggg ggccctgcac aggaccgcgg agtgtacctg tccctcctgg 1681 cctccctccg cacacgtgcc cagttgcccg tggtggtgtt caccttctcc cggggccgct 1741 gtgatgagca ggcctcaggc ctcacctccc ttgacctcac caccagttcg gagaagagcg 1801 agatccacct cttcctgcag cgctgccttg ctcgcctccg tggctctgac cgccagctgc 1861 cccaggtcct gcacatgtca gagctcctga atcgcggcct gggtgtgcac catagcggca 1921 tcctgcccat cctcaaggag atcgtggaga tgctcttcag ccgtggcctg gtcaaggtct 1981 tgtttgccac agagaccttt gccatgggag taaacatgcc tgctcgtaca gtagtgtttg 2041 actccatgcg caaacacgat ggctccacct tccgggacct gctccctggg gagtatgtgc 2101 agatggcagg ccgggcaggg cggaggggcc tggaccccac aggcaccgtt atcctgctct 2161 gcaagggccg agtgcccgag atggcagacc tgcaccgcat gatgatgggg aagccgtccc 2221 agctgcagtc ccagttccgc ctcacgtaca ctatgatcct caacttgctg cgagtggatg 2281 ccctcagggt ggaggacatg atgaagagga gcttctctga gtttccctcc cgcaaagaca 2341 gcaaggccca tgaacaggcc ctggctgaac tgaccaagag gctgggagct ttggaggagc 2401 ctgacatgac tggccaactg gtcgacctgc ctgaatatta cagctggggg gaggaactga 2461 cagagaccca gcacatgatc cagcgacgca tcatggagtc tgtgaacggg ctgaagtctc 2521 tctcagcagg aagggtggtg gttgtgaaga atcaggagca tcacaacgca ttgggagtga 2581 tcctacaggt ctcctcgaac tccaccagca gagtattcac aaccctggtc ttgtgtgata 2641 agcccttgtc ccaggaccca caggacaggg ggccagccac tgcagaggtg ccctatccag 2701 atgacctcgt gggattcaag ctgttcctgc ctgaagggcc ttgtgaccac accgtggtca 2761 agctccagcc aggagatatg gctgccatca ccaccaaggt gctccgggtg aatggggaga 2821 agatcttgga ggacttcagc aagaggcagc agccaaaatt caagaaggat cctccccttg 2881 cagccgtgac cactgctgtc caggaactgc tgcgtctggc tcaggcccac ccagccggac 2941 ctcccaccct cgaccctgtc aatgacctgc agctcaaaga tatgtcagtt gtagagggtg 3001 ggctccgggc ccggaagctg gaggagctga tccagggggc tcagtgtgta cacagccccc 3061 gttttcctgc ccagtacctg aagctgcggg agcgaatgca gatacagaag gagatggagc 3121 ggctgcgctt cctactgtcg gatcagtcat tgctgctgct tcctgagtac catcagcgag 3181 tagaggtgct ccgaaccctg ggttacgtgg acgaggcggg cactgtgaag ctggcagggc 3241 gggtggcttg tgccatgagc agccatgagt tgctcctcac tgagctcatg tttgacaatg 3301 cactgagcac cctgcggcct gaggagattg ctgccttgct ctctggcctg gtctgccaga 3361 gccctgggga cgctggggat cagctcccaa acaccctcaa gcagggaata gaacgtgtcc 3421 gggctgtggc caagcggatt ggtgaggtcc aggtggcttg tggcctgaac cagacggtgg 3481 aggaatttgt gggggagctg aattttgggc tggttgaggt tgtatatgag tgggcccggg 3541 gcatgccctt ctccgagttg gcagggctct cagggacccc tgagggcctg gtggtccgct 3601 gcattcagcg cctggctgag atgtgtcgct cactgcgggg ggcagcccgc ctggtaggag 3661 agcctgtgct gggtgccaag atggagacag cggctacctt gctacggcgg gacatcgtat 3721 ttgcggccag cctctacacc cagtgaatgc cccatgtaaa aacatgatga taaaacagca 3781 aaaaaaaaaa aaaaaa // LOCUS HSSLN2 1417 bp DNA PRI 25-NOV-1997 DEFINITION Human sarcolipin (SLN) gene, exon 2 and complete cds. ACCESSION U96093 NID g1943763 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1417) AUTHORS Odermatt,A., Taschner,P.E.M., Scherer,S.W., Beatty,B., Khanna,V.K., Cornblath,D.R., Chaudhry,V., Yee,W.C., Schrank,B., Karpati,G., Breuning,M.H., Knoers,N. and MacLennan,D.H. TITLE Characterization of the gene encoding human sarcolipin (SLN), a proteolipid associated with SERCA1: absence of structural mutations in five patients with brody disease JOURNAL Genomics 45 (3), 541-553 (1997) MEDLINE 98035878 REFERENCE 2 (bases 1 to 1417) AUTHORS Odermatt,A. and MacLennan,D.H. TITLE Direct Submission JOURNAL Submitted (01-APR-1997) Banting and Best Department of Medical Research, Best Institute, 112 College St., Toronto, Ontario M5G 1L6, Canada FEATURES Location/Qualifiers source 1..1417 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR join(U96092:973..1066,240..314) /gene="SLN" gene join(U96092:973..1387,1..861) /gene="SLN" mRNA join(U96092:973..1066,240..861) /gene="SLN" exon 240..861 /gene="SLN" /number=2 CDS 315..410 /gene="SLN" /note="proteolipid" /codon_start=1 /product="sarcolipin" /db_xref="PID:g2642411" /translation="MGINTRELFLNFTIVLITVILMWLLVRSYQY" 3'UTR 411..861 /gene="SLN" BASE COUNT 402 a 308 c 272 g 435 t ORIGIN 1 ccctccttaa ttacctaatt ttaacttatt tacctcttaa aagaccctat cttcaaacac 61 agtcacattc tgagatactg tgagctaggg ctttgacatg tagatttttg ggggacacaa 121 tttagcccat cacactctcc ttttccacaa cacttctgtt tcctttgagg aaagaacggg 181 cattgttata caggaatgcc cattaatctc cttgtgtttt ctttgttatg ttttatcagg 241 aggtgaggac aagccagagg tccttggtgt gccctcagaa atctgcctgc agttctcacc 301 aagccgctgt gaaaatgggg ataaacaccc gggagctgtt tctcaacttc actattgtct 361 tgattacggt tattcttatg tggctccttg tgaggtccta tcagtactga gaggccatgc 421 catggtcctg ggattgactg agatgctccg gagctgcctg ctctatgccc tgagacccca 481 ctgctgtcat tgtcacagga tgccattctc catccgaggg cacctgtgac ctgcactcac 541 aatatctgct atgctgtagt gctaggattg attatgtgtt ctccaaagat gctgctccca 601 agggctgcca agtgtttgcc agggaacggt agatttattc cccaactctt aactgaaaat 661 gtgttagaca agccacaaag ttaaaattaa actggattca tgatgatgta ggattgttac 721 aagcccctga tctgtctcac cacacatccc ttcaacccac acggtctgca accaaactct 781 aattcaacct gccagaagga atgttagagg aagtctttgt cagcccttat agctatcatg 841 tgaataaagt taagtcaact tcaaaaacaa cttctagaac ttattttagc ttccatgtgt 901 gacagagcat ttgacccttg gctgggattg gagtgacaag tgctaccgta tttctagcat 961 ttgaggtaag ccaagatgct ccaactgctg aagatttgaa accaagtcaa cacactgtgt 1021 catatttcaa gtaattccat tggttcagcg ctcctcaaac ttttccccta aactagtctg 1081 aagggcagag ggagaataaa tccattccac tacggggtct gaagcacagg ctgaattgct 1141 ggctaaaagt gcaacatttc tttgaagtct tgtgttttat ctttagaatc cacaagaaat 1201 gtattttcta tcttataata tcttcatgtt tgttttcata taaatattta aaattattta 1261 cactaagtaa cacaagaaca tgagtcatgt ccctaagagt agcatagtct attttgatta 1321 ttgttattac acaaatggag ctagtcttta atcaactaca gttctaaaag gaggaaaata 1381 gaaaatgcaa acttatatgt ttataaaaga catataa // LOCUS HSSPHAR 1530 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens SPHAR gene for cyclin-related protein. ACCESSION X82554 NID g575271 KEYWORDS cyclin-related gene; SPHAR gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1530) AUTHORS Digweed,M., Gunthert,U., Schneider,R., Seyschab,H., Friedl,R. and Sperling,K. TITLE Irreversible repression of DNA synthesis in Fanconi anemia cells is alleviated by the product of a novel cyclin-related gene JOURNAL Mol. Cell. Biol. 15 (1), 305-314 (1995) MEDLINE 95098005 REFERENCE 2 (bases 1 to 1530) AUTHORS Digweed,M. TITLE Direct Submission JOURNAL Submitted (31-OCT-1994) M. Digweed, Institut fuer Humangenetik, Freie Universitaet Berlin, Heubnerweg 6, 14059 Berlin, FRG FEATURES Location/Qualifiers source 1..1530 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="cDNA, partial genomic" /clone="pSPHAR, pSPHAR-G, pSPHAR-G7, pSPHAR-G11" /chromosome="8" /map="p22-q11" gene 886..1530 /gene="SPHAR" promoter 886..910 /gene="SPHAR" mRNA 929..1530 /gene="SPHAR" /evidence=experimental CDS 1164..1355 /gene="SPHAR" /codon_start=1 /evidence=experimental /db_xref="PID:g575272" /db_xref="SWISS-PROT:Q15513" /translation="MTRIKISVCICFRYFEFCFFYALNILFQKVSEANSQTELLLRPH CKNILFNVSFMIDLQAAHF" polyA_signal 1510..1516 /gene="SPHAR" BASE COUNT 456 a 261 c 251 g 562 t ORIGIN 1 aattaaatgg acaacaccgt tagatgtgta tgtaaaaatt ttctgtttca tatttttcct 61 ttcactttcg gtttagaaca tgctatatgt actgtatgtc ctgtggccca gtgcggctcc 121 acagcatgga atctgatgta tgatatgata gaatgtggca ctaaatgcag tttcagattt 181 tattttttta atcatatgaa ctaaaattgt caattgtgag ttgtgctttc tcatcatgtt 241 ggttatattg cacaattggt tatatttatg acctgatatt caaagactct ggcattgata 301 gccagtgtgt tttcttattt aactccgttt actacattct acatggtgtt tacgtgatcc 361 acacttgaaa tactagatca gtagacattc actaatatac caaaataaaa tgaaaaattg 421 agtttttccg tgaactttat actgtccagc tctgttgatt ttaaagcttc ttcatccagg 481 tcagttcagg aagtatatct ggagtacctg gctctgtttt tggctgtgag actagcacta 541 aggattctgg tacctttacc caaacctact gggctactaa tacttctctc agcagttgaa 601 tcaaatacaa tagaccatgt aagctggggc cgctcatcca cttccagttt gctggtctcc 661 ctgctagaag acacattgta ctgtgctttt tctggaattc acgataatgg catcactgcc 721 tgtttttcac atcttttgtt tcctgttcat tttaaggaaa cctactaaat ccagttaata 781 ttaaatggac accactcatt aagaaatttc tttatggctt ctgcctgaat acttaaaatg 841 ccttactaca gttatccagt tgacatgttt ttaattcata taaggtatat tgggtatatt 901 gaagtatata ttgtattaca aagacttgtt cttgtatttt aaaatgtcag tgcaaaaaat 961 atatggtgga acctttcttt aaagttgaaa tgcagtatta tttaaatctg aaaggttaaa 1021 aagctttctt caccttatat atgttcttcc actgtgactt tttagttgaa gactagtaaa 1081 ttaactttta gttagaagat gcctactgct tttgttgttt attttaatca gcagagcaca 1141 gagacacata aaaactctgg gaaatgacta ggataaaaat atcagtatgt atctgtttta 1201 gatattttga gttttgcttt ttttatgcct tgaatatttt atttcaaaaa gtatctgaag 1261 caaattctca gactgaacta cttcttagac ctcactgtaa gaatatttta ttcaatgtct 1321 catttatgat agatttgcaa gctgctcatt tttgaacagc tttttgcatg ggataggagc 1381 atgtctattc taacacatca gcttattcaa aagcaagaat tttaaaaata agataaatgt 1441 aaagttgttt tataaacgat cctgttaatt aaaccacaga caccatatat ccttctgcat 1501 cctttggcca ataaaagttg ctggagaacc // LOCUS HSSPOT14 461 bp DNA PRI 17-FEB-1997 DEFINITION H.sapiens spot14 gene. ACCESSION Y08409 NID g1568568 KEYWORDS spot14 gene; Spot14 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 461) AUTHORS Grillasca,J.P., Gastaldi,M., Khiri,H., Dace,A., Peyrol,N., Reynier,P., Torresani,J. and Planells,R. TITLE Cloning and initial characterization of human and mouse Spot 14 genes JOURNAL FEBS Lett. 401 (1), 38-42 (1997) MEDLINE 97157517 REFERENCE 2 (bases 1 to 461) AUTHORS Planells,R. TITLE Direct Submission JOURNAL Submitted (23-SEP-1996) R. Planells, INSERM Unite 38, Faculte de Medecine, 27 boulevard Jean Moulin, F- 13385 Marseille Cedex, FRANCE FEATURES Location/Qualifiers source 1..461 /organism="Homo sapiens" /isolate="3 unrelated patients" /db_xref="taxon:9606" /dev_stage="adult" gene 11..451 /gene="Spot14" CDS 11..451 /gene="Spot14" /codon_start=1 /product="Spot14 protein" /db_xref="PID:e268226" /db_xref="PID:g1568569" /translation="MQVLTKRYPKNCLLTVMDRYAAEVHNMEQVVMIPSLLRDVQLSG PGGQAQAEAPDLYTYFTMLKAICVDVDHGLLPREEWQAKVAGSEENGTAETEEVEDES ASGELDLEAQFHLHFSSLHHILMHLTEKAQEVTRKYQEMTGQVW" BASE COUNT 114 a 128 c 146 g 73 t ORIGIN 1 ggaagcaacc atgcaggtgc taaccaagcg ttaccccaag aactgcctgc tgaccgtcat 61 ggaccggtat gcagccgagg tgcacaacat ggagcaggtg gtgatgatcc ccagccttct 121 gcgggacgtg cagctgagtg ggcctggggg ccaggcccag gctgaggccc ctgatctcta 181 cacctacttc accatgctca aggccatctg tgtggatgtg gaccatgggc tgctgccgcg 241 ggaggagtgg caggccaagg tggcaggcag cgaagagaat ggaaccgcag agacagagga 301 agtcgaggac gagagtgcct caggagagct ggacctggaa gcccagttcc acctgcactt 361 ctccagcctc catcacatcc tcatgcacct caccgagaaa gcccaggagg tgacaaggaa 421 ataccaggaa atgacgggac aagtttggta gaccttggac a // LOCUS HSTPRCGEN 2053 bp DNA PRI 03-MAR-1997 DEFINITION H.sapiens TPRC gene. ACCESSION X99720 NID g1869817 KEYWORDS TPRC gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2053) AUTHORS Weterman,M.A., Wilbrink,M. and Geurts van Kessel,A. TITLE Fusion of the transcription factor TFE3 gene to a novel gene, PRCC, in t(X;1)(p11;q21)-positive papillary renal cell carcinomas JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (26), 15294-15298 (1996) MEDLINE 97140324 REFERENCE 2 (bases 1 to 2053) AUTHORS Weterman,M. TITLE Direct Submission JOURNAL Submitted (01-AUG-1996) M. Weterman, University of Nijmegen, Dept of Biochemistry, PO Box 9101, 6500 HB Nijmegen, NETHERLANDS FEATURES Location/Qualifiers source 1..2053 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CL89-12117" /map="1q21" /chromosome="1" /cell_type="renal carcinoma" mRNA 1..2053 /gene="TPRC" gene 1..2053 /gene="TPRC" CDS 213..1688 /gene="TPRC" /codon_start=1 /db_xref="PID:e257626" /db_xref="PID:g1869818" /translation="MSLVAYASSDESEPDEAEPEPEEEEAVAPTSGPALGGLFASLPA PKGPALLPPPPQMLAPAFPPPLLLPPPTGDPRLQPPPPLPFGLGGFPPPPGVSPAEAA GVGEGLGLGLPSPRGPGLNLPPPIGGAGPPLGLPKPKKRKEPVKIAAPELHKGDSDSE EDEPTKKKTILQGSSEGTGLSALLPQPKNLTVKETNRLLLPHAFSRKPSDGSPDTKPS RLASKTKTSSLAPVVGTTTTTPSPSAIKAAAKSAALQVTKQITQEEDDSDEEVAPENF FSLPEKAEPPGVEPYPYPIPTVPEELPPGTEPEPAFQDDAANAPLEFKMAAGSSGAPW MPKPGDDYSYNQFSTYGDANAAGAYYQDYYSGGYYPAQDPALVPPQEIAPDASFIDDE AFKRLQGKRNRGREEINFVEIKGDDQLSGAQQWMTKSLTEEKTMKSFSKKKGEQPTGQ QRRKHQITYLIHQAKERELELKNTWSENKLSRRQTQAKYGF" BASE COUNT 468 a 620 c 553 g 412 t ORIGIN 1 gggactagga gcttaagtga agaggtacgc cttgttcggt ggaaatcagc cgtagccatg 61 agtttctgcc ggggctagcc ctagagtacg gagcaggcgg acttttcggt tccccgcccc 121 gccaggtggc ggggcctact aggcctccgg gcatccccgg tctcaagtag gcctcatctg 181 ccggcaaggg cgcccgaaac gcgggaggcg ccatgtcgct ggttgcttac gccagcagcg 241 atgagagcga gccggatgag gctgagcccg agccggagga agaggaggcg gtggctccta 301 catctgggcc cgctttaggg ggcttgttcg cttctctccc tgcgcccaag ggtccggcct 361 tgctgcctcc gccccctcag atgctggcgc cagcctttcc cccgccgctg ttgcttcccc 421 cacccaccgg agaccccagg cttcagcctc ctcccccctt gcccttcggc ctgggaggct 481 tccccccacc tccaggcgtg agcccggctg aagcggcggg agttggggag ggactgggat 541 tggggttgcc ctcgccccga ggccctggcc tcaatctgcc ccctccaatt ggcggtgccg 601 gtcccccgct ggggcttccc aagccaaaga agaggaaaga gcccgtgaag atcgcggcgc 661 cggagttgca taagggagat tcagattctg aggaagatga acccacaaag aagaaaacta 721 tccttcaggg atccagtgag gggactggtt tgtctgcctt gcttccccaa cctaaaaacc 781 tgactgtgaa agagactaac aggttgctcc tgccccatgc cttctcccgc aaaccctcgg 841 atggctcccc tgatactaag ccctccagac tggcttctaa gaccaagact tcctctcttg 901 cccctgttgt gggcaccaca accaccactc cgtcgccctc tgctatcaag gctgctgcca 961 agagtgctgc cctgcaggtg acaaagcaga tcacgcagga agaagacgac agtgatgagg 1021 aagtagcccc cgaaaacttt ttctccctcc ctgaaaaggc tgagccacct ggagttgagc 1081 cataccctta ccccatcccc actgtccctg aagagctgcc tccaggcacg gaaccagagc 1141 cggctttcca ggacgatgca gccaatgccc cccttgaatt caagatggca gcaggttcaa 1201 gtggggcccc ttggatgcct aagcctgggg acgactacag ctacaatcag ttttccacat 1261 atggcgatgc caatgccgct ggtgcttatt atcaggatta ttacagtggt ggctactatc 1321 ctgcacagga cccggccctg gtcccccccc aggaaattgc cccagatgcc tccttcatcg 1381 atgacgaagc atttaagcgg ctgcagggca agaggaaccg agggagagaa gaaatcaact 1441 ttgtggagat caaaggtgat gaccagctca gtggggccca gcaatggatg actaagtcat 1501 tgacagaaga gaaaaccatg aagtcattca gcaaaaagaa aggtgagcag ccaacaggcc 1561 agcagcggcg gaaacaccag atcacatatc ttattcatca ggccaaggag cgggagctgg 1621 aactgaagaa cacctggtca gagaacaagc tcagccgccg tcagacccaa gccaaatatg 1681 gattctaggg ctctggaact gattgctccc aggatctcct gccagcccag ctggcctggc 1741 ccccagcttc acctctggga ccccagctgc tctaagccca ggatctcttt ccccaaggac 1801 ccagccctcg cctctgcgag aatgaacata tttgatagat ttttcttaac aagttagaaa 1861 attcagctcc tttctgtcct ggagctagca aagacttgtg tgatgcctcc gaaggggctc 1921 tgagttctgg ggtgggagtt ttgctctctg tcaggtgtga taaaatgttg aaccctcccc 1981 accaccactt tttttttttt aaaccaggga tgtctgttga aataaaacat tcagtctgac 2041 aaacaaaaaa aaa // LOCUS HSTRANSPO 1287 bp DNA PRI 20-JAN-1998 DEFINITION H.sapiens gene encoding transposase protein. ACCESSION X94948 NID g2808444 KEYWORDS insertion site; mariner-like transposon; transposase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1287) AUTHORS Bigot,Y.B. and Auge-Gouillou,C. TITLE Human and other eukaryotic mariner-like transposons are located in structurally similar sites JOURNAL Unpublished REFERENCE 2 (bases 1 to 1287) AUTHORS Bigot,Y.B. TITLE Direct Submission JOURNAL Submitted (10-JAN-1996) Y.B. Bigot, IBEAS, Faculte des Sciences, parc de Gremont, 37200 Tours, FRANCE FEATURES Location/Qualifiers source 1..1287 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1..54 /note="5'insertion site" misc_feature 55..89 /note="inverted terminal repeat" CDS 186..1190 /note="mariner-like transposon; cecropia sub-family" /codon_start=1 /product="transposase" /db_xref="PID:e218705" /db_xref="PID:g2808445" /translation="MMLNKKKIRVIFLFEFKMGHKAAEITRNMNNTFGPGTANETYSA VVASRSFAKEESLEDEERDWRPLEVDNDQFESNHQLNYMRNCQKTSTLTILRSLGSFN KLESKWVPHELKSKIFKKYPFLKCLLLFYYNNNEPFLDQIVTCDEKWILYDNWRLPTQ WLDREEAKKHFPKPNLHQKKVMVNVWWSAAGVIHYSFLNPGETITSEKYAQQINEMHR KLQRLQPALVNRKGPILLHNNPQQHVAQPTLQKLNELGYKVLPHPPYSPDLLPTNYHF LEGLNNFLQGKRFHNQQDAENAFQEFVESQSTDFYATGINKLISRWQKCVDCNGSYFD " misc_feature complement(1187..1259) /note="3'insertion site" misc_feature 1235..1258 /note="inverted terminal repeat" BASE COUNT 417 a 247 c 245 g 376 t 2 others ORIGIN 1 ggttggtgca aaagtaattg tggtttttgc actgttggaa tttgccattt gatattagga 61 tactttctta aataaatgtg gttatgttat acatcatttt aatgggcatt tctgctttat 121 nacttatttt tttgctaatg acttattact tgctgtttat tttgtattta ttttagactg 181 tgnaaatgat gttaaacaaa aagaaaattc gagtgatttt cttattcgag ttcaaaatgg 241 gtcataaagc agcagagata actcgaaaca tgaacaacac atttggccca ggaactgcta 301 atgaaacata cagtgcagtg gtggcttcaa gaagttttgc aaaggaagag agccttgaag 361 atgaggaacg agattggcgg ccattggaag ttgacaacga ccaatttgag agcaatcatc 421 aattgaacta catgagaaat tgtcagaaga cctcaacgtt gactattcta cggtcattag 481 ggtcgttcaa caaattggaa agtaagtggg tccctcatga gctgaagtca aaaattttta 541 aaaaatatcc atttttaaag tgtcttctct tattctacta taacaacaac gaaccatttc 601 ttgatcagat tgtgacatgc gatgaaaagt ggattttata tgacaactgg cgactaccaa 661 ctcagtggtt ggaccgagaa gaagctaaaa agcacttccc aaagccaaac ttgcaccaaa 721 aaaaggtcat ggtcaatgtt tggtggtctg ctgccggtgt gatccactac agctttctga 781 atcctggtga aaccattaca tctgagaagt atgctcagca aatcaatgag atgcaccgaa 841 aactgcaacg cctgcagccg gcattggtca acagaaaggg cccaattctt ctccacaaca 901 acccccaaca gcatgtcgca caaccaacgc ttcaaaagtt gaatgaattg ggctacaaag 961 ttttgcctca tccaccgtat tcacctgacc tcttgccaac caactaccac ttcttagaag 1021 gtctcaacaa ctttttacag ggaaaacgct tccacaacca gcaggatgca gaaaatgctt 1081 tccaagagtt cgtcgaatcc caaagcacgg atttttacgc tacaggaata aacaaactta 1141 tttctcgttg gcaaaaatgt gttgattgta atggttccta ttttgattaa taaagatgtg 1201 tttgagccta gttataatga tttaaaatta atggtccaaa acactttatt taaacaatta 1261 cttttgcacc aacctataac acagtga // LOCUS HSTWISTGN 2870 bp DNA PRI 22-JAN-1998 DEFINITION H.sapiens twist gene. ACCESSION X91662 NID g999455 KEYWORDS twist gene; twist protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2870) AUTHORS Wang,S.M., Coljee,V.W., Pignolo,R.J., Rotenberg,M.O., Cristofalo,V.J. and Sierra,F. TITLE Cloning of the human twist gene: its expression is retained in adult mesodermally-derived tissues JOURNAL Gene 187 (1), 83-92 (1997) MEDLINE 97225800 REFERENCE 2 (bases 1 to 2870) AUTHORS Cristofalo,V.J. TITLE Direct Submission JOURNAL Submitted (21-SEP-1995) V.J. Cristofalo, Medical College of Pennsylvania and Hahnemann University, 2900 Queen Lane, Philadelphia, PA 19129, USA FEATURES Location/Qualifiers source 1..2870 /organism="Homo sapiens" /strain="WI-38" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="lung" /clone_lib="genomic lambda FixII" TATA_signal 793..797 exon 825..1788 /number=1 gene 1141..1746 /gene="twist" CDS 1141..1746 /gene="twist" /codon_start=1 /product="TWIST protein" /db_xref="PID:g999456" /db_xref="SWISS-PROT:Q15672" /translation="MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRTSRR TAGGGAGPGGAGGGVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGSSSGGGSPQSY EELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQ VLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH" intron 1789..2324 /number=1 polyA_signal 2388..2394 polyA_signal 2738..2744 BASE COUNT 583 a 886 c 856 g 545 t ORIGIN 1 cattggactg ggtttccttc caccgaagag tgaacttctg cctctttcga gcaccttccg 61 aggcgtagtc ctttggatgt tggggagcgt cagactgggt cgttgtagag gggaaaggag 121 ggcccagaag ggcgagagag caggccggga cgcaaatcct cagcccccgc ggcgcgccac 181 gtcttcagaa acgccaggac ctccgggctg ggccgccgcg gtttggcctt tggaactcaa 241 gggttcgtct acctgaccat tgggtggctc cgcggttgac acttttcttg gcatgccccc 301 ccaccccgcg ccacaccacc cccccagccc cagcaatcca aatcggcccc acggacctag 361 agggctcttg ggcgagatga gacatcaccc actgtgtaga agctgttgcc attgctgctg 421 tcacagccac tccggatggg gctgccaccg tggccaggac agtctcctcc gaccgcttcc 481 tgggctgcgc tagggttcgg gggcgctgcc cgcacgctcc ggcggggaag gaaatcgccc 541 cgcgcccgcc ggaggaaggc gacggggagg gaagggggag ggcggctagg aggcgggtgg 601 aggggccggc cgcccgggcc aggtcgtttt tgaatggttt gggaggacga attgttagac 661 cccgaggaag ggaggtggga cgggggaggg ggactggaaa gcggaaactt tcctataaaa 721 cttcgaaaag tccctcctcc tcacgtcagg ccaatgacac tgctgccccc aaactttccg 781 cctgcacgga ggtataagag cctccaagtc tgcagctctc gcccaactcc cagacacctc 841 gcgggctctg cagcaccggc accgtttcca ggaggcctgg cggggtgtgc gtccagccgt 901 tgggcgcttt ctttttggga cctcggggcc atccacaccg tcccctcccc ctcccgcctc 961 cctccccgcc tcccccgcgc gccctccccg cggaggtccc tcccgtccgt cctcctgctc 1021 tctcctccgc gggccgcatc gcccgggccg gcgccgcgcc ggggggaagc tggcgggctg 1081 aggcgccccg ctcttctcct ctgccccggg cccgcgaggc cacgcgtcgc cgctcgagag 1141 atgatgcagg acgtgtccag ctcgccagtc tcgccggccg acgacagcct gagcaacagc 1201 gaggaagagc cagaccggca gcagccgccg agcggcaagc gcgggggacg caagcggcgc 1261 acgagcaggc gcacggcggg cggcggcgcg gggcccggcg gagcgggtgg gggcgtcgga 1321 ggcggcgacg agccgggcag cccggcccag ggcaagcgcg gcaagaagtc tgcgggctgt 1381 ggcggcggcg gcggcgcggg cggcggcggc ggcagcagca gcggcggcgg gagtccgcag 1441 tcttacgagg agctgcagac gcagcgggtc atggccaacg tgcgggagcg ccagcgcacc 1501 cagtcgctga acgaggcgtt cgccgcgctg cggaagatca tccccacgct gccctcggac 1561 aagctgagca agattcagac cctcaagctg gcggccaggt acatcgactt cctctaccag 1621 gtcctccaga gcgacgagct ggactccaag atggcaagct gcagctatgt ggctcacgag 1681 cggctcagct acgccttctc ggtctggagg atggaggggg cctggtccat gtccgcgtcc 1741 cactagcagg cggagccccc caccccctca gcagggccgg agacctaggt aaggaccgcg 1801 ccgctgcacc ccttcgcctc tcaggtggca gacggcaggc cggccaggcc gcggttccca 1861 gtccacctcg atttcctccc ctctcccact ctccgctcag ccttcccacc tcacttggca 1921 ccgttgcctc gcgccccagc gtcccggaag gccggtctga ccccgctagg gagagcagtc 1981 tccaggggga tgcgccctgg tgaggggtgt gtgtgcgcgt gagtgtgcgt gacaggaggg 2041 gagacagaga cacccagggt cacgggtaag gaccgttttg tcagcgccac cctttctttc 2101 ggctttcaat ttttgttctc cttaaaacaa atgttttaaa acaaattcca cctcctcctc 2161 ctttccaccc acccacttcc tcttgccctt gggctgaaat ccttccaggt tgttcagctt 2221 aatttctcag tggtggtgat aagaacagtg ctcactagtc ttagaaaaca gccgcagaga 2281 cctaaacaat aaccgactcc ccccccccct ctgggttttt gcagatgtca ttgtttccag 2341 agaaggagaa aatggacagt ctagagactc tggagctgga taactaaaaa taaaaatata 2401 tgccaaagat tttcttggaa attagaagag caaaatccaa attcaaagaa acagggcgtg 2461 gggcgcactt ttaaaagaga aagcgagaca ggcccgtgga cagtgattcc cagacgggca 2521 gcggcaccat cctcacacct ctgcattctg atagaagtct gaacagttgt ttgtgttttt 2581 tttttttttt tttgacgaag aatgttttat ttttattttt ttcatgcatg cattctcaag 2641 aggtcgtgcc aatcagccac tgaaaggaaa ggcatcacta tggactttct ctattttaaa 2701 atggtaacaa tcagaggaac tataagaaca cctttagaaa taaaaatact gggatcaaac 2761 tggcctgcaa aaccatagtc agttaattct ttttttcatc cttcctctga ggggaaaaac 2821 aaaaaaaaac ttaaaataca aaaaacaaca ttctatttat ttattgagga // LOCUS HSU01212 3718 bp DNA PRI 03-AUG-1994 DEFINITION Human olfactory marker protein (OMP) gene, complete cds. ACCESSION U01212 NID g520739 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3718) AUTHORS Buiakova,O.I., Rama Krishna,N.S., Getchell,T.V. and Margolis,F.L. TITLE Human and rodent OMP genes: Conservation of structural and regulatory motifs and cellular localization JOURNAL Genomics 20, 452-462 (1994) MEDLINE 94307732 REFERENCE 2 (bases 1 to 3718) AUTHORS Margolis,F.L. TITLE Direct Submission JOURNAL Submitted (02-SEP-1993) Frank L. Margolis, Roche Institute of Molecular Biology, 340 Kingsland Street, Nutley, NJ 07110-1199, USA FEATURES Location/Qualifiers source 1..3718 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HOMP2, HOMP3, HOMP5" /clone_lib="genomic library in EMBL3 from one caucasian female, Clontech Labs." /chromosome="11" /map="11q13.5" enhancer 449..459 /note="distal Olf-1 binding site" enhancer 510..534 /note="UBE binding site, putative NF-1 element" enhancer 976..986 /note="proximal Olf-1 binding site" CDS 1245..1736 /note="intronless open reading frame" /codon_start=1 /product="olfactory marker protein" /db_xref="PID:g520740" /translation="MAEDRPQQPQLDMPLVLDQGLTRQMRLRVESLKQRGEKRQDGEK LLQPAESVYRLNFTQQQRLQFERWNVVLDKPGKVTITGTSQNWTPDLTNLMTRQLLDP TAIFWRKEDSDAIDWNEADALEFGERLSDLAKIRKVMYFLVTFGEGVEPANLKASVVF NQL" polyA_signal 3593..3598 BASE COUNT 716 a 1063 c 1117 g 822 t ORIGIN 1 ggatcccact gattgattag ccaccatctc acaattgacc gtgcttgtgg tccactagac 61 ccttcagatt gttttccgtt gtgatacctt ggccttgact ctgtcctctt ttctgtgtgt 121 ggcgtgttgt ggcgaggggc gctccccaac tcccatcccc actctctccc caactccggc 181 tccactcaca ctccagttct ttcatttccc cagtataaag gctgagcttc tggttccgcc 241 ccgggccctg gggatataaa catttgccag attcttcctc ggcccctggg ggaaactgag 301 gattaattca ggtggagtaa gtggtgggat ttgggtagaa gtgaagcctt gtcctgttgt 361 ggccatggtg cagggctgcg gcacagccag ccatcagtgt catccgggtc agtaatgctc 421 aaggcacagt ccctggccca gcagcatgtc acctgggagg tggttaggaa tgcagattct 481 caggcccaca gagccctgat aaaccaggag ttctgggagg gggtccagca atctgtgtgt 541 taagtcctga gagtgagtct tgatgctcac tcaagtcttg agaaccacgg gtctgggtga 601 gagatacggt agctgggctg agatcctgtc aatgggactg gaggggaagg gtcccggggt 661 gtttgggaag cagaatcgac aggctttggt gattgggtgt ggaggagtga gagggaggcg 721 ggcgtcaggg gtagctccaa ggtttaactt aggtgacttc agatctccaa tcaccaagcc 781 ctctctggtc ctgccttctc cacctgctcc tgcgggtctt gcatcttctc ctgtgtacct 841 ccagtgagga gtggtcccca ccaccctccc catcagtgca cttacgaagt gctctcatct 901 tcacaaacaa gccagcaccc agcccagccc tggtagtcag ggcggttgcc acagcaattg 961 acatcagcga cctggtcccc aaggaacctg ccaccttccg cctgcctgca gggcctgcat 1021 tatcgcttct gcggggactg gagtggaggc agatggggac tcccacccct gacacacacc 1081 ccattttgag aactgagtgg ggctgggaag agccagtgcc aaagggaggg gaagagggaa 1141 gggcagaaag taggtggggc ccccctttgg tggcctcttc tctccacggc cccaggctcc 1201 agcccacttg ggtccttggc gttggtggca gcagcacttg ggccatggcg gaggacaggc 1261 cgcagcagcc gcagctggac atgccgctgg tcctggacca gggcctgacc aggcagatgc 1321 ggctacgcgt ggagagcctg aagcagcgcg gggagaagcg ccaggatggg gagaagctgc 1381 tgcagccagc ggagtctgtg taccgcctca acttcaccca gcagcagcgg ctacagttcg 1441 agcgctggaa tgtcgtgctg gacaagccgg gcaaggtcac catcacaggc acctcgcaga 1501 actggacgcc tgacctcacc aacctcatga cacgccagct gctggacccc actgccatct 1561 tctggcgcaa ggaggactcg gatgccatag attggaatga ggccgacgcc ctggagtttg 1621 gggagcgcct gtcggacctg gccaagatcc gcaaggtcat gtacttcctc gtcacctttg 1681 gcgagggtgt ggagcccgcc aacctcaagg cctccgtggt ttttaaccag ctctgacagc 1741 agctgccagc tgctgctctc ctctagccca cctgtgctct cccctgcccc tgccactttc 1801 ccccctgtat tttgggggcc attattctcg ctgctcagcc tgtcctctgc ttgcccagag 1861 gccccctgag tcccacacct ttcctcctct gcttctccct ggggccagca ctccagctca 1921 caggaagaag attctgaggc tccatagcct agaagctgga ctggctgctg cattgctata 1981 gacgatagag gcctactagg ggccagtgtg catggacagt gaggccaggg ccatctgcct 2041 tctctctgct tcattgtggg agagagagac tgagaaagac caagagagac acagagacag 2101 agattgaaaa acccagcatc cacttcctcc agagtcaggg agacagagat gatggggcgt 2161 ctccacgggg agtccagcaa gccggcattc actgctccct ggccttggtg ccctttgccg 2221 gagcctgtgt ctgggctgct ggtcccataa cacgtcgaca accctcagga tatggggcag 2281 ggttgctgca ggggtggatt tgggcagtgg agagtggctg gcaccctgga ggctgtgtag 2341 gcccagctgt ggctcttctg ggcctgactt cagggtggag aagtgaaggg ggaggttaca 2401 cagagatctg tctctacgca cacatatcca tgagacagag tgtgctgtat tcatatggat 2461 gtattctaga ggtctattcc taccctagga acaagtgcag ttttagatta tctgttcatc 2521 attgctgctg gttcaaggat ggctcttaac aggggcctgg tccggatgac cttggcctgg 2581 gggcttgctg agctaggaga ctgcagttca gatagtgaaa cagggagtgg attagtaaag 2641 ggggttccct ttgccttgag ggaagttgga gctggagaga gtggattctc cagggcctca 2701 ggtatcccct gctggggagt caggctcttt agagcttgca ggtcagggaa ggcaagtgct 2761 tcgtcctgac atagcatctg ttggcatttc ttgggcttct tcaatgcagc tgaggggggc 2821 agggcgaagg cgtggtgggc agttacgacg gctgatagtc ccaagtgggc tgcaggcggc 2881 agtggtgtga cggcagaatg gtaacctctg gggtcattgg atgcaactca ctcaccaaac 2941 agatggggaa actgaggcac aattttcatc agattcagtt ctgactctta gcctcattcc 3001 ccttcgcatt gcgcagtccc agagagcccc cccttttggg ggagtgcctg acctgcacct 3061 aacatcagcc aagtacagct aagccactgt ccccagcacc ctgacttaag gccagccctg 3121 tgttttgtcc tcagccagtc agggatgtgt ccaagacatt tcccctcatg aagcaaagct 3181 gtcaaggaac ttgccggctc tggaacagat gcactgaggg ccagagggtc agggccatcc 3241 cctgtggctg gggctgccgg gagggtgagc cccacctcgg aggtgtgcag gctggagcag 3301 catgctggag ctgagattct gtgggtgaga gagtgggaga gtgtctgtgg gctgagcact 3361 ggtcctttct gactcacagc tctggggccc attccgggac aggcttgaag aagtctcggc 3421 cattgcctgc cctgctgagc acgaggggag gccagaaccg tgtgcagtgg ccctgccctt 3481 ctgcttgagc tcttcctgca gctctgggga ccctcttagt cccgactgcc tgtctcccca 3541 gcctgtctgt cccggggcct gagtccctct gctgtgcccg ctgcaggtcc ccaataaagc 3601 ctgtgccctg gcctcggtgg tgtgcagtgt ctcgccatca gcccccatcc ctttcacaat 3661 ccctcacggc cccgagcact tgctccctgg ccacttccca cactccccca gcccttgc // LOCUS HSU03486 1260 bp DNA PRI 13-JAN-1995 DEFINITION Human connexin40 gene, complete cds. ACCESSION U03486 NID g416327 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1260) AUTHORS Kanter,H.L., Saffitz,J.E. and Beyer,E.C. TITLE Molecular cloning of two human cardiac gap junction proteins, connexin40 and connexin45 JOURNAL J. Mol. Cell. Cardiol. 26, 861-868 (1994) MEDLINE 95055780 REFERENCE 2 (bases 1 to 1260) AUTHORS Beyer,E.C. TITLE Direct Submission JOURNAL Submitted (16-NOV-1993) Eric C. Beyer, Pediatrics, Washington University School of Medicine, One Children's Place, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1260 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 65..1141 /codon_start=1 /function="gap junction channel protein" /product="connexin40" /db_xref="PID:g416328" /translation="MGDWSFLGNFLEEVHKHSTVVGKVWLTVLFIFRMLVLGTAAEST WGDEQADFRCDTIQPGCHNVCYDQAFPISHIRYWVLQIIFVSTPSLVYMGHAMHTVRM QEKRKLREAERAKEVRGSGSYEYPVAEKAELSCWEEGNGRIALQGTLLNTYVCSILIR TTMEVGFIVGQYFIYGIFLTTLHVCRRSPCPHPVNCYVSRPTEKNVFIVFMLAVAALS LLLSLAELYHLGWKKIRQRFVKPRQYMAKCQLSGPLWAIVQSCTPPPDFNQCLENGPG GKFFNPFSNNMASQQNTDNLVTEQVRGQEQTPGEGFIQVRYGQKPEVPNGVSPGHRLP HGYHSDKRRLSKASSKARSDDLSV" BASE COUNT 267 a 360 c 337 g 296 t ORIGIN 1 cttttctctc tttctctctc tcccatttgc agaagttttg gcatctgttc cctgctgtgc 61 caacatgggc gattggagct tcctgggaaa tttcctggag gaagtacaca agcactcgac 121 cgtggtaggc aaggtctggc tcactgtcct cttcatattc cgtatgctcg tgctgggcac 181 agctgctgag tctacctggg gggatgagca ggctgatttc cggtgtgata cgattcagcc 241 tggctgccac aatgtctgct acgaccaggc tttccccatc tcccacattc gctactgggt 301 gctgcagatc atcttcgtct ctacgccctc tctggtgtac atgggccacg ccatgcacac 361 tgtgcgcatg caggagaagc gcaagctacg ggaggccgag agggccaaag aggtccgggg 421 ctctggctct tacgagtacc cggtggcaga gaaggcagaa ctgtcctgct gggaggaagg 481 gaatggaagg attgccctcc agggcactct gctcaacacc tatgtgtgca gcatcctgat 541 ccgcaccacc atggaggtgg gcttcattgt gggccagtac ttcatctacg gaatcttcct 601 gaccaccctg catgtctgcc gcaggagtcc ctgtccccac ccggtcaact gttacgtatc 661 ccggcccaca gagaagaatg tcttcattgt ctttatgctg gctgtggctg cactgtccct 721 cctccttagc ctggctgaac tctaccacct gggctggaag aagatcagac agcgatttgt 781 caaaccgcgg cagtacatgg ctaagtgcca gctttctggc cctctgtggg ctatagtcca 841 gagctgcaca ccaccccccg actttaatca gtgcctggag aatggtcctg ggggaaaatt 901 cttcaatccc ttcagcaata atatggcctc ccaacaaaac acagacaacc tggtcaccga 961 gcaagtacga ggtcaggagc agactcctgg ggaaggtttc atccaggttc gttatggcca 1021 gaagcctgag gtgcccaatg gagtctcacc aggtcaccgc cttccccatg gctatcatag 1081 tgacaagcga cgtcttagta aggccagcag caaggcaagg tcagatgacc tatcagtgtg 1141 accctccttt atgggaggat caggaccagg tgggaacaaa ggaggctcag agaggaaaga 1201 cgtgtccctt ctgaactgat gctttctcac tgtcatcact gcttggctcc tttggcccgg // LOCUS HSU03493 1191 bp DNA PRI 13-JAN-1995 DEFINITION Human connexin45 gene, complete cds. ACCESSION U03493 NID g424133 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1191) AUTHORS Kanter,H.L., Saffitz,J.E. and Beyer,E.C. TITLE Molecular cloning of two human cardiac gap junction proteins, connexin40 and connexin45 JOURNAL J. Mol. Cell. Cardiol. 26, 861-868 (1994) MEDLINE 95055780 REFERENCE 2 (bases 1 to 1191) AUTHORS Beyer,E.C. TITLE Direct Submission JOURNAL Submitted (16-NOV-1993) Eric C. Beyer, Pediatrics, Washington University School of Medicine, One Children's Place, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1191 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1191 /codon_start=1 /function="gap junction channel protein" /product="connexin45" /db_xref="PID:g424134" /translation="MSWSFLTRLLEEIHNHSTFVGKIWLTVLIVFRIVLTAVGGESIY YDEQSKFVCNTEQPGCENVCYDAFAPLSHVRFWVFQIILVATPSVMYLGYAIHKIAKM EHGEADKKAARSKPYAMRWKQHRALEETEEDNEEDPMMYPEMELESDKENKEQSQPKP KHDGRRRIREDGLMKIYVLQLLARTVFEVGFLIGQYFLYGFQVHPFYVCSRLPCPHKI DCFISRPTEKTIFLLIMYGVTGLCLLLNIWEMLHLGFGTIRDSLNSKRRELEDPGAYN YPFTWNTPSAPPGYNIAVKPDQIQYTELSNAKIAYKQNKANTAQEQQYGSHEENLPAD LEALQREIRMAQERLDLAVQAYSHQNNPHGPREKKAKVGSKAGSNKSTASSKSGDGKN SVWI" BASE COUNT 329 a 276 c 306 g 280 t ORIGIN 1 atgagttgga gctttctgac tcgcctgcta gaggagattc acaaccattc cacatttgtg 61 gggaagatct ggctcactgt tctgattgtc ttccggatcg tccttacagc tgtaggagga 121 gaatccatct attacgatga gcaaagcaaa tttgtgtgca acacagaaca gccgggctgt 181 gagaatgtct gttatgatgc gtttgcacct ctctcccatg tacgcttctg ggtgttccag 241 atcatcctgg tggcaactcc ctctgtgatg tacctgggct atgctatcca caagattgcc 301 aaaatggagc acggtgaagc agacaagaag gcagctcgga gcaagcccta tgcaatgcgc 361 tggaaacaac accgggctct ggaagaaacg gaggaggaca acgaagagga tcctatgatg 421 tatccagaga tggagttaga aagtgataag gaaaataaag agcagagcca acccaaacct 481 aagcatgatg gccgacgacg gattcgggaa gatgggctca tgaaaatcta tgtgctgcag 541 ttgctggcaa ggaccgtgtt tgaggtgggt tttctgatag ggcagtattt tctgtatggc 601 ttccaagtcc acccgtttta tgtgtgcagc agacttcctt gtcctcataa gatagactgc 661 tttatttcta gacccactga aaagaccatc ttccttctga taatgtatgg tgttacaggc 721 ctttgcctct tgcttaacat ttgggagatg cttcatttag ggtttgggac cattcgagac 781 tcactaaaca gtaaaaggag ggaacttgag gatccgggtg cttataatta tcctttcact 841 tggaatacac catctgctcc ccctggctat aacattgctg tcaaaccaga tcaaatccag 901 tacaccgaac tgtccaatgc taagatcgcc tacaagcaaa acaaggccaa cacagcccag 961 gaacagcagt atggcagcca tgaggagaac ctcccagctg acctggaggc tctgcagcgg 1021 gagatcagga tggctcagga acgcttggat ctggcagttc aggcctacag tcaccaaaac 1081 aaccctcatg gtccccggga gaagaaggcc aaagtggggt ccaaagctgg gtccaacaaa 1141 agcactgcca gtagcaaatc aggggatggg aagaactctg tctggattta a // LOCUS HSU03866 1500 bp DNA PRI 29-MAR-1995 DEFINITION Human adrenergic alpha-1c receptor protein gene, complete cds. ACCESSION U03866 NID g494984 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1500) AUTHORS Forray,C., Bard,J.A., Wetzel,J.M., Chiu,G., Shapiro,E., Tang,R., Lepor,H., Hartig,P.R., Weinshank,R.L., Branchek,T.A. and Gluchowski,C. TITLE The alpha 1-adrenergic receptor that mediates smooth muscle contraction in human prostate has the pharmacological properties of the cloned human alpha 1c subtype JOURNAL Mol. Pharmacol. 45 (4), 703-708 (1994) MEDLINE 94239386 REFERENCE 2 (bases 1 to 1500) AUTHORS Tseng-Crank,J.C., Goetz,A., Saussy,D., Roberson,K.M., Hazum,S., Haizlip,J. Godinot N., Wisely,B., Robertson,C.N. and Kost,T. TITLE The adrenergic receptor subtypes and benign prostatic hyperplasia: Cloning and functional expression of the human a1C receptor JOURNAL Unpublished REFERENCE 3 (bases 1 to 1500) AUTHORS Hirasawa,A., Horie,K., Tanaka,T., Takagaki,K., Murai,M., Yano,J. and Tsujimoto,G. TITLE Cloning, functional expression and tissue distribution of human cDNA for the alpha 1C-adrenergic receptor JOURNAL Biochem. Biophys. Res. Commun. 195 (2), 902-909 (1993) MEDLINE 93384619 REFERENCE 4 (bases 1 to 1500) AUTHORS Nawoschik,S.P. and Bard,J.A. TITLE Direct Submission JOURNAL Submitted (30-NOV-1993) S.P. Nawoschik and Jonathan A. Bard, Synaptic Pharmaceutical Corporation, Molecular Biology, 215 College Rd., Paramus, NJ 07652, USA FEATURES Location/Qualifiers source 1..1500 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hH20; #180; hl06" /clone_lib="human lymphocyte (genomic); human hippocampus (cDNA)" /tissue_type="lymphocyte and hippocampus" 5'UTR 1..65 conflict 15..16 /citation=[2] /replace="CG" conflict 31 /citation=[2] /citation=[3] /replace="g" conflict 60 /citation=[2] /replace="" CDS 66..1466 /citation=[1] /codon_start=1 /product="adrenergic alpha-1c receptor protein" /db_xref="PID:g494985" /translation="MVFLSGNASDSSNCTQPPAPVNISKAILLGVILGGLILFGVLGN ILVILSVACHRHLHSVTHYYIVNLAVADLLLTSTVLPFSAIFEVLGYWAFGRVFCNIW AAVDVLCCTASIMGLCIISIDRYIGVSYPLRYPTIVTQRRGLMALLCVWALSLVISIG PLFGWRQPAPEDETICQINEEPGYVLFSALGSFYLPLAIILVMYCRVYVVAKRESRGL KSGLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGI VVGCFVLCWLPFFLVMPIGSFFPDFKPSETVFKIVFWLGYLNSCINPIIYPCSSQEFK KAFQNVLRIQCLCRKQSSKHALGYTLHPPSQAVEGQHKDMVRIPVGSRETFYRISKTD GVCEWKFFSSMPRGSARITVSKDQSSCTTARVRSKSFLQVCCCVGPSTPSLDKNHQVP TIKVHTISLSENGEEV" conflict 192 /citation=[2] /replace="t" conflict 451..452 /citation=[2] /replace="cg" conflict 1104 /citation=[3] /replace="c" conflict 1140 /citation=[2] /replace="c" conflict 1356 /citation=[3] /replace="g" conflict 1389 /citation=[2] /replace="t" 3'UTR 1467..1500 BASE COUNT 308 a 475 c 395 g 322 t ORIGIN 1 ccgcctccgc gccagcccgg gaggtggccc tgacagccgg acctcgcccg gccccggctg 61 ggaccatggt gtttctctcg ggaaatgctt ccgacagctc caactgcacc caaccgccgg 121 caccggtgaa catttccaag gccattctgc tcggggtgat cttggggggc ctcattcttt 181 tcggggtgct gggtaacatc ctagtgatcc tctccgtagc ctgtcaccga cacctgcact 241 cagtcacgca ctactacatc gtcaacctgg cggtggccga cctcctgctc acctccacgg 301 tgctgccctt ctccgccatc ttcgaggtcc taggctactg ggccttcggc agggtcttct 361 gcaacatctg ggcggcagtg gatgtgctgt gctgcaccgc gtccatcatg ggcctctgca 421 tcatctccat cgaccgctac atcggcgtga gctacccgct gcgctaccca accatcgtca 481 cccagaggag gggtctcatg gctctgctct gcgtctgggc actctccctg gtcatatcca 541 ttggacccct gttcggctgg aggcagccgg cccccgagga cgagaccatc tgccagatca 601 acgaggagcc gggctacgtg ctcttctcag cgctgggctc cttctacctg cctctggcca 661 tcatcctggt catgtactgc cgcgtctacg tggtggccaa gagggagagc cggggcctca 721 agtctggcct caagaccgac aagtcggact cggagcaagt gacgctccgc atccatcgga 781 aaaacgcccc ggcaggaggc agcgggatgg ccagcgccaa gaccaagacg cacttctcag 841 tgaggctcct caagttctcc cgggagaaga aagcggccaa aacgctgggc atcgtggtcg 901 gctgcttcgt cctctgctgg ctgccttttt tcttagtcat gcccattggg tctttcttcc 961 ctgatttcaa gccctctgaa acagttttta aaatagtatt ttggctcgga tatctaaaca 1021 gctgcatcaa ccccatcata tacccatgct ccagccaaga gttcaaaaag gcctttcaga 1081 atgtcttgag aatccagtgt ctctgcagaa agcagtcttc caaacatgcc ctgggctaca 1141 ccctgcaccc gcccagccag gccgtggaag ggcaacacaa ggacatggtg cgcatccccg 1201 tgggatcaag agagaccttc tacaggatct ccaagacgga tggcgtttgt gaatggaaat 1261 ttttctcttc catgccccgt ggatctgcca ggattacagt gtccaaagac caatcctcct 1321 gtaccacagc ccgggtgaga agtaaaagct ttttgcaggt ctgctgctgt gtagggccct 1381 caacccccag ccttgacaag aaccatcaag ttccaaccat taaggtccac accatctccc 1441 tcagtgagaa cggggaggaa gtctaggaca ggaaagatgc agaggaaagg ggaatatctt // LOCUS HSU08997 3272 bp DNA PRI 23-AUG-1994 DEFINITION Human glutamate dehydrogenase gene, complete cds. ACCESSION U08997 NID g478987 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shashidharan,P., Michaelidis,T.M., Robakis,N.K., Kresovali,A., Papamatheakis,J. and Plaitakis,A. TITLE Novel human glutamate dehydrogenase expressed in neural and testicular tissues and encoded by an X-linked intronless gene JOURNAL J. Biol. Chem. 269 (24), 16971-16976 (1994) MEDLINE 94266921 REFERENCE 2 (bases 1 to 3272) AUTHORS Shashidharan,P. TITLE Direct Submission JOURNAL Submitted (18-APR-1994) Pullanipally Shashidharan, Mount Sinai School of Medicine, Neurology, One Gustave L. Levy Place, New York, NY 10029, USA FEATURES Location/Qualifiers source 1..3272 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RETGDH" /clone_lib="EMBL4" /tissue_type="retina" CDS 309..1985 /codon_start=1 /product="glutamate dehydrogenase" /db_xref="PID:g478988" /translation="MYRYLAKALLPSRAGPAALGSAANHSAALLGRGRGQPAAASQPG LALAARRHYSELVADREDDPNFFKMVEGFFDRGASIVEDKLVKDLRTQESEEQKRNRV RGILRIIKPCNHVLSLSFPIRRDDGSWEVIEGYRAQHSQHRTPCKGGIRYSTDVSVDE VKALASLMTYKCAVVDVPFGGAKAGVKINPKNYTENELEKITRRFTMELAKKGFIGPG VDVPAPDMNTGEREMSWIADTYASTIGHYDINAHACVTGKPISQGGIHGRISATGRGV FHGIENFINQASYMSILGMTPGFRDKTFVVQGFGNVGLHSMRYLHRFGAKCIAVGESD GSIWNPDGIDPKELEDFKLQHGSILGFPKAKPYEGSILEVDCDILIPAATEKQLTKSN APRVKAKIIAEGANGPTTPEADKIFLERNILVIPDLYLNAGGVTVSYFEWLKNLNHVS YGRLTFKYERDSNYHLLLSVQESLERKFGKHGGTIPIVPTAEFQDSISGASEKDIVHS ALAYTMERSARQIMHTAMKYNLGLDLRTAAYVNAIEKVFKVYSEAGVTFT" BASE COUNT 882 a 762 c 788 g 840 t ORIGIN 1 ccaaagctct gggtaattat taaacattag aagtatagaa caaacaggcc agccaggccc 61 ggcgaggccg cggcgggtcc ggcgccccgg acctccggac ccggaggtcc tggcgccctg 121 gttggcgccc tgcccccaaa gtccgtcctc cccgttaggt ggcgcccaag gggaggggac 181 agccgggcag gcaggaagct gcggcttaaa agggcaaccc gcgccggacc cttccttcct 241 agtcgcgggg agtctgagaa agcgcacctg ttccgcgacc gtcacgcacc cctcctccgc 301 ctgccgcgat gtaccgctac ctggccaaag cgctgctgcc gtcccgggcc gggcccgctg 361 ccctgggctc cgcggccaac cactcggccg cgttgctggg ccggggccgc ggacagcccg 421 ccgccgcctc gcagccgggg ctcgcattgg ccgcccggcg ccactacagc gagttggtgg 481 ccgaccgcga ggacgacccc aacttcttca agatggtgga gggcttcttc gatcgcggcg 541 ccagcatcgt ggaggacaag ttggtgaagg acctgaggac ccaggaaagc gaggagcaga 601 agcggaaccg ggtgcgcggc atcctgcgga tcatcaagcc ctgcaaccat gtgctgagtc 661 tctccttccc catccggcgc gacgacggct cctgggaggt catcgaaggc taccgggccc 721 agcacagcca gcaccgcacg ccctgcaagg gaggtatccg ttacagcact gatgtgagtg 781 tagatgaagt aaaagctttg gcttctctga tgacatacaa gtgtgcagtg gttgatgtgc 841 cgtttggggg tgctaaagct ggtgttaaga tcaatcccaa gaactatacc gaaaatgaat 901 tggaaaagat cacaaggagg ttcaccatgg agctagcaaa gaagggcttt attggtcctg 961 gcgttgatgt gcctgctcca gacatgaaca caggtgagcg ggagatgtcc tggattgctg 1021 atacctatgc cagcaccata gggcactatg atattaatgc acacgcctgt gttactggta 1081 aacccatcag ccaaggggga atccatggac gcatctctgc tactggccgt ggtgtcttcc 1141 atgggattga aaacttcatc aatcaagctt cttacatgag cattttagga atgacaccag 1201 ggtttagaga taaaacattt gttgttcagg gatttggtaa tgtgggccta cactctatga 1261 gatatttaca tcgttttggt gctaaatgta ttgctgttgg tgagtctgat gggagtatat 1321 ggaatccaga tggtattgac ccaaaggaac tggaagactt caaattgcaa catgggtcca 1381 ttctgggctt ccccaaggca aagccctatg aaggaagcat cttggaggtc gactgtgaca 1441 tactgatccc agctgccact gagaagcagt tgaccaaatc caacgcaccc agagtcaaag 1501 ccaagatcat tgctgaaggt gccaatgggc caacaactcc agaagctgat aagatcttcc 1561 tggagagaaa cattttggtt attccagatc tctacttgaa tgctggagga gtgacagtat 1621 cttactttga gtggctgaag aatctaaatc atgtcagcta tggccgtttg accttcaaat 1681 atgaaaggga ttctaactac cacttgctcc tgtctgttca agagagttta gaaagaaaat 1741 ttggaaagca tggtggaact attcccattg tacccacggc agagttccaa gacagtatat 1801 cgggtgcatc tgagaaagac atcgtgcact ctgccttggc atacacaatg gagcgttctg 1861 ccaggcaaat tatgcacaca gccatgaagt ataacctggg attggacctg agaacagctg 1921 cctatgtcaa tgccattgaa aaagtcttca aagtgtacag tgaagctggt gtgaccttca 1981 catagatgga tcatggctga cttcctcact aacctcttca cgtgtaactt ctgcagacct 2041 accacaagtt tacatgtaac cacagaaatc cctttctctc ctgacatcat tactaatgga 2101 taccattctc aacaagtcaa tccaaatcag cccgttaagg agaaagaaat taatatacaa 2161 gctgagtgtg aaagtagaaa tcacctacac cagagagcta ttttggtatt ttgcctttaa 2221 ataaaaagcc tcctccatat ggctgtgcag ccttgctctg tggcttttcc cagcacaatc 2281 agtgctagtg ctggggaagg gacagtcaag agcagtcagt tgcttactta ttttgctctg 2341 gatgagtctg ggacacgctg taactttaac acatttaaga agaaggtgtg tggccttttc 2401 agaaggtggc atggtcctca agtgagttct tagtatttta tatcagcaaa ataactcaat 2461 tttgcagatt gcaaacaaat ataaaagctg tttctgttta tgaattttat tcttttagaa 2521 tagaataagt acatgctgct gtaataaaat tgcctttaat cacttaacaa gcctaacctt 2581 gactcagtga atgcctataa aaataataaa tgaaaaaaaa ccagtatttt tatatcataa 2641 aagtttcatt tgtagcttat cattcatgta ttgtccagca gacattaaaa gccctgtgga 2701 taattacgtt atcttcatac ctgcaaaacg gtggaggcat tttcgttaaa actgtcagaa 2761 ttcgctgtta taattatgac acatagtcca aagaatgcag taaccttttt atcatgttaa 2821 ctaattgttc tcttttgaag atctatggtt gactaattaa acaataattc aagtagagtg 2881 tcccagaaaa aaaccacttg ggctccctgt ttggagtctg gctggctctg agcattgcca 2941 atggccccta ctcacctgac tttgtatcct cttcttttag aggctttgca ttctgcaccc 3001 agcttcacta acagtgggct gaaaccaacc ttgggttgag tgtttcattt gggagttatt 3061 tggccagggc cttttgaaca aatagtgtcc ccatgaagtg ctagataata tatgtgtaag 3121 aatcagcttt ttttttttta actataatat ccttcagaaa tttctaacta ctttgtaact 3181 gcatggctta acctggtgat aaaagcagtt attaaaagtc tagatttttt tttaaaaaaa 3241 agatagaaca aataatggga gattgggtta ac // LOCUS HSU10116 10079 bp DNA PRI 18-FEB-1995 DEFINITION Human superoxide dismutase (SOD3) gene, complete cds. ACCESSION U10116 NID g529149 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10079) AUTHORS Folz,R.J. and Crapo,J.D. TITLE Extracellular superoxide dismutase (SOD3): tissue-specific expression, genomic characterization, and computer-assisted sequence analysis of the human EC SOD gene JOURNAL Genomics 22 (1), 162-171 (1994) MEDLINE 95048365 REFERENCE 2 (bases 1 to 10079) AUTHORS Folz,R.J. TITLE Direct Submission JOURNAL Submitted (27-MAY-1994) Rodney J. Folz, Medicine, Duke University Medical Center, Bell Building, Room 250, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..10079 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="7" /clone_lib="Adult female leukocyte library" /chromosome="4" /sex="female" /cell_type="leukocyte" /tissue_type="blood" 5'UTR 1..558 mRNA join(1..563,1136..1219,5069..6405) exon 559..563 intron 564..1135 exon 1136..1219 intron 1220..5068 exon 5069..6404 gene 5085..5807 /gene="SOD3" CDS 5085..5807 /gene="SOD3" /codon_start=1 /product="superoxide dismutase" /db_xref="PID:g529150" /translation="MLALLCSCLLLAAGASDAWTGEDSAEPNSDSAEWIRDMYAKVTE IWQEVMQRRDDDGTLHAACQVQPSATLDAAQPRVTGVVLFRQLAPRAKLDAFFALEGF PTEPNSSSRAIHVHQFGDLSQGCESTGPHYNPLAVPHPQHPGDFGNFAVRDGSLWRYR AGLAASLAGPHSIVGRAVVVHAGEDDLGRGGNQASVENGNAGRRLACCVVGVCGPGLW ERQAREHSERKKRRRESECKAA" 3'UTR 5808..6405 polyA_signal 6386..6391 polyA_site 6405 BASE COUNT 2482 a 2612 c 2408 g 2577 t ORIGIN 1 ggatccagag atttagattt tttataagct ttcctgccac cgaaacgggt gtttgggacc 61 tcacgaggcc ctgttcattc ttcgtcgctg cgctccccac tctgtactgg atgcatttac 121 tgacgttgtt gtctccgtcc ccagagtatg aacccccaag gtgactcatg cagctgtggg 181 tgcccggcat acagcatggt gactggaatg gatgagcacc caataaacat ttgttgcagg 241 aatgcaggag gacgggcagg ccagcaagca ggctgcctgg tttttcccac atgggctttt 301 ctgggaaaga agagcttcta tttttggaaa gggctgctat gattgagaaa agttcatggc 361 agcaaaaaaa ggacagacgt cgggagggaa acactcctag ttctcccaga caacacattt 421 tttaaaaaga ctccttcatc tctttaataa taacggtaac gacaatgaca atgatgatta 481 cttatgagtg cggctagtgc cagccactgt gttgtcactg ggcgagtaat gatctcattg 541 gatcttcacg gtgggcgtgc ggggtggaca gcctcacacc cccattttac agatgatgaa 601 aaggaggtgc agggagtggt gcagctgctt caggcgtaca cagataggaa gtgacaaggc 661 tgggactctg cagcctgagt gtgtcatcac gacccacccg ctgctctgct ctcataggta 721 tgacagcaca gctctggagc aaatgccatg cacatttgca aggtgcccat ttccatgcag 781 caaaaataag tcaataagtt attgacttag agaaaagcaa agggcctctc aataaagagg 841 tcattgtaca cctctccaaa caggcgattt tctttctcat ttttattccc ctgctgtgtg 901 ctgaaggtca ctggctacaa gccggtgaag tcgcggaatg gaatccttgg cccgaaaacc 961 caaaaatggg aggggcagag gaggtgggga cagagcggga ggaggtggag gcgaagcaat 1021 tctacaaccc ggggaggtct ggcctgcttt tcctccctga actggcccaa tgactggctc 1081 cctcacgctg accactcctc tgggctggcc tcctgcactc gcgctaacag cccaggctcc 1141 agggacagcc tgcgttcctg ggctggctgg gtgcagctct cttttcagga gagaaagctc 1201 tcttggagga gctggaaagg tgggtgctaa gttgaggttc attttgttct tctcggagtg 1261 tgcttattga gtctgaagct gggttggggc aacgggcctc ttcttgggaa caaattggat 1321 catcttcttg ggaaggaaat gtactttccc tggctgctct gaggggttag tggggaggtg 1381 gagtgagcgg ggaggaaggc aaggagggga ggaagaaacc gttcctcctg tggatctgca 1441 aagaccagtc caagaggatt ttagtgttag gaaaaggaat ctggagtgac gagaaagggg 1501 gcctttctag atgttgcatg gctttggtgt cgggagccac ttatgggaca gcaggtactc 1561 taaaaagcca cctccttagg aaagcagaga ggccctggcc agctcaggct cccagcaaga 1621 gctccttcta ggagacagct gagggatgaa acacacccaa ggctcaagag gggcaggttc 1681 ttcccagata cagacccagg aaggagataa aggcttggtg cctctatttg gttcaggata 1741 agggcccctg tcctctttct ctgataacac tgtcctcttt ctctgataac accgtcctcc 1801 cttccagatc cacgtacaaa ggaggccctt aaaaaggcac ttggtcattc acagctcaaa 1861 ctgagcaaga ggctgtggga gaagaatcaa gttggtcccg aggggaagag gtgtcaaagg 1921 cttaagaaac aagaagtcag agtttacctg ggtttgaggg agaattttct ttcccccttt 1981 tcctcctcct cctccttctt ctcttttttt tttttttttt tttttttttt ttgagacatg 2041 gtctcattct gtcacccagc acccaggctg gaatgtagtg gcacgatcac tatcacggct 2101 cactacagcc tctacctccc gggctcaagt gatcctccta cctcagcctc ctgagtaact 2161 gggactacag gcacatgcca ccacacccag ctattttttt ttttgctaga gatgggggtc 2221 tctaccaggt tggtctcata ctcttgtact caaatgattc tcctgtctca tcctcccaaa 2281 gggtgggatt acaggcataa gccaccatgc ctggctcttc ttttggtttc agagaaaaac 2341 atctccttaa aatgtttatt tcccaaggat tcttgaaaaa gaaagctcac tgacacaccc 2401 aaaacaatct ggttttgctc tgtgctttta gggagaactt tctaagcagc agagcccttc 2461 tgagtggcag ggctgtctta ggaggaaggt gtcttttgat gatggggaac ttcatgtcca 2521 ggtctggcag gagagttacc ccactttcct gcctactccc tggggctttg gggtagtagt 2581 accacattgg gccatgtcat ttaggtgagt ccttcaacat cactttctct gcttctccct 2641 ctttctggat cctccttctt ggagcctttc aaggggacct cctctcacag tgtccatagc 2701 atctcttagc taatggtcct taaaatctct accagcagct tctctctgat agctaagagc 2761 tgccatttac tgggaacttt ctatgtactg ggctctgtgc taagtgccct agatgagaga 2821 tgtgcagtgt ggtgcctaaa ccttgggctt ggagcagaca cacactttca aatcctgcct 2881 tcagctcctt agtgaacatg tcaccttggg cgggacacac gcctctctgt gcctcagttt 2941 cctacacttt agaatgggga taacactgaa taatgttctt gtgaggatgc agggaattaa 3001 cccacgcaca gtacttataa tagtgtctgg cgcctgtgtt cgataagttt tagcaattct 3061 aatcatctct tttaagcctc gcagcaagcc tctaaggtaa gtctgtatta gtatccctat 3121 ttacagatga gaaaactgag gttcacaggg gatgagacag tgtacagtct gcagtccagc 3181 aattactctg ctactcagca ataaaaatag taacagctaa cccttagact aagtggcaga 3241 gtcaggcttt agattcatga ggtgagttct ggaatccatc cctttaataa ccacactaaa 3301 ttgcctttct gaaatggtta tataaagcat atctacccaa tcttggagtt ttttaaatgg 3361 cacctagttt ggtgctggaa atgcagttga ccttcaaagc aattctttgg aggcagcatc 3421 aatccctctg gaaatacctc ggtggcatgg ctggccttat tctacaggta aggaacttga 3481 agctaagcat cagtaacccc gtgaagtcac agttagtata ggttggaatt gggattcaaa 3541 tctgtacctg actttataat tcctagctgg gccccagaat ctttgataga ggtgtcttct 3601 ttcttttctt ttctttcttt cctctttctt tcccttcctt cctctctctc tgtctttctt 3661 ctctcctttc tttctcacag aatcaaaatc tcttggggtg gggcctgggc atctgatttt 3721 taaaaaccag acatctgatg tgcagtcaac actgagaacc cctgccagct tcatctcctc 3781 ttctaagtgc cagacccaag tttccaactg tctgcccacc tgtctcccca cctgggcacc 3841 cgccagcgtc tcaccctcag gagactccag ctgaactaat cctctctccc tgcttttcca 3901 gaacaggtcc caccctccct ccactcagtc tctcctgctg ggaaccctgg tcatctgcac 3961 tgtgccttca tcttccatcc tgccagtgct gcccggtgtg tctcttaaac ccatgcctcc 4021 tctgtgtgca ccacctgcac tttggtaaaa gccttcattt cctgcttggg ttactacaac 4081 gccccctaac tcatctcact gtctctattt ctgcttctct gtctctccct aggctactcc 4141 cattcttcct cccctttcct cttcatccca aagtccaacc catatccttt taccagtagg 4201 acttaaggaa ctaaagacta tctcatcacc cacttttctt cttaaaaact tccactgcac 4261 tgcctgctga gatggccttc ctacccaact tggctggaaa actcctaccc atcttgtgga 4321 acccagttca aaagtcacca cctctgagaa gccttccctg aggctcctag ggagatgggt 4381 actgcctcct ctgtccttct ccagcacagg ccccatcttc aatcacagga ttgtgctgga 4441 atgattggat gccaagtctg tccctcactg aactccttat gcaaaatcca tattatatgt 4501 ttccttttgc caggtgtggg cccaggtgct ggggataccg atgaataaaa ctgagtttct 4561 gtcttcaaga agctccaagt ctactgagtg tagcagagaa cagggagaag gcacttcagg 4621 gagaaggggt agcacatgca aagccccaga aggcagggac agaagcctta gggatgtctg 4681 tgggggagga tggaggaaga gggtaacagg agaccaggtg gggagatgag ggaggtggtc 4741 tggaagggcc atgagacacc cctcacgctc cctgagaccc cctccacgct atagagatgg 4801 gactggagag gacgatgatc atttgtgact cagatccctg tgggtttctt cagattgggt 4861 ctcacccatc tttacagcca cagcacctaa cacagtgccc ggcacacagc aggccctaga 4921 caaacgtttg ccacatgaag tcatgccact ggccaggaag cccactgggg actggggggt 4981 tggttctgcg ataatggggt ccctgagatt ctatgtttca cgtgactaag cctcactctg 5041 cccccacctc cgcgggggcg tcccgcaggt gcccgactcc agccatgctg gcgctactgt 5101 gttcctgcct gctcctggca gccggtgcct cggacgcctg gacgggcgag gactcggcgg 5161 agcccaactc tgactcggcg gagtggatcc gagacatgta cgccaaggtc acggagatct 5221 ggcaggaggt catgcagcgg cgggacgacg acggcacgct ccacgccgcc tgccaggtgc 5281 agccgtcggc cacgctggac gccgcgcagc cccgggtgac cggcgtcgtc ctcttccggc 5341 agcttgcgcc ccgcgccaag ctcgacgcct tcttcgccct ggagggcttc ccgaccgagc 5401 cgaacagctc cagccgcgcc atccacgtgc accagttcgg ggacctgagc cagggctgcg 5461 agtccaccgg gccccactac aacccgctgg ccgtgccgca cccgcagcac ccgggcgact 5521 tcggcaactt cgcggtccgc gacggcagcc tctggaggta ccgcgccggc ctggccgcct 5581 cgctcgcggg cccgcactcc atcgtgggcc gggccgtggt cgtccacgct ggcgaggacg 5641 acctgggccg cggcggcaac caggccagcg tggagaacgg gaacgcgggc cggcggctgg 5701 cctgctgcgt ggtgggcgtg tgcgggcccg ggctctggga gcgccaggcg cgggagcact 5761 cagagcgcaa gaagcggcgg cgcgagagcg agtgcaaggc cgcctgagcg cggcccccac 5821 ccggcggcgg ccagggaccc ccgaggcccc cctctgcctt tgagcttctc ctctgctcca 5881 acagacacct tccactctga ggtctcacct tcgcctctgc tgaagtctcc ccgcagccct 5941 ctccacccag aggtctccct ataccgagac ccaccatcct tccatcctga ggaccgcccc 6001 aaccctcgga gccccccact cagtaggtct gaaggcctcc atttgtaccg aaacaccccg 6061 ctcacgctga cagcctccta ggctccctga ggtacctttc cacccagacc ctccttcccc 6121 accccataag ccctgagact cccgcctttg acctgacgat cttccccctt cccgccttca 6181 ggttcctcct aggcgctcag aggccgctct ggggggttgc ctcgagtccc cccacccctc 6241 cccacccacc accgctcccg cggcaagcca gcccgtgcaa cggaagccag gccaactgcc 6301 ccgcgtcttc agctgtttcg catccaccgc caccccactg agagctgctc ctttggggga 6361 atgtttggca acctttgtgt tacagattaa aaattcagca attcagtact gcgtcgaggt 6421 cttggttact tttttgtttg tttgttttag gcttctctcc caagctgagc ttttttttgt 6481 tttgttttcg ttttcctttt ttttcttttt tttgggagtg gcaaacatgc ttcccaaatc 6541 cctacaggac ttctccttat cctctgcccc cacctcccta accctgctgg caacaacgtt 6601 cagccactgc ttgtcttgcc cttcagtgtg gctccaagag gaagatcacc agaatcactc 6661 agggaagtta aaaaaaaaaa tacagcttcc tgggctacat cccagagctg tggaatccaa 6721 agggagaaga gaaagtgaat ttgcgacaag cgtcgggatg attctggcac tggaccctct 6781 ggcctgagag gggaagaggc cttccatctc acctgggctg gtagcttgtc acatctgcct 6841 ccgagtacag ccttaggtcc atttcccaga tatcagagac agtgccaggg aagccaggtg 6901 actgcatctt gcctaggcac agaagagtag ggttggaatg tgacgttgtt agcatttggc 6961 aggaccaaaa ccagaggcaa acggaggcag tgggatggaa aggcagttga ttttgatgaa 7021 ggcttgttgg gagttcagct ttcttttgaa acttataatc tatacccagg ctagaacagt 7081 cttgtgtata caccttcatt catggaataa acgtacttgc aataactttt tagcctccca 7141 gggtagcctc acttcctagc tgtgactttt ccaccctggt tactgggagg cagcttccat 7201 ttctcccaga ctagctaggc agtgcgtcca actgaaccgc agccagaaac ctgtctccag 7261 gggttatttt tacctctaac taggactaac ttattttaaa atctttcctt gagcccaagt 7321 gacaactgaa gagaaaggct attgcctggt gattttgctc caccagttgg ttctcactgg 7381 tttgaatact aacttgaact gtactcatcg acactgaaag gggatgagca aacagtgtct 7441 ctaaatctcc tgatcctgat ctcaaatatc cccctaatta caagttgcaa caaggcagct 7501 attacacggg gacacaggat ggagaggatg ggtgccaaac acccatcgtc tactctgctg 7561 cctcggttat ggtgaattca ggaccatcaa gggaggtgtg gacctttttt ttcagaagga 7621 ggctgacact tcttgtcaat tgcattgtgt tcttagtttt gctcttcaca acccttgacc 7681 ccgtagatgg gggctgaaga ggcaccctgg ccgactcact ctatttctgt tttgggaatg 7741 ggatggataa actatcccat ggcctccaga gccaaaaaac caaaacgaaa caaaacaaaa 7801 aaccccaaaa caaaaaagca aaaagcaaac aagaaaaaaa aaaaaagagg aaataatagg 7861 cagacaattt acagttcatt gtaagggcaa agatatgcat atagcatgat ggttaacagg 7921 tcaggctcag gtagaaaggc ccatttgaac cccagctctg ccacactcag aaactgtgtg 7981 acccgaacaa gtcacttaac ctctctgagc ataggtaaaa taagatcatc ataccagatt 8041 gttttgaaga ttaaatcaag tgttattcac gagaggtgca cagcatagca tgcacaacaa 8101 ataaggacct ggtaagtatc taattaataa caatggctaa gatccaaaaa acagctacct 8161 actaataaat agatggggct gccttgtaag gcagtgagca tcatgcaacc aggattcaaa 8221 tgaaggacag ttgctacctc tgaggttccc gagaaggatt tctcgatcca ttgagagact 8281 gaatgacatg aactctgcga tcccatctct tgtggggagg gaacctagaa tgaagggaag 8341 attgtgggcc ataaaggcag acatctggtt cctgggcaca gaaccatatg tgtgccacca 8401 aagccaccca ccggacccca cttggcccct ggagtctatt tttactcctc tcatcttaca 8461 agatctattt tgttaatctc cttatatttg ctgttttgac ttcccagcca gcttgctaat 8521 cagtttgcct atttgactca cagggtttgc atttgtcacg gggactgaaa cacacgcttg 8581 ttttgatttc tttttgtaaa ttagaagcgt tgatgtaatg actctaccta gacacagctg 8641 gtaaagtgag aataatgctc aagtttgcac agtttaaaca caatgtagac aataattaga 8701 aatgctatct ttagatgttt aggataagct tttctcagaa ttgcactgat tttttttttc 8761 tgagtggggc tttttagtgc atatatacag aaatactaaa aacgtaagaa aatagagcaa 8821 atcagtgagt gctttggtca acttgaaaga ctgcaggaaa taaaccaact gattttagat 8881 ctgccttttt ttgactgaat gcataaaatc tttacattct ccatattttt catgactacc 8941 atatgatcaa atagttttag gtgacagatt gcaactgata agttgctgca atatggcaga 9001 agtcatgctc agcctccgct tgcccggtgg tgagggtgga atatgaagca aacaataaag 9061 ataattcatc atctctatca ggaaaattgc cacatgttta tttcaggtaa caaaaaagat 9121 atagttatga tatacaatga ccatagaatc caataaagca acttctgcaa atgaatagaa 9181 ggtacttttt ctttaaatga aactacaaaa tagcagctgg ttttaaaaac aaagccaatt 9241 gttttagatt taataggcta ccactggcct ctgctaagat ccccaaatat attcctgagc 9301 tcacatagat tccagaaagt caaacttttc aatattatgc aaactttccc tatgcatcca 9361 aaaaattctc atttagtaaa gaggtgatat gaaatgtaag gcagcatgtc catatctatc 9421 attttaaatt gccttcatgc tgtatcaact ggttttgttt tgggaagcaa ccataatatt 9481 gagagacggg tctttcctat tttttctgct actcatttct aactagattc actacggagc 9541 tcccaattgc atctctctga tctacaaatt tttctctctt caggaagaca cctggaaaga 9601 agggactaca ttaaaggagt gtgttggggg caatgctttg gccttttgac atcctatcta 9661 gtctgaaggg accctcacta ttgctaagga ggaggagtgt tttaaatgga ggcttcagaa 9721 tgaaagcaga ggaagaaggt actctctttt tcaaaaagaa ggagggtaca ggccgggcgc 9781 agctgtcacg cctgcaatcc cagcactttg ggaggccgag gaaggcagat cacgaggttg 9841 ggagtttgag ccagcctggt caacatagtg aaaccccgtc tctactaaaa atacaaaaat 9901 tagccagcat ggtggtgcat gcctgtagtc ccagttactc gggaggctga ggcaggagaa 9961 tcgcttgaac tcgggaagtg gaggttgcag tgagccgaga tcatgccact gcactccacc 10021 ctgggtgaca gagtgagact ctcaaaaaaa aaaaaaaaaa aaaaaagaag tagggtacc // LOCUS HSU10273 2476 bp DNA PRI 01-FEB-1995 DEFINITION Human angiotensin II receptor type 2 subtype gene, complete cds. ACCESSION U10273 NID g607811 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2476) AUTHORS Koike,G., Horiuchi,M., Yamada,T., Szpirer,C., Jacob,H.J. and Dzau,V.J. TITLE Human type 2 angiotensin II receptor gene: cloned, mapped to the X chromosome, and its mRNA is expressed in the human lung JOURNAL Biochem. Biophys. Res. Commun. 203 (3), 1842-1850 (1994) MEDLINE 95032069 REFERENCE 2 (bases 1 to 2476) AUTHORS Koike,G. TITLE Direct Submission JOURNAL Submitted (01-JUN-1994) George Koike, Falk Cardiovascular Research Center, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, CA 94305-5246, USA FEATURES Location/Qualifiers source 1..2476 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pGK1111" /chromosome="X" /sex="female" /cell_type="leukocyte" /tissue_type="blood" /dev_stage="adult" CDS 190..1281 /codon_start=1 /product="angiotensin II receptor type 2 subtype" /db_xref="PID:g607812" /translation="MKGNSTLATTSKNITSGLHFGLVNISGNNESTLNCSQKPSDKHL DAIPILYYIIFVIGFLVNIVVVTLFCCQKGPKKVSSIYIFNLAVADLLLLATLPLWAT YYSYRYDWLFGPVMCKVFGSFLTLNMFASIFFITCMSVDRYQSVIYPFLSQRRNPWQA SYIVPLVWCMACLSSLPTFYFRDVRTIEYLGVNACIMAFPPEKYAQWSAGIALMKNIL GFIIPLIFIATCYFGIRKHLLKTNSYGKNRITRDQVLKMAAAVVLAFIICWLPFHVLT FLDALAWMGVINSCEVIAVIDLALPFAILLGFTNSCVNPFLYCFVGNRFQQKLRSVFR VPITWLQGKRESMSCRKSSSLREMETFVS" BASE COUNT 684 a 448 c 458 g 886 t ORIGIN 1 gaattcttgt tttacaagcc atggctctgt ttcttaatgt tttctataat cactcacttt 61 ttttttgctt ttgacaaaca ttcaaaatgc taatgattca aggatgtcct cagctctgta 121 tgtgttctaa gagttctatg ttttttctcc acagaaggca taagaactag gagctgctga 181 catttcaata tgaagggcaa ctccaccctt gccactacta gcaaaaacat taccagcggt 241 cttcacttcg ggcttgtgaa catctctggc aacaatgagt ctaccttgaa ctgttcacag 301 aaaccatcag ataagcattt agatgcaatt cctattcttt actacattat atttgtaatt 361 ggatttctgg tcaatattgt cgtggttaca ctgttttgtt gtcaaaaggg tcctaaaaag 421 gtttctagca tatacatctt caacctcgct gtggctgatt tactcctttt ggctactctt 481 cctctatggg caacctatta ttcttataga tatgactggc tctttggacc tgtgatgtgc 541 aaagtttttg gttcttttct taccctgaac atgtttgcaa gcattttttt tatcacctgc 601 atgagtgttg ataggtacca atctgtcatc tacccctttc tgtctcaaag aagaaatccc 661 tggcaagcat cttatatagt tccccttgtt tggtgtatgg cctgtttgtc ctcattgcca 721 acattttatt ttcgagacgt cagaaccatt gaatacttag gagtgaatgc ttgcattatg 781 gctttcccac ctgagaaata tgcccaatgg tcagctggga ttgccttaat gaaaaatatc 841 cttggtttta ttatcccttt aatattcata gcaacatgct attttggaat tagaaaacac 901 ttactgaaga cgaatagcta tgggaagaac aggataaccc gtgaccaagt cctgaagatg 961 gcagctgctg ttgttctggc cttcatcatt tgctggcttc ccttccatgt tctgaccttc 1021 ctggatgctc tggcctggat gggtgtcatt aatagctgcg aagttatagc agtcattgac 1081 ctggcacttc cttttgccat cctcttggga ttcaccaaca gctgcgttaa tccgtttctg 1141 tattgttttg ttggaaaccg gttccaacag aagctccgca gtgtgtttag ggttccaatt 1201 acttggctcc aagggaaaag agagagtatg tcttgccgga aaagcagttc tcttagagaa 1261 atggagacct ttgtgtctta aacgtgagag caaaatgcat gtaatcaaca tggctacttg 1321 ctttgaggct caccagaatt atttttaagt ggttttaata aaataataaa atttccccta 1381 atcttttctg aatcttctga aaccaaatgt aactatgttt tatcgtccag tgactttcag 1441 gaattgccca ttgtttttct gatatgtttg tacaagattt tcattggtga gacatattta 1501 caacctagaa gtaactggtg atatatctca aattgtaatt aataatagat tgtgaataat 1561 gatttgggga ttcagatttc tctttgaaac atgcttgtgt ttcttagtgg ggttttatat 1621 ccatttttat caggatttcc tcttgaacca gaaccagtct ttcaactcat tgcatcattt 1681 acaagacaac attgtaagag agatgagcac ttctaagttg agtatattat aatagattag 1741 tactggatta ttcaggcttt aggcatatgc ttctttaaaa acgctataaa ttatattcct 1801 cttgcatttc acttgagtgg aggtttatag ttaatctata actacatatt gaatagggct 1861 aggaatatag attaaatcat actcctatgc tttagcttat ttttacagtt atagaaagca 1921 agatgtacta taacatagaa ttgcaatcta taatatttgt gtgttcacta aactctgaat 1981 aagcactttt taaaaaactt tctactcatt ttaatgattg tttaaaggtt tctattttct 2041 ctgatacttt tttgaaatca gtaaacactg tgtattgttg taaaatgtaa aggtcacttt 2101 tcacatcctt gactttttag atgtgctgct ttgatatata ggacattgat ttgattttta 2161 ttattaatgc tttggttctg ggttgtttcc taaaatatct gggtggctta aaaaaaactc 2221 tttaacttgt aataaaccct taactggcat aggaaatggt atccagaatg gaattttgct 2281 acatggggtc tgggtggggg caaagagacc cagtcaatta catgtttggt accaagaaag 2341 gaacctgtca gggcagtaca atgtgacttt gaaaatatat accgtggggg tagttttacc 2401 ctatatctat aaacactgtt tgttccagaa tctgtatgat tctatggagc tattttaaac 2461 caattgcagg tctaga // LOCUS HSU10360 750 bp DNA PRI 18-NOV-1994 DEFINITION Human interferon-gamma gene, complete cds. ACCESSION U10360 NID g551490 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 750) AUTHORS Realini,C., Dubiel,W., Pratt,G., Ferrell,K. and Rechsteiner,M. TITLE Molecular cloning and expression of a gamma-interferon-inducible activator of the multicatalytic protease JOURNAL J. Biol. Chem. 269 (32), 20727-20732 (1994) MEDLINE 94327656 REFERENCE 2 (bases 1 to 750) AUTHORS Realini,C.A. TITLE Direct Submission JOURNAL Submitted (06-JUN-1994) Claudio A. Realini, Biochemistry, University of Utah, School of Medicine, 317 Wintrobe, Salt Lake City, UT 84132, USA FEATURES Location/Qualifiers source 1..750 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="erythrocyte" /tissue_type="blood" CDS 1..750 /note="identical to interferon-gamma coding sequence from GenBank Accession Number L07633" /codon_start=1 /product="interferon-gamma" /db_xref="PID:g551491" /translation="MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFL KEPALNEANLSNLKAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCG PVNCNEKIVVLLQRLKPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELM TSLHTKLEGFHTQISKYFSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVME IRNAYAVLYDIILKNFEKLKKPRGETKGMIY" BASE COUNT 217 a 166 c 216 g 151 t ORIGIN 1 atggccatgc tcagggtcca gcccgaggcc caagccaagg tggatgtgtt tcgtgaagac 61 ctctgtacca agacagagaa cctgctcggg agctatttcc ccaagaagat ttctgagctg 121 gatgcatttt taaaggagcc agctctcaat gaagccaact tgagcaatct gaaggcccca 181 ttggacatcc cagtgcctga tccagtcaag gagaaagaga aagaggagcg gaagaaacag 241 caggagaagg aagacaagga tgaaaagaag aagggggagg atgaagacaa aggtcctccc 301 tgtggcccag tgaactgcaa tgaaaagatc gtggtccttc tgcagcgctt gaagcctgag 361 atcaaggatg tcattgagca gctcaacctg gtcaccacct ggttgcagct gcagatacct 421 cggattgagg atggtaacaa ttttggagtg gctgtccagg agaaggtgtt tgagctgatg 481 accagcctcc acaccaagct agaaggcttc cacactcaaa tctctaagta tttctctgag 541 cgtggtgatg cagtgactaa agcagccaag cagccccatg tgggtgatta tcggcagctg 601 gtgcacgagc tggatgaggc agagtaccgg gacatccggc tgatggtcat ggagatccgc 661 aatgcttatg ctgtgttata tgacatcatc ctgaagaact tcgagaagct caagaagccc 721 aggggagaaa caaagggaat gatctattga // LOCUS HSU10554 7411 bp DNA PRI 29-DEC-1997 DEFINITION Homo sapiens vesicular acetylcholine transporter gene, complete cds. ACCESSION U10554 NID g2724153 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7411) AUTHORS Erickson,J.D., Varoqui,H., Schafer,M.K., Modi,W., Diebler,M.F., Weihe,E., Rand,J., Eiden,L.E., Bonner,T.I. and Usdin,T.B. TITLE Functional identification of a vesicular acetylcholine transporter and its expression from a 'cholinergic' gene locus JOURNAL J. Biol. Chem. 269 (35), 21929-21932 (1994) MEDLINE 94350930 REFERENCE 2 (bases 1 to 7411) AUTHORS Chireux,M.A., Le Van Thai,A. and Weber,M.J. TITLE Human choline acetyltransferase gene: localization of alternative first exons JOURNAL J. Neurosci. Res. 40 (4), 427-438 (1995) MEDLINE 95341717 REFERENCE 3 (bases 1 to 7411) AUTHORS Hahm,S.H., Chen,L., Patel,C., Erickson,J., Bonner,T.I., Weihe,E., Shafer,M.K.-H. and Eiden,L. TITLE The human cholinergic gene locus: upstream sequencing, in vivo transcription patterns, and role of the NRSE/RE-1 and other control elements in VAChT expression JOURNAL J. Mol. Neurosci. 9 (1998) In press REFERENCE 4 (bases 1 to 7411) AUTHORS Hahm,S.H., Chen,L., Patel,C., Erickson,J., Bonner,T.I., Weihe,E., Shafer,M.K.-H. and Eiden,L. TITLE Direct Submission JOURNAL Submitted (10-JUN-1994) Section on Molecular Neuroscience, Laboratory of Cellular and Molecular Regulation, NIMH, NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA REFERENCE 5 (bases 1 to 7411) AUTHORS Hahm,S.H., Chen,L., Patel,C., Erickson,J., Bonner,T.I., Weihe,E., Shafer,M.K.-H. and Eiden,L. TITLE Direct Submission JOURNAL Submitted (29-DEC-1997) Section on Molecular Neuroscience, Laboratory of Cellular and Molecular Regulation, NIMH, NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..7411 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene human genomic in pWE15 cosmid" /map="10q11.2" /chromosome="10" misc_feature 4274..4563 /note="submitter indicates overlap with bases 1409-1677 of GenBank Accession Number M96014" /citation=[2] misc_feature 4457..4525 /note="homologous to the first 68 bases of rat cDNA, GenBank Accession Number U09211" /citation=[1] mRNA <4915..7335 /product="vesicular acetylcholine transporter" exon <4915..7335 /note="corresponds to the complete cDNA sequence, GenBank Accession Number U09210; the genomic sequence in this interval is co-linear with the cDNA with no intervening introns" /citation=[1] /evidence=experimental CDS 5356..6954 /note="VAChT" /codon_start=1 /product="vesicular acetylcholine transporter" /db_xref="PID:g769848" /translation="MESAEPAGQARAAATKLSEAVGAALQEPRRQRRLVLVIVCVALL LDNMLYMVIVPIVPDYIAHMRGGGEGPTRTPEVWEPTLPLPTPANASAYTANTSASPT AAWPAGSALRPRYPTESEDVKIGVLFASKAILQLLVNPLSGPFIDRMSYDVPLLIGLG VMFASTVLFAFAEDYATLFAARSLQGLGSAFADTSGIAMIADKYPEEPERSRALGVAL AFISFGSLVAPPFGGILYEFAGKRVPFLVLAAVSLFDALLLLAVAKPFSAAARARANL PVGTPIHRLMLDPYIAVVAGALTTCNIPLAFLEPTIATWMKHTMAASEWEMGMAWLPA FVPHVLGVYLTVRLAARYPHLQWLYGALGLAVIGASSCIVPACRSFAPLVVSLCGLCF GIALVDTALLPTLAFLVDVRHVSVYGSVYAIADISYSVAYALGPIVAGHIVHSLGFEQ LSLGMGLANLLYAPVLLLLRNVGLLTRSRSERDVLLDEPPQGLYDAVRLRERPVSGQD GEPRSPPGPFDECEDDYNYYYTRS" misc_feature 6544..7411 /note="submitter indicates overlap with bases 1-861 of GenBank Accession Number M96015" /citation=[2] polyA_signal 7312..7317 polyA_site 7335 BASE COUNT 1544 a 2244 c 2220 g 1403 t ORIGIN 1 aagcttggca tccctgcagg ggtggattgt cagggtctgg aggcaaggga caagtgaagg 61 gattaactgt cccactgtgg gggaagatgt aggatctggg ctgggggcag gcagggagtg 121 gtctcaatgg aggagggaaa tgaatctgag acattaattg gattgagacg gaaagaaagt 181 ggagagaaca gaaaccaaag ggatttggag gccatgcttc caacgaatga ttcatagtta 241 gtgtcaggga gccagaaaaa aagcaagtga gcaaggtcct gtccctggga gctgtagaga 301 ggagcctggg ccacccacaa gcagcacctg cagtctcttt ccctctcgaa gcccagctat 361 gttgtgcaca aagcaagtct gggcaccgag gacaggctgg ccaagggcag gcaggcaggc 421 acgtagtcct ctggcttcca gccaccacac tcacaggttt ctgggaaagg ctgactgggg 481 ccactttgtt cctttgaatc tgagaatata tgactgggga agcctaaatt aattaaatga 541 tgctgaggcc cgcctgagcc ggtgcacagg ggatgggtta tggagccctg agcaaaactg 601 cacccccagc ccccagtgct ggaatccaga gaggctcatg agctcgattg gaacgaagcc 661 tgtgcttaag tgcttccaga gagacaaaga aataataaat caggagcagg tgccccaccc 721 acacactgcc atcaccaaca ccagcctact tctccacaga aatacagtgg tttcacctct 781 ctggaaccag atgtttcagg gaagcaacaa atggcaaagc cctggaaatg acatggcccc 841 acaaccttct cagaaatgag gccaggctgg gctggcacct ccatccacag cagcacccca 901 ccaccacaac ccacccaaga cctccaaaca ccccctagac ctcacccagg cactggtgca 961 gcatggctat ccttccaccc ctcccaaact cccacacacc aggactcccc catccccagc 1021 aagcccccct cccctgacag ggcactgatc cagcaaggaa gaaattttag tccatcagac 1081 tgtggacaca gttgtatctt gtggataaaa tgagaagaag tgttgtcagc tagaaatcac 1141 caatttgtct ctatccatat aaaataataa caaaattaaa ctcctactag gtatcaagca 1201 ttgggctaga catgttcaag tgagctatct cgcagtcttt agccatcctg cagcaggcac 1261 atggtggctg ggagcaggaa gctggaacta aatcacctgc atgggaatcc catctttacc 1321 acttgtgagc tctctgactg aatacattac tttacctaag tgtctttgcc tataagaaag 1381 taatatgaag gttggtgaga gaattcaatg agttagtgct tatcaaatgc ttagaacggt 1441 gcctgatgcc ttataagctc tactatgtgt ctagtaaata cgttccttcc ttcctctact 1501 cattcaactt taattgagta ccagatatga acgctgccct aacagctgct atttctggtt 1561 gagaaaggag agaaaaatgc ttccgttttc tcctacagga ttggacagga agttctctgt 1621 tgcctgcaag ttatgctcag atagaggagt ccggaaagga atttaccaag cgttcaagtg 1681 aagaggaagc cagaggggcg gggcctccgt gggaggcttc ttggttgcgg tggggccggg 1741 gccagcaccc tggacagctc ccggcggctg gcggggagaa gaggctttgc caggtcccat 1801 ctcggctcag accctcccgt agccttttgc tgcaggggcc tgacgcctcc ttgcaacccc 1861 ggcactcccg tatttggagc cacgatggat tgggggggcg ttgccctctg atctgaaagt 1921 tagagaatga gggagtttag tgtctttgga gcctccttag ggagccagcc caggaccttc 1981 ctcctcccat gggtcttgat ctgctcctca cccctacact gccagcgtgc ggccaccttg 2041 gagatggagc agcaggagca cgacgccgtg ccgggaatag agaagcagtg tgaggaccac 2101 aagacaggtc ccccagcctg ggtttaattc caggatctgc tgctgggagc ttgggcaagc 2161 acccagcatc tccacgccca gcttcctcac ctgataagac caggggaata gcacctacac 2221 tgcaccagga ttgcaggaga atggagtaaa acagcacaca caatactcct ggcacaaagc 2281 aagtaataaa tgagtgtgcc tcctgccata caggaaccga ggactggaga gctagtcatg 2341 agcccagaga tacctggctt catgctcagg aacagccatg tgggccagag ggctggagat 2401 gtggtctgca ccttgaaggc cctgggaaag ttcctgtcct ctatgattcc tctgttcttc 2461 agccactaga cataaaaaaa aattgtcctt caagtcctcc ttcctccttt ctttaatcag 2521 acaggggaaa tagcatgtgc ataaataata attgttttgt aggctgggag actgcccctt 2581 taagagcctc ataatgggag agtcagaaga gcccagaggt ctcaaacgga aatgactgga 2641 gaaccagcaa gtggcataag tgtgtgaagt cagctggggc gagattggga tgtgctggag 2701 agtgtgcatg tactgacgaa aggccatcaa atacaattta aaataattag cccactccaa 2761 agagtcttct gcagcacgca cttctgctaa gggctgcagt gagtgggctc tcacctcaac 2821 ctgaaccaga cggaggctga ggctgggccc cactgagggg ctgtgacttg cccaaggtca 2881 tccagggcca ggcctgtgcc cacattggga ctcttttcct ccctcagaca gagggcactt 2941 cagtcactgc agagcaggtg gcagcagccc ttcagtggat gagtttctcc tttcccaggg 3001 tgactgggtg ggtgaaggaa ttgagcaaca ggtggccagg ctgccgcctc tcaccctgac 3061 acattggtcc ccatcccctc atcctacagc catctcaggg acagatggag acattccaat 3121 gccctccggg tcctccacca acccagacag agcctggagt cacaatgccc acctagccag 3181 agggatcagc cagctagcca agtgccccct gtgtctttca tctgttcttc acaagaccca 3241 caagtgaaaa atgaggataa accgcacatt ctacacacga taacaacata gcaagtcctt 3301 atactgctct tactgtgtgc ccggccccct tagcagtagg tactattatt attcaatttt 3361 acaaagaaac taactgaggg acagagaggc aaagtaattc cctgcaggta ctctaatgag 3421 tacgtggcag agctgggagc catcctggtt gtctgcctcg aaagaccaca ctctttgcac 3481 tgcatcgcgg cacttcctgc ggggtggggg gcacaatggg agaagcatct gcgtctaatg 3541 ctgctttact tttgaggcca gaaaaatggg aaggctcccc tctgactctg gaagagagac 3601 gcaaaccgta atctcaacaa cacaatcccc acctccaacc tcagccgccc tggagcctct 3661 ctcccgccag tccgcccact ggaacacggg ttccatgtgc catccagggt caacgccgct 3721 ctggggacgc gtcaggccca gcgcacagcc tgggcagctc agcctgtcag ctgagcacgg 3781 gcgcctcaag gggtgcggcc ctctcagaag gaaggtgagt ctccctggag cccctgacga 3841 cagcgaagcg gctgggcgct ctccggggag tgggcgcggc acccaagaag ggtttcttac 3901 agttccaaga gaggacccca gccacagtca ctccagagcc cacccggggc ttggaggagg 3961 ctctggaaaa tgagggatgg gggagtgtgc agaacagggg gcggaggctg gcaggagccg 4021 agggagggct tggtcgggtg cgaagagtgc ctgggaggag ttagagtggg gagggggtgc 4081 ggcggcgggg agagaagagg cggagctggg ggaccagacc gctaaaggag aaagggggat 4141 tccaggcaag gggaatgtgg ccaggcagtg cgggagggcg gcagcggcca ggctaatatt 4201 cgcgaccccg actggggcgc ggcaacttct gccctcagtc cccacaagct cccggccgcg 4261 attcgcggcc gagaaattct gcggaggggc cggtgggtca gagccaggcc aggcagttgc 4321 tgagccttcc cctcccctcc tggcgggagt gtgtgtgcaa ggggccaggg ggcggagtcg 4381 gagaaagggt tgggggtgtc cgtgtccagc gctcctccgg acgcggctcc acaatccagc 4441 tgcgagaaca gatggacgca gcggcggctc acctcggggg cttcctgctc gctgtgcgcc 4501 gaggtccagg ctgaagcgga gatcgcaagc ctctggctcc cgcctctccc aaggggcctc 4561 gaggaaccgg ctgccgcgct cccttcccca agaggagaag agcgggcgct gaccccgcga 4621 gtcctagaac cgcgggcggg gggcaggcag ggggcggggg gcagagggcc agctaccggc 4681 gcggaacgcc atcccagggg ctggttcaag acttccacac cgagagccct tcctgggcct 4741 ccagccctgg agcatctgga gagcggaccc ctgcccggcc acgccccgcc cccggccccc 4801 gccccgccga cgtcgcatta gcatgagcga cgtaagtggc ccgggcacca ctcgggggct 4861 gggactcgcc gcgtcacagc cccgagtgga agggaaaaaa aaagaggagg cggcggagga 4921 ggcaagagcc gacgcgaggg gaggggagcg cagcggcggg gctaacgggc gggcaagcgg 4981 gcgggcggca acagcatgtc cctcggccag cgcgggcggc ctcttagcgc ggcgggggct 5041 gctctgggcg cgccccgggc gaagtgcgcc cagtctccgg ccccggcccc tcggcgcgcc 5101 cgacttcccg gccgcccctg agcccagcag ccgcgggtcc cgggatcggc taagagtagc 5161 tgcaacgcct cgccggacgg agtcctttcc tttcccggga cgctgggcca tgagctccgc 5221 ggcccacctg aggcacaggg gagtctgctc ggccaggaca gcctccccga agtcccgtgc 5281 cctcgcctct gcactgcggg acgccagcgc tcggccctgg cggaggcgtc ttcggaagag 5341 catcggggtg ggggcatgga atccgcggaa cctgcgggcc aggcccgggc ggcggccacc 5401 aagctgtcgg aggctgtggg cgcggcgctg caggagcccc ggcggcagag gcgcctggtg 5461 cttgttatcg tgtgcgtggc gctgttactg gacaacatgc tgtacatggt catcgtgccc 5521 atagtgcccg actacatcgc ccacatgcgc gggggcggcg agggccccac ccggactccc 5581 gaggtgtggg agcccaccct gccgctgccc actccggcca atgccagcgc ctacacggcc 5641 aacacctcgg cgtccccgac agctgcgtgg ccagcgggct cagcccttcg gccccgctac 5701 cctacggaga gcgaagacgt gaagatcggg gtgctgtttg cttccaaggc tatcctgcag 5761 ctgctagtga accccttgag cgggcccttc atcgaccgca tgagctacga cgtgccgctg 5821 ctgatcggcc tgggcgtcat gttcgcctct acagtcctgt tcgccttcgc cgaggactac 5881 gccacgctgt tcgcggcgcg cagcctgcag ggcctgggct cagccttcgc cgacacgtct 5941 ggcatagcca tgatcgccga taagtacccg gaggagccgg agcgcagtcg tgcactgggc 6001 gtggcgctgg ccttcattag cttcggaagc ctagtggccc cgcccttcgg gggcatcctc 6061 tatgagttcg ccggcaagcg cgtgcccttc ttggtgctag ctgccgtgtc gctctttgac 6121 gcgctgttgc tgctggcagt ggccaaaccc ttctcggcgg ctgcacgggc tcgggccaac 6181 ctgccagtgg gcactcccat ccaccgcctc atgctagacc cctacattgc cgtggtggcc 6241 ggcgcgctca ccacctgtaa cattcccctc gccttcctcg aacccaccat tgccacgtgg 6301 atgaagcata cgatggcggc ttccgagtgg gagatgggca tggcctggct gccggccttc 6361 gtgcctcatg tgctgggcgt ctacctcacc gtgcgcctgg cggcgcgcta cccacacctg 6421 cagtggctgt acggcgcgct tgggctggct gtgatcggcg ccagctcgtg catcgtgccc 6481 gcctgccgct ccttcgcgcc gctagtggtc tcactatgcg gcctctgttt tggcatagcc 6541 ctagtcgaca cagcactgct gcccacgctc gccttcctgg tggacgtgcg ccatgtctca 6601 gtctatggca gcgtctacgc catcgccgac atctcctatt cggtggccta cgcgctcggg 6661 cccatagtgg caggccacat tgtgcactcg ctgggctttg agcagctcag ccttggcatg 6721 ggactggcca acctgctcta tgctcccgtc ttgctgctgc tccgcaacgt gggcctcctg 6781 acgcgctccc gttccgagcg cgatgtgctg cttgatgagc caccgcaagg tctgtacgat 6841 gcggtgcgcc tgcgtgagcg tcctgtgtct ggccaggacg gcgagcctcg cagcccgcct 6901 ggcccttttg atgagtgcga ggacgactac aactactact acacccgcag ctagcatccc 6961 cactcctcct ccagcccacc caaccgcctt gggtcaaggg ggctgctctg caagcccact 7021 ggccagctct ggctcagggc ccacctcctc cagcgagtac cccagccact cctcaacctt 7081 gacttctgcc caaatcccct ccctgtgacc cgttccatat ccctttctct cttgtccaat 7141 ggggcttgga gcaccgaggc cagcgaagcc atcgcgctcc ttgcggaggt gaagaggacc 7201 ctgagtcccc acctgcggct cccctgtgta gagcctgcat ctgtctgtcc ttccttccat 7261 tgctcccagt gccaaacttg ggccgctgca ccgcggcgcc tccgcccaaa tcaataaact 7321 gtgtctgtcc caggaggccg agtctcttta ctggtggggg gtgcgtggag gcgcgcaggg 7381 ccagagcaga ggggagggtg aactgggtct c // LOCUS HSU10685 3510 bp DNA PRI 23-JUN-1995 DEFINITION Human MAGE-10 antigen (MAGE10) gene, complete cds. ACCESSION U10685 NID g533510 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3510) AUTHORS De Plaen,E., Arden,K., Traversari,C., Gaforio,J.J., Szikora,J.P., De Smet,C., Brasseur,F., van der Bruggen,P., Lethe,B., Lurquin,C. Brasseur,R., Chomez,P., De Backer,O., Cavenee,W. and Boon,T. TITLE Structure, chromosomal localization, and expression of 12 genes of the MAGE family JOURNAL Immunogenetics 40 (5), 360-369 (1994) MEDLINE 95012457 REFERENCE 2 (bases 1 to 3510) AUTHORS De Plaen,E. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Etienne De Plaen, Ludwig Institute for Cancer Research, 74 Avenue Hippocrate, Brussels, 1200, Belgium FEATURES Location/Qualifiers source 1..3510 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_type="lymphocyte" /tissue_type="blood" /dev_stage="adult" exon 1741..1814 /number=2 exon 1890..>3065 /number=3 gene 1955..3064 /gene="MAGE10" CDS 1955..3064 /gene="MAGE10" /codon_start=1 /product="MAGE-10 antigen" /db_xref="PID:g533511" /translation="MPRAPKRQRCMPEEDLQSQSETQGLEGAQAPLAVEEDASSSTST SSSFPSSFPSSSSSSSSSCYPLIPSTPEEVSADDETPNPPQSAQIACSSPSVVASLPL DQSDEGSSSQKEESPSTLQVLPDSESLPRSEIDEKVTDLVQFLLFKYQMKEPITKAEI LESVIKNYEDHFPLLFSEASECMLLVFGIDVKEVDPTGHSFVLVTSLGLTYDGMLSDV QSMPKTGILILILSIIFIEGYCTPEEVIWEALNMMGLYDGMEHLIYGEPRKLLTQDWV QENYLEYRQVPGSDPARYEFLWGPRAHAEIRKMSLLKFLAKVNGSDPRSFPLWYEEAL KDEEERAQDRIATTDDTTAMASASSSATGSFSYPE" BASE COUNT 886 a 892 c 909 g 823 t ORIGIN 1 cagggagatg gtggctttgg cgtgcaagac ccatacacga ttcagcagga gggaaaggct 61 gggctgtcgg gagtaaatct gaatacctgg aggacaccca aataaaggaa gtccccgtct 121 tgtccccctc ccctgcccac cacccccccc ccccccgcca aatgtctgct ccttctgtca 181 gctttgggaa tcccatgcag gtgtgatcgt gtggtgcccc tccccacttc tgcctgccgg 241 gtctcaggga ggtgaggacc ttggtctgag ggttgctaag aagttattac agggttccac 301 acttggtcaa cagagggagg agtcccagaa tctgcaggac ccaaggggtg cccccttagt 361 gaggactgga ggtacctgca gcccagaaag aagggatgtc acagagtctg gctgtcccct 421 gttcttagct ctgaggggac ctgatcagga ttggcactaa gtggcaagct caattttacc 481 acaggcagga agatgaggaa ccctcaggga aatggagttt tggtgtaaag gggagatatc 541 agccctggac accccacagg gatgacagga tgtggctcct tcttactttt gttttggaat 601 ctcagggagg tgagaacctt gctctcagag ggtgactcaa gtcaacacag ggaacccctc 661 ttttctacag acacagtggg tcgcaggatc tgacaagagt ccaggtaagg aacctgaggg 721 aaatctgagg gtacccccag cccataacac agatggggtc cccacagaaa tctgccatga 781 ccctactgtc actctggaga acccagtcag ggctgtccgc tgagtctccc tgtcttatac 841 aaggatcact ggtctctggg agggagaggt gttggtctaa gggagctgca ctcgggtcag 901 cagagggagg gtcccagacc ctgccaggag tcaaggtgag gactgagggg acaccattct 961 ccaaacgcac aggactcagc cccaccctac cccttctgtc agccacggga attcatgggg 1021 aactgggggt agatggactc ccctcacttc ctctttccat gtctcctgga ggtaggacct 1081 tggtttaagg aagtggcctc agatcaacaa agggagggtc ccaggtcgta tcaggcatca 1141 agaagaggac caagcaggct cctcacccca gtacacatgg acccagctga atatggccac 1201 ctcttgctgt cttttctggg aggacctctg cagttgtggc cagatgtggg tcccctcatg 1261 tcttctattt cgtatcaggg atgtaagctt ttgatctgag agtttcttag accagcaaag 1321 gagcagggtc taggcttttc caggagaaag gtgagagccc cacgtgagca cagaggctcc 1381 ccaccccagg gtagtgggga actcacagag tccagcccac cctcctgaca acactgggag 1441 gctggggctg tgcttgcagc ctgaaccctg agggcccctc aattcctctt tcaggagctc 1501 cagggactgt gaggtgaggc cttggtctaa ggcagtgttt tcaggtcaca gagcagaaag 1561 ggcccagaca gtgccaggag tcaaggtgag gtgcatgccc tgaatgtgta ccaagggccc 1621 cacctgctcc aggacaaagt ggaccccact gcatcagctc cacctaccct actgtcagtc 1681 ctggagcctt ggcctctgcc ggctgcatcc tgaggagcca tctctcactt ccttcttcag 1741 gttctcaggg gacagggaga gcaagaggtc aagagctgtg ggacaccaca gagcagcact 1801 gaaggagaag acctgtaagt tggcctttgt tagaacctcc agggtgtggt tctcagctgt 1861 ggccacttac accctccctc tctccccagg cctgtgggtc cccatcgccc aagtcctgcc 1921 cacactccca cctgctaccc tgatcagagt catcatgcct cgagctccaa agcgtcagcg 1981 ctgcatgcct gaagaagatc ttcaatccca aagtgagaca cagggcctcg agggtgcaca 2041 ggctcccctg gctgtggagg aggatgcttc atcatccact tccaccagct cctcttttcc 2101 atcctctttt ccctcctcct cctcttcctc ctcctcctcc tgctatcctc taataccaag 2161 caccccagag gaggtttctg ctgatgatga gacaccaaat cctccccaga gtgctcagat 2221 agcctgctcc tccccctcgg tcgttgcttc ccttccatta gatcaatctg atgagggctc 2281 cagcagccaa aaggaggaga gtccaagcac cctacaggtc ctgccagaca gtgagtcttt 2341 acccagaagt gagatagatg aaaaggtgac tgatttggtg cagtttctgc tcttcaagta 2401 tcaaatgaag gagccgatca caaaggcaga aatactggag agtgtcataa aaaattatga 2461 agaccacttc cctttgttgt ttagtgaagc ctccgagtgc atgctgctgg tctttggcat 2521 tgatgtaaag gaagtggatc ccactggcca ctcctttgtc cttgtcacct ccctgggcct 2581 cacctatgat gggatgctga gtgatgtcca gagcatgccc aagactggca ttctcatact 2641 tatcctaagc ataatcttca tagagggcta ctgcacccct gaggaggtca tctgggaagc 2701 actgaatatg atggggctgt atgatgggat ggagcacctc atttatgggg agcccaggaa 2761 gctgctcacc caagattggg tgcaggaaaa ctacctggag taccggcagg tgcctggcag 2821 tgatcctgca cggtatgagt ttctgtgggg tccaagggct catgctgaaa ttaggaagat 2881 gagtctcctg aaatttttgg ccaaggtaaa tgggagtgat ccaagatcct tcccactgtg 2941 gtatgaggag gctttgaaag atgaggaaga gagagcccag gacagaattg ccaccacaga 3001 tgatactact gccatggcca gtgcaagttc tagcgctaca ggtagcttct cctaccctga 3061 ataaagtaag acagattctt cactgtgttt taaaaggcaa gtcaaatacc acatgatttt 3121 actcatatgt ggaatctaaa aaaaaaaaaa aaaaaagttg gtatcatgga agtagagagt 3181 agagcagtag ttacattaca attaaatagg aggaataagt tctagtgttc tattgcacag 3241 taggatgact atagttaaca ttaagatatt gtatattaca aaacagctag aaggaaggct 3301 tttcaatatt gtcaccaaaa agaaatgata aatgcatgag gtgatggata cactacctga 3361 tttgatcatt atactacata tacatgaatc agaacatcaa attgtacctc ataaatatct 3421 acaattacat gtcagttttt gtttatgttt ttgttttttt ttaatttatg aaaacaaatg 3481 agaatggaaa tcaatgatgt atgtggtgga // LOCUS HSU10686 3672 bp DNA PRI 23-JUN-1995 DEFINITION Human MAGE-11 antigen (MAGE11) gene, complete cds. ACCESSION U10686 NID g533512 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3672) AUTHORS De Plaen,E., Arden,K., Traversari,C., Gaforio,J.J., Szikora,J.P., De Smet,C., Brasseur,F., van der Bruggen,P., Lethe,B., Lurquin,C. Brasseur,R., Chomez,P., De Backer,O., Cavenee,W. and Boon,T. TITLE Structure, chromosomal localization, and expression of 12 genes of the MAGE family JOURNAL Immunogenetics 40 (5), 360-369 (1994) MEDLINE 95012457 REFERENCE 2 (bases 1 to 3672) AUTHORS De Plaen,E. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Etienne De Plaen, Ludwig Institute for Cancer Research, 74 Avenue Hippocrate, Brussels, 1200, Belgium FEATURES Location/Qualifiers source 1..3672 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_type="lymphocyte" /tissue_type="blood" /dev_stage="adult" exon 1806..1879 /number=2 exon 1955..>3246 /number=3 gene 2019..2978 /gene="MAGE11" CDS 2019..2978 /gene="MAGE11" /codon_start=1 /product="MAGE-11 antigen" /db_xref="PID:g533513" /translation="MPLEQRSQHCKPEEGLQAQEEDLGLVGAQALQAEEQEAAFFSST LNVGTLEELPAAESPSPPQSPQEESFSPTAMDAIFGSLSDEGSGSQEKEGPSTSPDLI DPESFSQDILHDKIIDLVHLLLRKYRVKGLITKAEMLGSVIKNYEDYFPEIFREASVC MQLLFGIDVKEVDPTSHSYVLVTSLNLSYDGIQCNEQSMPKSGLLIIVLGVIFMEGNC IPEEVMWEVLSIMGVYAGREHFLFGEPKRLLTQNWVQEKYLVYRQVPGTDPACYEFLW GPRAHAETSKMKVLEYIANANGRDPTSYPSLYEDALREEGEGV" BASE COUNT 907 a 897 c 973 g 895 t ORIGIN 1 agtccaggat ctgccagtag tcaaggagag gaaaattgat gaagactgaa ggtaagaatg 61 taccctccca catgccaaag aaaaagggac ctcaccaatc cttgcttcct ctgttttcat 121 ccctcggagg cccaagttgg ggaggcatgt gccatgctca catttctgcc acgaggttgg 181 gggtggcacc ttgctcaggg aggtgagcac cgttgtttca agggggtgat gacaggtcag 241 caggtggagc cacacctgat cagcagaggg aggagtccca ggatctttag gactcaaggt 301 gtatgtgtcc ccttggtgag gactggagag cccacatccc ataatgaagg gatcccacag 361 agtctctctg tccccatgtc cttggctgtg tggggacctc atcacgggtg gccccaagtg 421 gcaaggtcac ttgtaccaca ggcagaaagt tgggaaacct tcagggagat gaggtcttgg 481 tgtaaaggga tatgtctgct catctcaggg gttgggagtc aaggaaggac aggccctggc 541 agaagtaaag atgaaaaacc cacaggagga ctttggaatc cccagaaccg aagggtccag 601 cctctgctgt cagccctgga caaccacatg atggggtgat gggacgtggg gccccttact 661 tctgttttgg aatcttgggc aggtgagcac tatgttctca gaggacgact tccagtcaac 721 agaaagagcc ccatatggtc cacaactaca gtggtcccag gatctgccaa gagtccaggt 781 gagaaacctg agggaggatt gagggttcct cctggccaga acacagaggg ctgcttagaa 841 atctgctctg cccctgctgt ctccccagag agcatgtgca ggactatgtg ctgagacccc 901 tctcttatac tgggatcatt ggtctcaggg agcgggagac attggtctga gagggctgca 961 cttaggtcag cagtgggagg gtcccaggcc atgaccagaa tcaaggtggg ggctgacggg 1021 acagcactta ccaaaaacat gggactcagc ccttccctgc cccttctgtc agctatggga 1081 agtccctggg accatgggtg tttctatttc cctgatttcc tcttctgata tctcctggag 1141 gtagagcttt ggtttaagga gatggcgtca ggtcaacaga gggagggtcc caggccaaga 1201 taggcatcaa gatgggaacc aaacaggctc cttacccgag gacacatgga ccctgctgac 1261 tgtcaccatc tcttgctgtc cttcctgggt agccctgtgt acatgtggcc agatgtgtat 1321 ccccacatgt cctctttcat atcaggaaag agctattgat ctgagagttt ctcaggtcag 1381 gagagctgtg tcttccaggc cctggcagga gaaaggtgag ggccctgagc acagagggga 1441 ccatccactc caaaaaagtg agaaactcac agagtttggc acacctttct gacagtgctg 1501 gggtgccagg atgggtgctt gcagtctgca gcctgatggc cccatgattc ctcttctaga 1561 agctccaaaa actgagcagt gaggccttgg tctcaagcaa tgtcttcaga tctcagaaca 1621 caggaagcct aggcagtgcc agtagtcaag atgagatgtt cacccttaat ctacaaatgg 1681 ccccacctgc cccagtacag aaagggaccc ccagcttgca acctcacctg ccctacctca 1741 gtcctggagc ctcctgctct gatgtccagc tgcatcttga gcagccttct cacttccttt 1801 ttcaggtttt tagagaacag gccaacctgg aggacaggag tcccaggaga acccagagga 1861 tcactggagg agaacaagtg taagtaggcc tttgttagat tctccatggt tcatatctca 1921 tctgagtctg ttctcacgct ccctctctcc ccaggctgtg gggccccatc acccagatat 1981 ttcccacagt tcggcctgct gacctaacca gagtcatcat gcctcttgag caaagaagtc 2041 agcactgcaa gcctgaggaa ggccttcagg cccaagaaga agacctgggc ctggtgggtg 2101 cacaggctct ccaagctgag gagcaggagg ctgccttctt ctcctctact ctgaatgtgg 2161 gcactctaga ggagttgcct gctgctgagt caccaagtcc tccccagagt cctcaggaag 2221 agtccttctc tcccactgcc atggatgcca tctttgggag cctatctgat gagggctctg 2281 gcagccaaga aaaggagggg ccaagtacct cgcctgacct gatagaccct gagtcctttt 2341 cccaagatat actacatgac aagataattg atttggttca tttattgctc cgcaagtatc 2401 gagtcaaggg gctgatcaca aaggcagaaa tgctggggag tgtcatcaaa aattatgagg 2461 actactttcc tgagatattt agggaagcct ctgtatgcat gcaactgctc tttggcattg 2521 atgtgaagga agtggacccc actagccact cctatgtcct tgtcacctcc ctcaacctct 2581 cttatgatgg catacagtgt aatgagcaga gcatgcccaa gtctggcctc ctgataatag 2641 tcctgggtgt aatcttcatg gaggggaact gcatccctga agaggttatg tgggaagtcc 2701 tgagcattat gggggtgtat gctggaaggg agcacttcct ctttggggag cccaagaggc 2761 tccttaccca aaattgggtg caggaaaagt acctggtgta ccggcaggtg cccggcactg 2821 atcctgcatg ctatgagttc ctgtggggtc caagggccca cgctgagacc agcaagatga 2881 aagttcttga gtacatagcc aatgccaatg ggagggatcc cacttcttac ccatccctgt 2941 atgaagatgc tttgagagag gagggagagg gagtctgagc atgagatgca accagggcca 3001 gcgggcaggg aaatgggcca atgcatgctt cagggccaca cccagcagtt tccctgtcct 3061 gtgtgaaatc aggcccattc ttccctctgt gtttgatgag agaagtcagt gttctcagta 3121 gtagaaggca cagtgaatgg aagggaacac attgtatact gcctttaggt ttctcttcca 3181 tcgggtgact tggagatttg tttttgtttc cctttggtaa ttttcaaata ttgttcctgt 3241 aataaaagtt ttagttagct tcaacatcta agtgtatgga tgatactgac cacacatgtt 3301 gttttgctta tccatttcaa gtgcaagtgt ttgccatttt gtaaaacatt ttgggaaatc 3361 ttccatcttg ctgtgatttg caataggtat tttcttggag aatgtaagaa cttaacaata 3421 aagctgaact ggtgttgtga aacagagaaa taaaaggaga aggtcattaa ttcttgtctt 3481 cttatccata ttaatctgtt gttctatgaa agtacacacc catacacaca tgtacacccc 3541 cctcccccca catacatatt caccaaggaa atgcagtttc ctactgagtt gcagattctc 3601 tgagatgtcc tggacaataa aaaatattcc aaagtagaga gtggtagcac cgtggggtca 3661 cagtaatact ag // LOCUS HSU10689 4741 bp DNA PRI 23-JUN-1995 DEFINITION Human MAGE-5a antigen (MAGE5a) gene, complete cds. ACCESSION U10689 NID g533518 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4741) AUTHORS De Plaen,E., Arden,K., Traversari,C., Gaforio,J.J., Szikora,J.P., De Smet,C., Brasseur,F., van der Bruggen,P., Lethe,B., Lurquin,C. Brasseur,R., Chomez,P., De Backer,O., Cavenee,W. and Boon,T. TITLE Structure, chromosomal localization, and expression of 12 genes of the MAGE family JOURNAL Immunogenetics 40 (5), 360-369 (1994) MEDLINE 95012457 REFERENCE 2 (bases 1 to 4741) AUTHORS De Plaen,E. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Etienne De Plaen, Ludwig Institute for Cancer Research, 74 Avenue Hippocrate, Brussels, 1200, Belgium FEATURES Location/Qualifiers source 1..4741 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_type="lymphocyte" /tissue_type="blood" /dev_stage="adult" exon 644..696 /number=1 exon 2848..2934 /number=2 exon 3010..>4560 /number=3 gene 3075..3449 /gene="MAGE5a" CDS 3075..3449 /gene="MAGE5a" /codon_start=1 /product="MAGE-5a antigen" /db_xref="PID:g533519" /translation="MSLEQKSQHCKPEEGLDTQEEALGLVGVQAATTEEQEAVSSSSP LVPGTLGEVPAAGSPGPLKSPQGASAIPTAIDFTLWRQSIKGSSNQEEEGPSTSPDPE SVFRAALSKKVADLIHFLLLKY" BASE COUNT 1118 a 1274 c 1305 g 1044 t ORIGIN 1 gttctgctcc tgctttcaac ccagggaatc cctgggtgac cagatgtggt gccactgtct 61 tgcacatttg aggtcggaga gaagcaaggg cctcgctctc aggggcagct ggagatcagc 121 tgagggcagc tggccctggc tctgtgagga tgcaaggtga gatgctgagg gaggactaag 181 gagtatccca cccctggtag tggaccccaa ataatccagt gccacctctc ctgctgctag 241 ctctggacca tccagggcag gacttcttag gctgggccac ccccagtccc ccaccgctta 301 agccgcaggg gactcaggag acagagcttg gtatgaccag ggcaggactg gttaggagag 361 gacagctccc agctctgcca ggaaacaacg tcaggaacct aagggaaagc tgaggctacc 421 cccaccccaa actctattcc tgtccctacc tccgtccccc acctacaccc cccattcccc 481 caccccttcc ctaccggcac ctctatccca catcccccac ccctatcctg gcagaatccg 541 attctgcccc tgatttcaac ccagggaagc cctagggggc cggatgtgat gctgctgact 601 tgtgcattgg gggtcagaga gaatcaaggg catggttctg agaagccgac tgagatcagc 661 agaggggaat gggcccgggc tctgtgagga ggcaaggtga gacccccgag gaaggaatga 721 ggaagccctc acccagatag agaaccccaa ataatccagt actacctttg ctgccagccc 781 tggaccaccc agggcagact tctcaggctg aaccttcccc cctccccact gccacttaag 841 ccacaaggga ctctggagtc agaccttggt gtgaccaggg aagggccggt caggagaggg 901 caggggccag gctctgtcag gcatcaaaat caggaccctg agagagaatt gagggccccc 961 accccaaccc ctatacccat ccctaacccc atacccactc tacttgcatt cccagcccca 1021 tccccacacc ctaccccatc ttggcagaat ctgtttcttt ccctgcagtc aacccacaga 1081 agccccagga atgacagaca ggcacaccta ttctgacgtc cacatccagg gctgaaggag 1141 ggaaagggct tagtatcatg agcagggcct caggggagtc tctgctcctc aagccctgct 1201 gggagtaaag ggaggcctca gggaacccag gtcctcagga tagggggtcc actccaaccc 1261 tgtctgagac tgaggcgcct cctctttcat cctcgggaat cacagggatg gagactcacg 1321 tcagcagagg gtggggccca accctgccag gatcaaggag aggaagaaga gggaggactc 1381 agggtacctt tgagtccaga acaatgggga cctttgccct gggaggtcca gtgcacagtg 1441 gccacctgta gcccatgctt gctgcacctt ctgggtgaca aagaggagag ggctgtggtc 1501 agagcagtgg tgactcaggt cagcagaggg aggagtccca gcatctgcag gccccaatgt 1561 gtgccccatt catgaagatt ggggatacct tggctcagaa agaagggacc ccacagagtc 1621 tggctgtccc ctgatttttg ctcagagggg accaaatcaa ggatagccct atgtgccaac 1681 ctcatttgtg ccacaggaaa gaagttgaag agccctcagg gtgatggggt cttgcagtaa 1741 aggggagcta tctgctcatc tcagggggtt tcaggttgag gaatggcagg ccccatcacg 1801 atgaagagta acccacagga gccatagaaa cactcacccc agaaccaaag gggtcatacc 1861 tggacacccc atgtgggggt gacaggatgt agctccatct cattcctgtt ttcagatctc 1921 ggggaggtga ggaacttgtt ctccgaggat gactcaggtc aacacagggg cccccatctg 1981 gtggatagac agagtggtcc caggatctgt cagtagttcc ggtgaggaac atgagggacg 2041 attgagggca cccttgggcc agaacacaga tgaggacctc acggaaatct gccctgcccc 2101 tgctgtcact ccagagagca tgggcagggc tgtctgctgc agtccccccc acttaccctg 2161 ggatcattgg tgtcagggat ggggaggtct ttgtcgaggg gtctgcactc aggtcagtag 2221 agggagcgtc ttaggccctg ccaggagaca aggtaagaac gaagcaggtt cctcacccag 2281 gacacatgaa ttccaatgca tttcagcatc tcttcctgtc cttcccaaga ggacctgggc 2341 acgtgtggcc agatgtgagt ctcctcatgt cctgttccct atcagggatg tgagctctta 2401 atctgagttt ctcaggccag caaaagggtg ggatccaggc cttgccagga gaaaggtgag 2461 ggccctgtgt gagcacagag gggaccattc accccaagag ggtggagacc tcacagattc 2521 cagcctaccc tcctgttagc actgggggcc tgaggctgtg cttgcagtct gcaccctgag 2581 ggcccatgca ttcctcttcc aggagctcca ggaaacagac actgaggcct tggtctgagg 2641 ccgtgccctc aggtcacaga gcagaggaga tgcagacgtc tagtgccagc agtgaacgtt 2701 tgccttgaat gcacactaat ggcccccatc gccccagaac atatgggact ccagagcacc 2761 tggcctcacc ctctctactg tcagtcctgc agaatcagcc tctgcttgct tgtgtaccct 2821 gaggtgccct ctcacttttt ccttcaggtt ctcaggggac aggctgacca ggatcaccag 2881 gaagctccag aggatcccca ggaggcccta gaggagcacc aaaggagaag atctgtaagt 2941 aagcctttgt tagagcctcc aaggttcagt ttttagctga ggcttctcac atgctccctc 3001 tctctccagg ccagtgggtc tccattgccc agctcctgcc cacactcctg cctgttgcgg 3061 tgaccagagt cgtcatgtct cttgagcaga agagtcagca ctgcaagcct gaggaaggcc 3121 ttgacaccca agaagaggcc ctgggcctgg tgggtgtgca ggctgccact actgaggagc 3181 aggaggctgt gtcctcctcc tctcctctgg tcccaggcac cctgggggag gtgcctgctg 3241 ctgggtcacc aggtcctctc aagagtcctc agggagcctc cgccatcccc actgccatcg 3301 atttcactct atggaggcaa tccattaagg gctccagcaa ccaagaagag gaggggccaa 3361 gcacctcccc tgacccagag tctgtgttcc gagcagcact cagtaagaag gtggctgact 3421 tgattcattt tctgctcctc aagtattaag tcaaggagct ggtcacaaag gcagaaatgc 3481 tggagagcgt catcaaaaat tacaagcgct gctttcctga gatcttcggc aaagcctccg 3541 agtccttgca gctggtcttt ggcattgacg tgaaggaagc ggaccccacc agcaacacct 3601 acacccttgt cacctgcctg ggactcctat gatggcctgc tggttgataa taatcagatc 3661 atgcccaaga cgggcctcct gataatcgtc ttgggcatga ttgcaatgga gggcaaatgc 3721 gtccctgagg agaaaatctg ggaggagctg agtgtgatga aggtgtatgt tgggagggag 3781 cacagtgtct gtggggagcc caggaagctg ctcacccaag atttggtgca ggaaaactac 3841 ctggagtacc ggcaggtgcc cagcagtgat cccatatgct atgagttact gtggggtcca 3901 agggcactcg ctgcttgaaa gtactggagc acgtggtcag ggtcaatgca agagttctca 3961 tttcctaccc atccctgcgt gaagcagctt tgagagagga ggaagaggga gtctgagcat 4021 gagctgcagc cagggccact gcgagggggg ctgggccagt gcaccttcca gggctccgtc 4081 cagtagtttc ccctgcctta atgtgacatg aggcccattc ttctctcttt gaagagagca 4141 gtcaacattc ttagtagtgg gtttctgttc tattggatga ctttgagatt tgtctttgtt 4201 tccttttgga attgttcaaa tgtttctttt aatgggtggt tgaatgaact tcagcattca 4261 aatttatgaa tgacagtagt cacacatagt gctgtttata tagtttagga gtaagagtct 4321 tgttttttat tcagattggg aaatccattc cattttgtga attgggacat agttacagca 4381 gtggaataag tattcattta gaaatgtgaa tgagcagtaa aactgatgac ataaagaaat 4441 taaaagatat ttaattcttg cttatactca gtctattcgg taaaattttt tttaaaaaat 4501 gtgcatacct ggatttcctt ggcttctttg agaatgtaag acaaattaaa tctgaataaa 4561 tcattctccc tgttcactgg ctcatttatt ctctatgcac tgagcatttg ctctgtggaa 4621 ggccctgggt taatagtgga gatgctaagg taagccagac tcacccctac ccacagggta 4681 gtaaagtcta ggagcagcag tcatataatt aaggtggaga gatgccctct aagatgtaga 4741 g // LOCUS HSU10693 3839 bp DNA PRI 23-JUN-1995 DEFINITION Human MAGE-8 antigen (MAGE8) gene, complete cds. ACCESSION U10693 NID g533525 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3839) AUTHORS De Plaen,E., Arden,K., Traversari,C., Gaforio,J.J., Szikora,J.P., De Smet,C., Brasseur,F., van der Bruggen,P., Lethe,B., Lurquin,C. Brasseur,R., Chomez,P., De Backer,O., Cavenee,W. and Boon,T. TITLE Structure, chromosomal localization, and expression of 12 genes of the MAGE family JOURNAL Immunogenetics 40 (5), 360-369 (1994) MEDLINE 95012457 REFERENCE 2 (bases 1 to 3839) AUTHORS De Plaen,E. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Etienne De Plaen, Ludwig Institute for Cancer Research, 74 Avenue Hippocrate, Brussels, 1200, Belgium FEATURES Location/Qualifiers source 1..3839 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_type="lymphocyte" /tissue_type="blood" /dev_stage="adult" exon 1996..2057 /number=2 exon 2133..>3746 /number=3 gene 2196..2900 /gene="MAGE8" CDS 2196..2900 /gene="MAGE8" /codon_start=1 /product="MAGE-8 antigen" /db_xref="PID:g533526" /translation="MLLGQKSQRYKAEEGLQAQGEAPGLMDVQIPTAEEQKAASSSST LIMGTLEEVTDSGSPSPPQSPEGASSSLTVTDSTLWSQSDEGSSSNEEEGPSTSPDPA HLESLFREALDEKVAELVRFLLRKYQIKEPVTKAEMLESVIKNYKNHFPDIFSKASEC MQVIFGIDVKEVDPAGHSYILVTCLGLSYDGLLGDDQSTPKTGLLIIVLGMILMEGSR APEEAIWEALSVMGAV" BASE COUNT 922 a 959 c 1064 g 894 t ORIGIN 1 agtctcagat cactggagag aggtgcccca gagcccttaa ggaggactca gcagacctcc 61 catcatggcc taggaaacct gctcccactc tcaggtctgg gcacccaagg caggacagtg 121 gggaagggat gtggcccccc cactttctgg taggggggcc tcaaggagat ggtggccttg 181 gcatgcaaga cacatccacg gttcagcagg aaggaaaggg ccatgccttg tcgtggagta 241 aatatgaata cctggatgac acccagacag agaaagaccc catgaaacct actacttctg 301 tcagccgtgg gaatcccatg cagggttgtc catgtagtgc ctccttactt ctgcctcctg 361 ggtctcaggg aggtagcaac ctgggtctga agggcgtcct cagctcagca gagggagcca 421 cacctgttca acagagggac ggggtcacag gatctgcagg acccaagatg tgctcacttt 481 gtgatgaatg ggggtactcc tggcctggaa agaagggacc ccacaaagtc tggctaactt 541 tggttattat ctctggggga acccgatcaa gggtggccct aagtggagat ctcatctgta 601 ctgtgggcag gaagttgggg aaacgcagga agataaggtc ttggtggtaa ggggagatgt 661 ctgctcatat cagggtgttg tgggttgagg aagggcgggc tccatcaggg gaaagatgaa 721 taaccccctg aagaccttag aacccaccac tcaagaacaa gtagggacag atcctagtgt 781 cacccctgga caccccaccc agtggtcatc agatgtggtg gctcctcatt tctctcttga 841 gtctcaggga agtgaggacc ttgttctcag agggcaactc aggacaaaac agggaccccc 901 atgtgggcaa cagactcagt ggtccaagaa tctaccaaga gtctaggtga caacactgag 961 ggaagattga gggtaccctc gatggttctc ctagcaggca aaaaacagat gggggcccaa 1021 cagaaatctg cccggcctct tttgtcaccc ctgagagcat gagcaggact atcagctgag 1081 gcccctgtgt tataccagac tcattggtct cagggagaag aaggccttgg tctgagggca 1141 ctgcattcag gtcagcagag cgggggtcca aggccctgcc aggagtcagg gactcagagg 1201 acaccactca ccaaacacac aggaccgaac cccaccctgc accttctgtc agccatggga 1261 agtgcaggga aaggtgggtg gatggaatcc cctcatttgc tcttccagtg tctcctggag 1321 ataggtcctt ggattaagga agtggcctca ggtcagccca ggacacatgg gccccaatgt 1381 attttgtgta gctattgctt ttttctcacc ctaggacaga cacgtgggcc ccattgcatt 1441 ttgtgtagct attgcttttt tcccaggagg ccttgggcat gtggggccag atgtgggtcc 1501 cttcatatcc ttgtcttcca tatcagggat ataaactctt gatctgaaag tttctcaggc 1561 cagcaaaagg gccagatcca ggccctgcca ggagaaagat gagggccctg aatgagcaca 1621 gaaaggacca tccacacaaa atagtgggga gctcacagag tcaggctcac cctcctgaca 1681 gcactggggt gctggggctg tgcttgcagt ctgcagcctg agttcccctc gatttatctt 1741 ctaggagctc caggaaccag gctgtgaggt cttggtctga ggcagtatct tcaatcacag 1801 agcataagag gcccaggcag tagtagcagt caagctgagg tggtgtttcc cctgtatgta 1861 taccagaggc ccctctggca tcagaacagc aggaacccca cagttcctgg ccctaccagc 1921 ccttttgtca gtcctggagc cttggccttt gccaggaggc tgcaccctga gatgccctct 1981 caatttctcc ttcaggttcg cagagaacag gccagccagg aggtcaggag gccccagaga 2041 agcactgaag aagacctgta agtagacctt tgttagggca tccagggtgt agtacccagc 2101 tgaggcctct cacacgcttc ctctctcccc aggcctgtgg gtctcaattg cccagctccg 2161 gcccacactc tcctgctgcc ctgacctgag tcatcatgct tcttgggcag aagagtcagc 2221 gctacaaggc tgaggaaggc cttcaggccc aaggagaggc accagggctt atggatgtgc 2281 agattcccac agctgaggag cagaaggctg catcctcctc ctctactctg atcatgggaa 2341 cccttgagga ggtgactgat tctgggtcac caagtcctcc ccagagtcct gagggtgcct 2401 cctcttccct gactgtcacc gacagcactc tgtggagcca atccgatgag ggttccagca 2461 gcaatgaaga ggaggggcca agcacctccc cggacccagc tcacctggag tccctgttcc 2521 gggaagcact tgatgagaaa gtggctgagt tagttcgttt cctgctccgc aaatatcaaa 2581 ttaaggagcc ggtcacaaag gcagaaatgc ttgagagtgt catcaaaaat tacaagaacc 2641 actttcctga tatcttcagc aaagcctctg agtgcatgca ggtgatcttt ggcattgatg 2701 tgaaggaagt ggaccctgcc ggccactcct acatccttgt cacctgcctg ggcctctcct 2761 atgatggcct gctgggtgat gatcagagta cgcccaagac cggcctcctg ataatcgtcc 2821 tgggcatgat cttaatggag ggcagccgcg ccccggagga ggcaatctgg gaagcattga 2881 gtgtgatggg ggctgtatga tgggagggag cacagtgtct attggaagct caggaagctg 2941 ctcacccaag agtgggtgca ggagaactac ctggagtacc gccaggcgcc cggcagtgat 3001 cctgtgcgct acgagttcct gtggggtcca agggcccttg ctgaaaccag ctatgtgaaa 3061 gtcctggagc atgtggtcag ggtcaatgca agagttcgca tttcctaccc atccctgcat 3121 gaagaggctt tgggagagga gaaaggagtt tgagcaggag ttgcagctag ggccagtggg 3181 gcaggttgtg ggagggcctg ggccagtgca cgttccaggg ccacatccac cactttccct 3241 gctctgttac atgaggccca ttcttcactc tgtgtttgaa gagagcagtc acagttctca 3301 gtagtgggga gcatgttggg tgtgagggaa cacagtgtgg accatctctc agttcctgtt 3361 ctattgggcg atttggaggt ttatctttgt ttccttttgg aattgttcca atgttccttc 3421 taatggatgg tgtaatgaac ttcaacattc attttatgta tgacagtaga cagacttact 3481 gctttttata tagtttagga gtaagagtct tgcttttcat ttatactggg aaacccatgt 3541 tatttcttga attcagacac tacaagagca gaggattaag gtttttttag aaatgtgaaa 3601 caacatagca gtaaaataca tgagataaag acataaagaa attaaacaat agttaattct 3661 tgccttacct gtacctctta gtgtacccta tgtacctgaa tttgcttggc ttctttgaga 3721 atgaaattga attaaatatg aataaataag tccccctgct cactggctca ttttttccca 3781 aaatattcat tgagcttccg ctatttggaa ggccctgggt tagtattgga gatgctaca // LOCUS HSU11424 1944 bp DNA PRI 11-MAY-1995 DEFINITION Human thiopurine methyltransferase processed pseudogene (pseudoTPMT) gene, complete cds. ACCESSION U11424 NID g805081 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1944) AUTHORS Lee,D., Szumlanski,C., Houtman,J., Honchel,R., Rojas,K., Overhauser,J., Wieben,E.D. and Weinshilboum,R.M. TITLE Thiopurine methyltransferase pharmacogenetics. Cloning of human liver cDNA and a processed pseudogene on human chromosome 18q21.1 JOURNAL Drug Metab. Dispos. 23 (3), 398-405 (1995) MEDLINE 95354518 REFERENCE 2 (bases 1 to 1944) AUTHORS Szumlanski,C.L. TITLE Direct Submission JOURNAL Submitted (27-JUN-1994) Carol L. Szumlanski, Pharmacology, Mayo Medical School/Mayo Clinic/Mayo Foundation, 200 First St. SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1944 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lymphocyte genomic library in the lambda dash vector, Stratagene" /chromosome="18" /map="18q21.1" /cell_type="lymphocyte" repeat_unit 64..73 /note="5' direct repeat" gene 168..908 /gene="pseudoTPMT" CDS 168..908 /gene="pseudoTPMT" /EC_number="2.1.1.67" /codon_start=1 /function="S-methylation" /product="thiopurine methyltransferase" /db_xref="PID:g805082" /translation="MDGTRTSLDIEEYSNTEVQKNQVLTLEEWQDKWVNGNTAFHQEQ GPQLLKKHLDTFLKGESGLRVFFPLCRKEVEMKWFADRGHSVVGVEISELGIREFFTE QNLSYSEEPITEIPGTKAFKSSSGNISSYCCSIFDLPRTNIGTFDMIWDRGALVAINP GDRKCYADIMLSLLGKKFQYLLCVFLTIQLNIQVHHFMFHMLKLKGCLVKYAIYIVLR RLMLLKNDIKIGGLTIFLKSYIYLQKSK" polyA_signal 1828..1833 polyA_site 1849 repeat_unit 1880..1889 /note="3' direct repeat" BASE COUNT 644 a 339 c 370 g 591 t ORIGIN 1 taaatcagat ccctgaggac ctccacttga tatcatcttt agtcccatta atgcccttaa 61 aaacaatttg cttgggatga cactagtggc ggaggcaatg gccagcaacc ctctgtaagc 121 gaggcgtgga agacatatgc ttgtgagaaa aaggtgtcta tgaaactatg gatggtacaa 181 gaacttcact tgacattgaa gagtactcca atactgaggt acagaaaaac caagtactaa 241 ctctggaaga atggcaagac aagtgggtga acggcaacac tgcttttcat caggaacaag 301 gacctcagct attaaagaaa catttagata cttttcttaa aggagagagt ggactgaggg 361 tattttttcc tctttgcaga aaagaggttg agatgaaatg gtttgcagac cggggacaca 421 gcgtagttgg tgtggaaatc agtgaacttg ggatacgaga attttttaca gagcagaatc 481 tatcttactc agaagaacca atcaccgaaa ttcctggaac caaagcattt aagagttctt 541 cggggaacat ttcatcatac tgttgcagta tttttgatct tcccaggaca aatattggca 601 catttgacat gatttgggat agaggagcat tagttgccat taatccaggt gatcgcaaat 661 gctatgcgga tataatgtta tccctcctgg gaaagaagtt tcaatatctc ctgtgtgtct 721 ttcttacgat ccaactaaac atccaggtcc accattttat gttccacatg ctgaaattga 781 aaggttgttt ggtaaaatat gcaatataca ttgtcttgag aaggttgatg cttttgaaga 841 atgacataaa aattggggga ttgaccatct ttctgaaaag ttatatctac ttacagaaaa 901 gtaaatgaga catagataaa atcacattga catgtttttg aggaattgaa aattatgcta 961 aagcctgaaa atgtaatgga tgaatttttt aaattgttta taaatcacat gatagatcta 1021 tactaaaaat ggctttttag taaagctgtt tactttttct aaaaaagttt taggagaaaa 1081 agatgtaact aaacttttca agtagctcct ttggagagga gattatgatg tgaaagatta 1141 tgcctgtgtg tcttacagat tgcaagatat tttatcaatc agtgtgtgtt acctgtacaa 1201 ttaaaaaaat attttaaaat gcaatgcata ttaaacataa tacacacaga aaaactggca 1261 tttattttat ttttttgaga tggagtttcg ttctcgttgc ccaacctgga gtgcaatggc 1321 acaatctcag ctcactgcaa cctctgcctc ccaggttcaa gtgattctcc tgcctcagcc 1381 tcccaagtag ctgagattac aggtgtgcgc caccatgccc agctaatttt ttgtattttt 1441 agtagagaca gggtttcacc atgttggtca ggctggtctc gaactccaga cctcaggtga 1501 tctacccacc tcagcctccc aaagtgctgg gattacaggc gtgagccact gtgcctggcc 1561 tgacattctt tatgaaattt agaattgttg aaaaaaatat aacacttcag tagggttcaa 1621 ggtggtccca aaagttatat aaaagattag tttttactat aaacccttgt cttttactca 1681 gatcctagca tcccttttca catggtttct ccatatatgt aacagaatca agaaacaaat 1741 tttaattaaa caatctgtaa cagaatcaag aaacaaataa attttaatta aacaatctat 1801 atggaacaaa cattcccaaa ttctaagaat aaatttttct ttaagtttaa aacaaacaaa 1861 caaaaaaaca aaaaaaaaac aatttgcttt tctgattttg tttagattac tgtttccaag 1921 ttattgcaag tggatgaagt atac // LOCUS HSU11870 4452 bp DNA PRI 28-MAR-1995 DEFINITION Human interleukin-8 receptor type A (IL8RBA) gene, promoter and complete cds. ACCESSION U11870 NID g511804 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4452) AUTHORS Ahuja,S.K., Shetty,A., Tiffany,H.L. and Murphy,P.M. TITLE Comparison of the genomic organization and promoter function for human interleukin-8 receptors A and B JOURNAL J. Biol. Chem. 269 (42), 26381-26389 (1994) MEDLINE 95014476 REFERENCE 2 (bases 1 to 4452) AUTHORS Ahuja,S.K. TITLE Direct Submission JOURNAL Submitted (06-JUL-1994) Sunil K. Ahuja, Laboratory of Host Defenses, National Institutes of Health, National Institute of Allergy and Infectious Diseases, NIH, Bldg 10, Rm 11N109, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..4452 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="6e/IL8RA-gene" /chromosome="2" /map="2q34-35" /tissue_type="placenta" promoter 1..300 /evidence=experimental protein_bind 65..70 /bound_moiety="NF-ATp" protein_bind 96..101 /bound_moiety="NF-ATp" protein_bind 205..212 /bound_moiety="GRE" exon 301..367 /gene="IL8RA" /number=1 mRNA join(301..367,2024..4452) /gene="IL8RA" /evidence=experimental gene join(301..367,2024..4452) /gene="IL8RA" intron 368..2023 /number=1 exon 2024..4452 /gene="IL8RA" /number=2 CDS 2057..3109 /gene="IL8RA" /note="neutrophil chemoattractant receptor; G-protein coupled seven-transmembrane spanning receptor" /codon_start=1 /evidence=experimental /product="interleukin-8 receptor type A" /db_xref="PID:g511805" /translation="MSNITDPQMWDFDDLNFTGMPPADEDYSPCMLETETLNKYVVII AYALVFLLSLLGNSLVMLVILYSRVGRSVTDVYLLNLALADLLFALTLPIWAASKVNG WIFGTFLCKVVSLLKEVNFYSGILLLACISVDRYLAIVHATRTLTQKRHLVKFVCLGC WGLSMNLSLPFFLFRQAYHPNNSSPVCYEVLGNDTAKWRMVLRILPHTFGFIVPLFVM LFCYGFTLRTLFKAHMGQKHRAMRVIFAVVLIFLLCWLPYNLVLLADTLMRTQVIQES CERRNNIGRALDATEILGFLHSCLNPIIYAFIGQNFRHGFLKILAMHGLVSKEFLARH RVTSYTSSSVNVSSNL" BASE COUNT 1067 a 1112 c 1078 g 1195 t ORIGIN 1 aagcttccac aggtgatata ctagggaatt taggaataaa caaatggaaa taaattcaag 61 aaaaggaaaa taataaaaat gatcatccat agagtggaga attcagataa tggaccctca 121 accccagctt cacacctggg acccccactt ggtcatatgg accctggcag tctctaatca 181 caagtctgtg atcccttgac ttaaactgtt cttccccaaa tgtagacatg ggtggggctc 241 agaagggagg tgtcatctga tgtggtttcc ttatttccgt ttattcatca agtgccctct 301 agctgttaag tcactctgat ctctgactgc agctcctact gttggacaca cctggccggt 361 gcttcaggta ggaagccacc tctgtttgct aggactttct gtggggtagg gctgcttggc 421 ttgactttat tttggaaaat gtattcattt ctgtgggagc tgaggatttc tgctgcccgc 481 ttcctctctt gtgaccacca ctcatacatg gagttagcag gtgaacacca cagcctcctc 541 atccctcttt ttataccttg ccctttttgt gggaagaggc tgaccattcc caatttgagt 601 ttattgattc accttaagac tttgcttcag tgaaaatgca aatactggta gggagggaac 661 acactaaatg tgtggatctg agtggactca tcatacaccc tcaggcctgg cagcaaatga 721 gaggtggatg attatgtctt cctcgatttg cagatgagaa gactgaaatc tataggggta 781 cgagacccca ctcaaggtca tgagacttat ttatttattt atttatttat tttgagatgg 841 agtctcactc tgtcgcctag gctgtagtac tgtggcacga tctcggctca ctgcaacctc 901 catttccggg ttcaagtgat tctcctgcct cagcctcctg agtagttggg attacaggtg 961 cacgccacca cacccagcta atttttttgt atttttagta gagacggggt ttcatcattt 1021 tggtcaggct ggtctcgaac tcctgacttc atgatctgcc tgccttggcc tcccaaaggc 1081 tgggattaca cacgaaactt attaatggta gaatcaggat ggaatgaaga ctggatgcta 1141 ggtgtcttta caaccaacct cggtgacttt ccaaaggctt tcagacttct ctggagagtc 1201 ctggagcttt gaggggctct ttaggggcca tctgggttcg gagattagca cactctgccc 1261 cacagtggct cagcttttat ctgcttcaca tactgggctt ctgggaagat cttatttcag 1321 taaactattt cacaatagga aatatatttg aaactcttga atgattctgt ggattttttc 1381 aggggtggag ggtttcaggg accaagatgg agactttatc ttctcaaata cttaaaactc 1441 ctttagacag aggaacaaaa tatgtgctcc tcatttaaag aaggcttaaa atatacagtt 1501 taatgcaatg tgttgttatc acttccttcc cccagtagat tttaaggagg gtaaacaaga 1561 atcagggaga acataggaat agaaatagta gaaggggaca cctgggaaca ggtttgcctt 1621 cttgcatttt gcttaatgct ggcccttccc tgaatgtcta agaccaacct ggtccccaca 1681 tccaaatgca cagacacagc tgaggatgga gaaggctaaa gagggacaga ggtagagaca 1741 taggctgaga ggaggcagtt gtaggttgag ctagggctaa ggtgttttcc ccatattcca 1801 tcttacccca cactcaggcc aggccttaga gttgtggaag gtggagaaca ctgggaagcc 1861 aacctccgaa gaagaccagg ttggagtcaa aggaggaagg agagctctca ttgccaaacc 1921 aacagggaag ccaaggatat cccagtaact gctctcacat cattgatgag aatgccttga 1981 atccgagcta ctaaatcaca tttccttcct tctaaccttc cagttagatc aaaccattgc 2041 tgaaactgaa gaggacatgt caaatattac agatccacag atgtgggatt ttgatgatct 2101 aaatttcact ggcatgccac ctgcagatga agattacagc ccctgtatgc tagaaactga 2161 gacactcaac aagtatgttg tgatcatcgc ctatgcccta gtgttcctgc tgagcctgct 2221 gggaaactcc ctggtgatgc tggtcatctt atacagcagg gtcggccgct ccgtcactga 2281 tgtctacctg ctgaacctgg ccttggccga cctactcttt gccctgacct tgcccatctg 2341 ggccgcctcc aaggtgaatg gctggatttt tggcacattc ctgtgcaagg tggtctcact 2401 cctgaaggaa gtcaacttct acagtggcat cctgctgttg gcctgcatca gtgtggaccg 2461 ttacctggcc attgtccatg ccacacgcac actgacccag aagcgtcact tggtcaagtt 2521 tgtttgtctt ggctgctggg gactgtctat gaatctgtcc ctgcccttct tccttttccg 2581 ccaggcttac catccaaaca attccagtcc agtttgctat gaggtcctgg gaaatgacac 2641 agcaaaatgg cggatggtgt tgcggatcct gcctcacacc tttggcttca tcgtgccgct 2701 gtttgtcatg ctgttctgct atggattcac cctgcgtaca ctgtttaagg cccacatggg 2761 gcagaagcac cgagccatga gggtcatctt tgctgtcgtc ctcatcttcc tgctttgctg 2821 gctgccctac aacctggtcc tgctggcaga caccctcatg aggacccagg tgatccagga 2881 gagctgtgag cgccgcaaca acatcggccg ggccctggat gccactgaga ttctgggatt 2941 tctccatagc tgcctcaacc ccatcatcta cgccttcatc ggccaaaatt ttcgccatgg 3001 attcctcaag atcctggcta tgcatggcct ggtcagcaag gagttcttgg cacgtcatcg 3061 tgttacctcc tacacttctt cgtctgtcaa tgtctcttcc aacctctgaa aaccatcgat 3121 gaaggaatat ctcttctcag aaggaaagaa taaccaacac cctgaggttg tgtgtggaag 3181 gtgatctggc tctggacagg cactatctgg gttttggggg gacgctatag gatgtgggga 3241 agttaggaac tggtgtcttc aggggccaca ccaaccttct gaggagctgt tgaggtacct 3301 ccaaggaccg gcctttgcac ctccatggaa acgaagcacc atcattcccg ttgaacgtca 3361 catctttaac ccactaactg gctaattagc atggccacat ctgagccccg aatctgacat 3421 tagatgagag aacagggctg aagctgtgtc ctcatgaggg ctggatgctc tcgttgaccc 3481 tcacaggagc atctcctcaa ctctgagtgt taagcgttga gccaccaagc tggtggctct 3541 gtgtgctctg atccgagctc aggggggtgg ttttcccatc tcaggtgtgt tgcagtgtct 3601 gctggagaca ttgaggcagg cactgccaaa acatcaacct gccagctggc cttgtgagga 3661 gctggaaaca catgttcccc ttgggggtgg tggatgaaca aagagaaaga gggtttggaa 3721 gccagatcta tgccacaaga acccccttta cccccatgac caacatcgca gacacatgtg 3781 ctggccacct gctgagcccc aagtggaacg agacaagcag cccttagccc ttcccctctg 3841 cagcttccag gctggcgtgc agcatcagca tccctagaaa gccatgtgca gccaccagtc 3901 cattgggcag gcagatgttc ctaataaagc ttctgttccg tgcttgtccc tgtggaagta 3961 tcttggttgt gacagagtca agggtgtgtg cagcattgtt ggctgttcct gcagtagaat 4021 gggggcagca cctcctaaga aggcacctct ctgggttgaa gggcagtgtt ccctggggct 4081 ttaactcctg ctagaacagt ctcttgaggc acagaaactc ctgttcatgc ccatacccct 4141 ggccaaggaa gatccctttg tccacaagta aaaggaaatc ctcctccagg gagtctcagc 4201 ttcaccctga ggtgagcatc atcttctggg ttaggccttg cctaggcata gcctgcctca 4261 agctatgtga gctcaccagt ccctccccaa atgctttcca tgagttgcag ttttttccta 4321 gtctgttttc cctccttgga gaacagggcc ctgtcggttt gttcactgta tgtccttggt 4381 gcctggagcc tactaaatgc tcaataaata atgatcacag gaatgaatgc atgctgaaaa 4441 gaccactctt tt // LOCUS HSU13666 1438 bp DNA PRI 01-APR-1995 DEFINITION Human G protein-coupled receptor (GPR1) gene, complete cds. ACCESSION U13666 L35539 NID g577412 KEYWORDS G protein-coupled receptor; intronless. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1438) AUTHORS Marchese,A., Docherty,J.M., Nguyen,T., Heiber,M., Cheng,R., Heng,H.H., Tsui,L.C., Shi,X., George,S.R. and O'Dowd,B.F. TITLE Cloning of human genes encoding novel G protein-coupled receptors JOURNAL Genomics 23 (3), 609-618 (1994) MEDLINE 95154831 REFERENCE 2 (bases 1 to 1438) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (15-AUG-1994) Brian F. O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario, M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1438 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q21.6" gene 227..1294 /gene="GPR1" CDS 227..1294 /gene="GPR1" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g577413" /translation="MEDLEETLFEEFENYSYDLDYYSLESDLEEKVQLGVVHWVSLVL YCLAFVLGIPGNAIVIWFTGLKWKKTVTTLWFLNLAIADFIFLLFLPLYISYVAMNFH WPFGIWLCKANSFTAQLNMFASVFFLTVISLDHYIHLIHPVLSHRHRTLKNSLIVIIF IWLLASLIGGPALYFRDTVEFNNHTLCYNNFQKHDPDLTLIRHHVLTWVKFIIGYLFP LLTMSICYLCLIFKVKKRTVLISSRHFWTILVVVVAFVVCWTPYHLFSIWELTIHHNS YSHHVMQAGIPLSTGLAFLNSCLNPILYVLISKKFQARFRSSVAEILKYTLWEVSCSG TVSEQLRNSETKNLCLLETAQ" BASE COUNT 340 a 333 c 278 g 487 t ORIGIN 1 gggctgcagt gagccaaaag catgccattg cactccagct tgggcaacag agtgagaccc 61 tgtctcaaaa aaaagaaaaa ataatactat gtctggtcca taacctgaaa tatttttatc 121 ttcacgttcc ttatcattca ctgaactttt atttttcttt taaaattttt tcctttcttt 181 ttaaatttgc ttctacagat ttcttcattc tccatttagc aaggtcatgg aagatttgga 241 ggaaacatta tttgaagaat ttgaaaacta ttcctatgac ctagactatt actctctgga 301 gtctgatttg gaggagaaag tccagctggg agttgttcac tgggtctccc tggtgttata 361 ttgtttggct tttgttctgg gaattccagg aaatgccatc gtcatttggt tcacggggct 421 caagtggaag aagacagtca ccactctgtg gttcctcaat ctagccattg cggatttcat 481 ttttcttctc tttctgcccc tgtacatctc ctatgtggcc atgaatttcc actggccctt 541 tggcatctgg ctgtgcaaag ccaattcctt cactgcccag ttgaacatgt ttgccagtgt 601 ttttttcctg acagtgatca gcctggacca ctatatccac ttgatccatc ctgtcttatc 661 tcatcggcat cgaaccctca agaactctct gattgtcatt atattcatct ggcttttggc 721 ttctctaatt ggcggtcctg ccctgtactt ccgggacact gtggagttca ataatcatac 781 tctttgctat aacaattttc agaagcatga tcctgacctc actttgatca ggcaccatgt 841 tctgacttgg gtgaaattta tcattggcta tctcttccct ttgctaacaa tgagtatttg 901 ctacttgtgt ctcatcttca aggtgaagaa gcgaacagtc ctgatctcca gtaggcattt 961 ctggacaatt ctggttgtgg ttgtggcctt tgtggtttgc tggactcctt atcacctgtt 1021 tagcatttgg gagctcacca ttcaccacaa tagctattcc caccatgtga tgcaggctgg 1081 aatccccctc tccactggtt tggcattcct caatagttgc ttgaacccca tcctttatgt 1141 cctaattagt aagaagttcc aagctcgctt ccggtcctca gttgctgaga tactcaagta 1201 cacactgtgg gaagtcagct gttctggcac agtgagtgaa cagctcagga actcagaaac 1261 caagaatctg tgtctcctgg aaacagctca ataagttatt acttttccac aaatcagtat 1321 atggcttttt atgtgggtcc tctgactgat gctttcagat taaaattgtt tccaagatag 1381 agagccgact ccactttcat agttattgtt tctggtcaca tatatggcat cacatttt // LOCUS HSU13695 3063 bp DNA PRI 23-MAR-1995 DEFINITION Human homolog of yeast mutL (hPMS1) gene, complete cds. ACCESSION U13695 NID g535512 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3063) AUTHORS Nicolaides,N.C., Papadopoulos,N., Liu,B., Wei,Y.-F., Carter,K.C., Ruben,S.M., Rosen,C.A., Haseltine,W.H., Fleischmann,R.D., Fraser,C.M., Adams,M.D., Venter,J.C., Dunlop,M.G., Hamilton,S.R., Petersen,G.M.,de la Chapelle, Vogelstein,B. and Kinzler,K.W. TITLE Mutations of two PMS homologues in hereditary nonpolyposis colon cancer JOURNAL Nature 371 (6492), 75-80 (1994) MEDLINE 94352394 REFERENCE 2 (bases 1 to 3063) AUTHORS Wei,Y.-F. TITLE Direct Submission JOURNAL Submitted (16-AUG-1994) Ying-Fei Wei, Molecular Biology, Human Genome Sciences, Inc., 9620 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..3063 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p2" /tissue_type="gall bladder" /dev_stage="adult" gene 81..2879 /gene="hPMS1" CDS 81..2879 /gene="hPMS1" /note="homolog of yeast mutL gene" /codon_start=1 /function="DNA mismatch repair" /db_xref="PID:g535513" /translation="MKQLPAATVRLLSSSQIITSVVSVVKELIENSLDAGATSVDVKL ENYGFDKIEVRDNGEGIKAVDAPVMAMKYYTSKINSHEDLENLTTYGFRGEALGSICC IAEVLITTRTAADNFSTQYVLDGSGHILSQKPSHLGQGTTVTALRLFKNLPVRKQFYS TAKKCKDEIKKIQDLLMSFGILKPDLRIVFVHNKAVIWQKSRVSDHKMALMSVLGTAV MNNMESFQYHSEESQIYLSGFLPKCDADHSFTSLSTPERSFIFINSRPVHQKDILKLI RHHYNLKCLKESTRLYPVFFLKIDVPTADVDVNLTPDKSQVLLQNKESVLIALENLMT TCYGPLPSTNSYENNKTDVSAADIVLSKTAETDVLFNKVESSGKNYSNVDTSVIPFQN DMHNDESGKNTDDCLNHQISIGDFGYGHCSSEISNIDKNTKNAFQDISMSNVSWENSQ TEYSKTCFISSVKHTQSENGNKDHIDESGENEEEAGLENSSEISADEWSRGNILKNSV GENIEPVKILVPEKSLPCKVSNNNYPIPEQMNLNEDSCNKKSNVIDNKSGKVTAYDLL SNRVIKKPMSASALFVQDHRPQFLIENPKTSLEDATLQIEELWKTLSEEEKLKYEEKA TKDLERYNSQMKRAIEQESQMSLKDGRKKIKPTSAWNLAQKHKLKTSLSNQPKLDELL QSQIEKRRSQNIKMVQIPFSMKNLKINFKKQNKVDLEEKDEPCLIHNLRFPDAWLMTS KTEVMLLNPYRVEEALLFKRLLENHKLPAEPLEKPIMLTESLFNGSHYLDVLYKMTAD DQRYSGSTYLSDPRLTANGFKIKLIPGVSITENYLEIEGMANCLPFYGVADLKEILNA ILNRNAKEVYECRPRKVISYLEGEAVRLSRQLPMYLSKEDIQDIIYRMKHQFGNEIKE CVHGRPFFHHLTYLPETT" BASE COUNT 1100 a 503 c 580 g 880 t ORIGIN 1 ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 61 ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 121 gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 181 atgctggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 241 tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 301 acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 361 gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 421 ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 481 cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 541 taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 601 atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 661 aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 721 tggggactgc tgttatgaac aatatggaat cctttcagta ccactctgaa gaatctcaga 781 tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctttcaa 841 caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 901 agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 961 ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1021 aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1081 cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1141 ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1201 aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1261 tgcataatga tgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1321 gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1381 atgcatttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1441 gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 1501 atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1561 ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1621 ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1681 caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag 1741 ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1801 ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 1861 ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1921 aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1981 aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 2041 taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2101 atcaaccaaa acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2161 ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 2221 acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 2281 atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 2341 aagccctgct atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 2401 agccaattat gttaacagag agtcttttta atggatctca ttatttagac gttttatata 2461 aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 2521 cagcgaatgg tttcaagata aaattgatac caggagtttc aattactgaa aattacttgg 2581 aaatagaagg aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 2641 ttaatgctat attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 2701 taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 2761 aagaggacat ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 2821 agtgtgttca tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 2881 taaatatgtt taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 2941 tctggtttta aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 3001 ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3061 aac // LOCUS HSU13696 2771 bp DNA PRI 23-MAR-1995 DEFINITION Human homolog of yeast mutL (hPMS2) gene, complete cds. ACCESSION U13696 NID g535514 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2771) AUTHORS Nicolaides,N.C., Papadopoulos,N., Liu,B., Wei,Y.-F., Carter,K.C., Ruben,S.M., Rosen,C.A., Haseltine,W.H., Fleischmann,R.D., Fraser,C.M., Adams,M.D., Venter,J.C., Dunlop,M.G., Hamilton,S.R., Petersen,G.M., de la Chapelle, Vogelstein,B. and Kinzler,K.W. TITLE Mutations of two PMS homologues in hereditary nonpolyposis colon cancer JOURNAL Nature 371 (6492), 75-80 (1994) MEDLINE 94352394 REFERENCE 2 (bases 1 to 2771) AUTHORS Wei,Y.-F. TITLE Direct Submission JOURNAL Submitted (16-AUG-1994) Ying-Fei Wei, Molecular Biology, Human Genome Sciences, Inc., 9620 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..2771 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p2" /tissue_type="endometrial tumor" /dev_stage="adult" gene 25..2613 /gene="hPMS2" CDS 25..2613 /gene="hPMS2" /note="homolog of yeast mutL gene" /codon_start=1 /function="DNA mismatch repair" /db_xref="PID:g535515" /translation="MERAESSSTEPAKAIKPIDRKSVHQICSGQVVLSLSTAVKELVE NSLDAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLTLKHHTSKIQEFADLTQVE TFGFRGEALSSLCALSDVTISTCHASAKVGTRLMFDHNGKIIQKTPYPRPRGTTVSVQ QLFSTLPVRHKEFQRNIKKEYAKMVQVLHAYCIISAGIRVSCTNQLGQGKRQPVVCTG GSPSIKENIGSVFGQKQLQSLIPFVQLPPSDSVCEEYGLSCSDALHNLFYISGFISQC THGVGRSSTDRQFFFINRRPCDPAKVCRLVNEVYHMYNRHQYPFVVLNISVDSECVDI NVTPDKRQILLQEEKLLLAVLKTSLIGMFDSDVNKLNVSQQPLLDVEGNLIKMHAADL EKPMVEKQDQSPSLRTGEEKKDVSISRLREAFSLRHTTENKPHSPKTPEPRRSPLGQK RGMLSSSTSGAISDKGVLRPQKEAVSSSHGPSDPTDRAEVEKDSGHGSTSVDSEGFSI PDTGSHCSSEYAASSPGDRGSQEHVDSQEKAPETDDSFSDVDCHSNQEDTGCKFRVLP QPTNLATPNTKRFKKEEILSSSDICQKLVNTQDMSASQVDVAVKINKKVVPLDFSMSS LAKRIKQLHHEAQQSEGEQNYRKFRAKICPGENQAAEDELRKEISKTMFAEMEIIGQF NLGFIITKLNEDIFIVDQHATDEKYNFEMLQQHTVLQGQRLIAPQTLNLTAVNEAVLI ENLEIFRKNGFDFVIDENAPVTERAKLISLPTSKNWTFGPQDVDELIFMLSDSPGVMC RPSRVKQMFASRACRKSVMIGTALNTSEMKKLITHMGEMDHPWNCPHGRPTMRHIANL GVISQN" BASE COUNT 826 a 603 c 664 g 678 t ORIGIN 1 cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 61 aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 121 ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 181 aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 241 tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 301 caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 361 tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 421 actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 481 agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 541 tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 601 atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 661 cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 721 cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 781 gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 841 atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 901 aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 961 tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1021 gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1081 gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1141 agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1201 gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1261 aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1321 aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 1381 atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1441 gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1501 gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1561 agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1621 gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 1681 tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1741 accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1801 aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1861 aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1921 catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1981 tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2041 tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2101 gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2161 cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2221 gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2281 tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2341 agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2401 agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 2461 cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2521 cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2581 atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2641 tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2701 atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 2761 cttttcaaac c // LOCUS HSU15128 3414 bp DNA PRI 09-FEB-1996 DEFINITION Human beta-1,2-N-acetylglucosaminyltransferase II (MGAT2) gene, complete cds. ACCESSION U15128 L36537 NID g902744 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3414) AUTHORS Tan,J., D'Agostaro,A.F., Bendiak,B., Reck,F., Sarkar,M., Squire,J.A., Leong,P. and Schachter,H. TITLE The human UDP-N-acetylglucosamine: alpha-6-D-mannoside-beta-1,2- N-acetylglucosaminyltransferase II gene (MGAT2). Cloning of genomic DNA, localization to chromosome 14q21, expression in insect cells and purification of the recombinant protein JOURNAL Eur. J. Biochem. 231 (2), 317-328 (1995) MEDLINE 95361854 REFERENCE 2 (bases 1 to 3414) AUTHORS Schachter,H. TITLE Direct Submission JOURNAL Submitted (23-SEP-1994) Harry Schachter, Biochemistry, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada FEATURES Location/Qualifiers source 1..3414 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHG30 and pHG36" /clone_lib="Clontech library in lambdaEMBL-3 SP6/T7 (catalog #HL1111j)" /chromosome="14" /map="14q21" /cell_type="leukocyte" /dev_stage="adult" gene 683..2026 /gene="MGAT2" CDS 683..2026 /gene="MGAT2" /EC_number="2.4.1.143" /note="GlcNAc-transferase II" /codon_start=1 /product="beta-1,2-N-acetylglucosaminyltransferase II" /db_xref="PID:g902745" /translation="MRFRIYKRKVLILTLVVAACGFVLWSSNGRQRKNEALAPPLLDA EPARGAGGRGGDHPSVAVGIRRVSNVSAASLVPAVPQPEADNLTLRYRSLVYQLNFDQ TLRNVDKAGTWAPRELVLVVQVHNRPEYLRLLLDSLRKAQGIDNVLVIFSHDFWSTEI NQLIAGVNFCPVLQVFFPFSIQLYPNEFPGSDPRDCPRDLPKNAALKLGCINAEYPDS FGHYREAKFSQTKHHWWWKLHFVWERVKILRDYAGLILFLEEDHYLAPDFYHVFKKMW KLKQQECPECDVLSLGTYSASRSFYGMADKVDVKTWKSTEHNMGLALTRNAYQKLIEC TDTFCTYDDYNWDWTLQYLTVSCLPKFWKVLVPQIPRIFHAGDCGMHHKKTCRPSTQS AQIESLLNNNKQYMFPETLTISEKFTVVAISPPRKNGGWGDIRDHELCKSYRRLQ" polyA_signal 2092 polyA_signal 2712 polyA_signal 2870 repeat_region 3329..>3414 /note="Alu fragment 1-192 (numbering according to Alu general consensus)" /rpt_type=tandem /rpt_family="Alu-J" BASE COUNT 900 a 756 c 863 g 895 t ORIGIN 1 cccttcgcac gtctcgcctt tcgcacgtct cgcctaacag gaaagggaag aaagaggcgg 61 aagtgggaac tgcacctgag cgacagtact gcaaaccaat aggcagccgg ccacggcggt 121 caggcgcctt cggtcgcgtc tggaaagcac caaccaacgg tctaaggggc gggccggagg 181 ggtgtgggcc ggagggcgcg gtgtgccgcg gggcagttgc gggttgtcat aacggtcccc 241 gccggagtga ggcgaggccg cgtcgctcag ttctggccgt ctagggcccc tgtaaggatg 301 agagcgcaga ggacgcaggg ccgctggagg cgcaggtaac gaagctaggg tgcggttggg 361 accgcggctg agctttttcc gggacccgtg gtgctgaatg gagaggacgg agacgaagcc 421 gagccgcggc tcctagcggc ggcgccgatg ctcgagctgt agctgccagg cgaggatgtg 481 tggagcgcag gcggcgcggg gtaaatgaga ggtctcgggc cccaggaccc ccggggcccg 541 ggatgagtta gcgagggcag ccgcgggggc cagttccgac cgtgacaggc caaggcgacg 601 gccgccgccc gcccgcccct tccgtgcaga agcagctgct cctttccgcg cccgcccgcc 661 tgcgctcccg gccctggaga ccatgaggtt ccgcatctac aaacggaagg tgctaatcct 721 gacgctcgtg gtggccgcct gcggcttcgt cctctggagc agcaatgggc gacaaaggaa 781 gaacgaggcc ctcgccccac cgttgctgga cgccgaaccc gcgcggggtg ccggcggccg 841 cggtggggac cacccctctg tggctgtggg catccgcagg gtctccaacg tgtcggcggc 901 ttccctggtc ccggcggtcc cccagcccga ggcggacaac ctgacgctgc ggtaccggtc 961 cctggtgtac cagctgaact ttgatcagac cctgaggaat gtagataagg ctggcacctg 1021 ggccccccgg gagctggtgc tggtggtcca ggtgcataac cggcccgaat acctcagact 1081 gctgctggac tcacttcgaa aagcccaggg aattgacaac gtcctcgtca tctttagcca 1141 tgacttctgg tcgaccgaga tcaatcagct gatcgccggg gtgaatttct gtccggttct 1201 gcaggtgttc tttcctttca gcattcagtt gtaccctaac gagtttccag gtagtgaccc 1261 tagagattgt cccagagacc tgccgaagaa tgccgctttg aaattggggt gcatcaatgc 1321 tgagtatccc gactccttcg gccattatag agaggccaaa ttctcccaga ccaaacatca 1381 ctggtggtgg aagctgcatt ttgtgtggga aagagtgaaa attcttcgag attatgctgg 1441 ccttatactt ttcctagaag aggatcacta cttagcccca gacttttacc atgtcttcaa 1501 aaagatgtgg aaactgaagc agcaagagtg ccctgaatgt gatgttctct ccctggggac 1561 ctatagtgcc agtcgcagtt tctatggcat ggctgacaag gtagatgtga aaacttggaa 1621 atccacagag cacaatatgg gtctagcctt gacccggaat gcctatcaga agctgatcga 1681 gtgcacagac actttctgta cttatgatga ttataactgg gactggactc ttcaatactt 1741 gactgtatct tgtcttccaa aattctggaa agtgctggtt cctcaaattc ctaggatctt 1801 tcatgctgga gactgtggta tgcatcacaa gaaaacctgt agaccatcca ctcagagtgc 1861 ccaaattgag tcactcttaa ataataacaa acaatacatg tttccagaaa ctctaactat 1921 cagtgaaaag tttactgtgg tagccatttc cccacctaga aaaaatggag ggtggggaga 1981 tattagggac catgaactct gtaaaagtta tagaagactg cagtgaaaat cacagttaca 2041 aaagcgacag tcttctattt ttgatatttg tccaaacagg acatacaatt gaataaaaga 2101 gtttaggaac tggtttctgc tttaatacaa aaacaaaatc ttgtaaaagg tgtccaaata 2161 catagtaatc ttttccagtt atgtctgatt aagatttaaa actgaaggtt tcattttggg 2221 agtagggttt taaagctcaa tctgttatct gctaaaattg attattgttg atatgagaga 2281 agaggggaaa ttttatttaa attgcattta ttaatctttt tatctgaaac tttgtacact 2341 tttccacttt caaaacctat tttaagtaca gcaaaattta tttaaaactg tgatagcagt 2401 aaaaagtatt acgatgaaat tgttagggta ttaatggaac aaacccagtt tcactctctt 2461 gacacactta ttaggaaggg attgcttcac tggtttaata atttaaaagt tatgtttgtt 2521 aaacaccctg tcagaacagt cattttcagt attagattcc tgtactattg tgttttgagt 2581 gtgttttgga accttcatag aacacacttt cttttggaat gtatttgatt gataagaaag 2641 tttaaacatt gttttcacct caatgtagaa atacagtggt tttgtttttt tttttctttt 2701 agtgctgaca aaataaaata ctcatttttg cataaaaagg ttcctaatcc ttttgcagaa 2761 taagttttgt ttactcttta taccaaaatt cagtgaaggc attctacaag ttttgagtta 2821 gcattacatt ttaatattta ctattgctac attgtataat tgagtttgaa ataaaaccca 2881 gcttatgaca atgcattccc tgtgcaagaa actgtttggc tttcaaatta cccaggcatt 2941 gaaaatgaat gataaaaagt tgctgtgtaa gggaaataca gcctaaatgt tttgaaagcc 3001 agaaatgata caaagttcag tcatgccaaa gtgaaatact ttctagtgcc agctttaact 3061 taaatcatac gttttaaaag gacagataca gaaaattata ggaaacaggc ttaaattttg 3121 ctccatattt aatgtagacg tttatagaag tttcccttaa tttgtaattg cattcaaccg 3181 agaatttctc ataaaagact aatttctgtg taaagatatt acgggctggg tgtggtggct 3241 catgtctgta atccagcact ctgggaggtt gaggcaggac gattgcttga actcagagtt 3301 tgagaccagc ctgggcaaca tggcgaaaaa cccatctcta ctaaaaataa caaaaaatta 3361 gccgggcgta gtggtgactc tgtagtccca gctacttgag aggctgaggt ggga // LOCUS HSU16812 6478 bp DNA PRI 19-AUG-1995 DEFINITION Human Bak-2 gene, complete cds. ACCESSION U16812 NID g595925 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6478) AUTHORS Kiefer,M.C., Brauer,M.J., Powers,V.C., Wu,J.J., Umansky,S.R., Tomei,L.D. and Barr,P.J. TITLE Modulation of apoptosis by the widely distributed Bcl-2 homologue Bak JOURNAL Nature 374 (6524), 736-739 (1995) MEDLINE 95231654 REFERENCE 2 (bases 1 to 6478) AUTHORS Kiefer,M.C. TITLE Direct Submission JOURNAL Submitted (02-NOV-1994) Michael C. Kiefer, Mol. Biol., LXR Biotechnology Inc., 1401 Marina Way South, Richmond, CA 94804, USA FEATURES Location/Qualifiers source 1..6478 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="A4" /clone_lib="pWE 15/Human genomic DNA" /chromosome="20" gene 3279..3914 /gene="Bak-2" CDS 3279..3914 /gene="Bak-2" /codon_start=1 /product="Bak-2 protein" /db_xref="PID:g595926" /translation="MASGQGPGPPRQECGEPALPSASEEQVAQDTEEVFRSYVFYHHQ QEQEAEGAAAPADPEMVTLPLQPSSTMGQVGRQLAIIGDDINRRYDSEFQTMLQHLQP TAENAYEYFTKIASSLFESGINWGRVVALLGFSYRLALHIYQRGLTGFLGQVTRFVVD FMLHHCIARWIAQRGGWVAALNLGNGPILNVLVVLGVVLLGQFVVRRFFKS" BASE COUNT 1503 a 1609 c 1599 g 1767 t ORIGIN 1 gatcctcctg ccttggtctc ccaaaatgtt gagattatag gcatgagccc accacgcctg 61 gctggggttt tgtttttgtt tttttaacat ttgtcacatt tcacaaaggt atttcagaat 121 ctctgagaaa agtgctataa tgtctaatga tactttatat ttggacagca ctttcgtttg 181 ttttttttgg cggggggggt gggagaagtc aagtaactta catatagtga aatttaccct 241 tcttgagtat gcagttcagt gagttttgat aaatgtgtaa tggtagtgta atcactacca 301 cagtcaagac atggacaatt ttcattaccc cacgaaggtc cctcatgtgt ggttagagtc 361 agccctccca tcagcacagt cctggcagcc actgacctgg tttctgtccc tactgttttg 421 ccttttccag aatgtcattt aagtgacatc attcattatg gagacttgtt ttatttttta 481 ttttttattt tttgagaagg agtctcgctc ttgttgccca ggctagagtg caatgctgtg 541 atttcggctc actgcaacct ccgcctcccg ggttcaagtg attctcctgc ctcagcctcc 601 cgagtagttc gggaccacag gcgtacacca ccatgcccag ctaatttttt ttttttttga 661 gatggagtct cgccctgtca cccaggctag agtccagtgg catgatctcg cctcactgcc 721 aagctcctgc ctcccgggtt tcacgccatt cccctgcctc agcctcccga gtatgcccgg 781 ctaatttttg tatttttagt agagacgggg tttccccatg ttggccaggc tagtctcaaa 841 ctcctgacct caagtaatcc gcctgccttg gctcccaaag tgctgggatt acaggtgtga 901 gccaccgcgc ccagcccatt atgtagcttt ttgtgcccca cttctcccac ttagcataat 961 gctttttgag attcatctgt attactagtg catctgtagt tctttccttt ttatggcttg 1021 ggttgttttt tgtttttgtt tttgtttttg agatagggtc tcactctgtt gcccagggtg 1081 gagtgcacta tcgcagctct ccgcaacctc cacctcccag gctcaagata tcctcccacc 1141 tcagcctcct gagtagctgg aaatacaagt gtgtgtgcca ccatgccggt taattttttt 1201 ttcttttttt tttttttttt tcaatttttg ttggaagcac catggagccg cctgagcctg 1261 gctgagccta aaagccctgt ggtgcatgcc tggccaattt ttgtattttt tagtagagac 1321 gggattttgc catgtcgccc aggctggtct ggaactcctg gtctcaggtg attctcctgc 1381 ttcggcctcc caagtagctg gggttacagg catgtgccac catgctcagc cctcccgtca 1441 gcacagtcct ggcagccact ggcctggttt ctgtccctac tgttttgcct tttactggtc 1501 tccatgctca cctaaatttt tttttatttt ttgtagagac agattctcgc aatgttgctc 1561 aggctagtct cgaactcccg gcttcaagca atcctcccac ctcagtcctc caaagttctg 1621 gaattacagg catgaatcac tgtgccagtg ttgcattcca aagcatggat acatcacagt 1681 ttttaaaatg tttaccaatg taaatggccc gttttaatat gagaataact aatgttgaag 1741 agagtgctac aaagaggatc atggtcgtga tgtgtagaca caaaaatgag gttagaagac 1801 agtaaggtga gggccgggca cagtggctca tgcttgtaat cctagcactt tgggaggtgc 1861 aggtgggagg attgcttgag gccaggagct ggagacaagt ccaggcaaca cagcgagtcc 1921 ctgcctttat aaaaaatcag aaattaaaaa agccttggcg gtggctcacg cctgtaatcc 1981 cagcactttg ggaggccgag acgggtggat cacgaagtca ggagttcaag accagcttgg 2041 ccaagatggt gaaaccctgt ctctactaaa aataaaaaaa aaaaattagc cagtcgtggt 2101 ggtggcacct gtaatcccag ctactcagga ggctgaggca ggagaatcgc ttgaacccag 2161 gaggcggagt ttgcagtaag ccaaggtgcg ccactgcact ccagcctggg caacagagta 2221 agactctgtc tcaaaaaaaa acaaaaaaca aaaaaacaaa aaaaaaacag gccggcgcag 2281 tggctcatgc ctataatcca agtactttgg gaggccaagg caggcggatc gcaaagtcag 2341 gagttcgaga ccagcctggc caatatggtg aaaccctgtt tctgctaaaa atacaaaaaa 2401 tagccaggtg tggtgggaag cgcctgtagt cccagctact caggaggctg aggcaggaga 2461 atcgcttgaa cccgggaggc agaagttgca gtgagctgag attgcgccac tgcactccag 2521 cctgggcaac agagcgagac tccatctcaa agaaaaaaaa gccaaaacat agtaaggtga 2581 gggtgaaact tctcttttaa aaaaatgttt acatagaaac aaactaaatg gacaaaatgg 2641 atataaacaa aaatgttatc ggtggttatt tttgggcagt agaattatag gtttttaatt 2701 tcttttgctt atttatagtt tcaaaaattt tcaattttaa tataaattaa tgtgctctat 2761 ttatagagac aatacatgaa atatacttaa taaaaattca aatgttatag aactgaaaaa 2821 gatgaaaagt aaaaacaacc tattccccag aggtagccac tgtccatagt ttctatttta 2881 gattctttcc tttatacaag attattatag cttctatttt ttggtgtatg aactgtagtc 2941 ctagaggatt ttattagtta tgagttctat aactaagatc catcatctta gttgctaaga 3001 acgtagatac tgagaacatc atttaaaaaa acatttttgg ctggcacctc tatgatcact 3061 ggagtctcgc gggtccctca ggctgcacag ggacaagtaa aggctacatc cagatgctgg 3121 gaatgcactg acgcccattc ctggaaactg ggctcccact cagcccctgg gagcagcagc 3181 cgccagcccc tcgggacctc catctccacc ctgctgagcc acccgggttg ggccaggatc 3241 ccggcaggct gatcccgtcc tccactgaga cctgaaaaat ggcttcgggg caaggcccag 3301 gtcctcccag gcaggagtgc ggagagcctg ccctgccctc tgcttctgag gagcaggtag 3361 cccaggacac agaggaggtt ttccgcagct acgtttttta ccaccatcag caggaacagg 3421 aggctgaagg ggcggctgcc cctgccgacc cagagatggt caccttacct ctgcaaccta 3481 gcagcaccat ggggcaggtg ggacggcagc tcgccatcat tggggacgac atcaaccgac 3541 gctatgactc agagttccag accatgttgc agcacctgca gcccacggca gagaatgcct 3601 atgagtactt caccaagatt gcctccagcc tgtttgagag tggcatcaat tggggccgtg 3661 tggtggctct tctgggcttc agctaccgtc tggccctaca catctaccag cgtggcctga 3721 ctggcttcct gggccaggtg acccgctttg tggtggactt catgctgcat cactgcattg 3781 cccggtggat tgcacagagg ggtggctggg tggcagccct gaacttgggc aatggtccca 3841 tcctgaacgt gctggtggtt ctgggtgtgg ttctgttggg ccagtttgtg gtacgaagat 3901 tcttcaaatc atgactccca agggtgccct ttggggtccc agttcagacc cctgcctgga 3961 cttaagcgaa gtctttgcct tctctgctcc ttgcaggggt cccccctcaa gagtacagaa 4021 gctttagcaa gtgtgcactc cagcttcgga gggcccctgt gtgggggcca gtcaggctgc 4081 agaggcacct caacattcca tggtgctagt gggccctctc tctgggccca ggggctgtgg 4141 cgtctcctcc ctcagctctc tgggacctcc ttagccctgt ctgctaggcg ctggggagac 4201 tgataacttg gggaggcaag agactgggag ccacttctcc ccagaaagtg tttaatggtt 4261 ttagcttttt ataataccct tgtgagagcc cattcccacc attctacctg aggccaggac 4321 gtctggggtg tggggattgg tgtgtctatg ttccccagga ttcagctatt ctggaagatc 4381 agcaccctaa gagatgggac taggacctga gcctggtcct ggccgtccct aagcatgtgt 4441 cccaggagca ggacctacta ggagaggggg gccaaggtcc tgctcaactc tacccctgct 4501 cccattcctc cctccggcca tactgccttt gcagttggac tctcagggat tctgggcttg 4561 gggtgtgggg tggggtggag tcgagaccag agctgtctga actcatgtgt cagaagccct 4621 ccaagcctgc ctcccagggt cctctcagtt ctctcccttc ctctctcctt atagacactt 4681 gctcccaacc cattcactac aggtgaaggc tcctcacccc catccctggg ccttgggtga 4741 gtaacctgct aaggcctcct tgcccagact acagggctta ggacttggtt tgttatttca 4801 gggaaaagga gtagggagtt catctggagg gttctaagtg ggagaaggac tatcaacacc 4861 actaggaatc ccagaggtgg gatcctccct catggctctg gcacagtgta atccaggggt 4921 gtagatgggg gaactgtgaa tacttgaact ctgttccccc accctccatg ctcctcacct 4981 gtctaggtct cctcagggtg gggggtgaga gtgccttctc tattgggcac agcctagggt 5041 cttgggggtc ggggggagaa gttcttgatt cagccaaatg cagggagggg aggcagatgg 5101 agcccatagg ccacctccta tcctctgagt gtttggaaat aaactgtgca atcccctcaa 5161 aaaaataaaa ataaaaaaaa taaaaataaa aaaacatttt tttcaagcag ggagtggtgg 5221 ctcccgcctg taatcccagc actttgggag gccaaggcgt gcagattgct tcagttcagg 5281 agttcaagac cagcctggga aacatggtga aaccccatct ctactaaaaa taaaaaatta 5341 gccaggcata gtgtcgcgca cctgtactcc cagctatttg ggaggctgag gtaggagaat 5401 tgcttgaacc caggaggtgg aggttgcagt gagctgagat caggccactg cactccaacg 5461 taggtgacag agatagcctc cttctaaaaa aacaaccttt tttccagcca aaacaactga 5521 acttcctccc cactgaccac ctcaattatt tctagatgcc ttgttgctgt ccagactgcg 5581 gtgattccct gggctgatct gagcccgtgg cctgagtcat ttgcagttcc tctagcaggt 5641 ggtcccccat gtcatggccc ctgtgaaacc agttccttac catctctgtt catcgctgct 5701 ccctaagtta ggccctgcat gtcttgaggg taggttagat tcagaaaagc tttggtcgca 5761 tcactgcttt cataaactca aatgagaggg agggagggaa ggcaggaaga agggagggag 5821 tcctttctct cccacagtgt gcattacctc atgtaacact tcttgctaat gtggtagaat 5881 gtgtttgact ttgaatgaga cttgggttta tttttattta tttatttatt tatttattta 5941 tttattttga gatggagttt cactcttgtt gcccaggctg gagtgtagtg gcacgatctc 6001 tactcattgc accctccgcc ttccaggttc aaacgattct cctgcctcag cctcccaagt 6061 agctgggatt acaggggcat gccaccatgc ccagctaatt tttgtatttt tagtagggac 6121 ggggtttcac catgttgacc aggctggtct ggaactcctg atctcaggtg atccacctgc 6181 ctcggcctcc caaagtgttg ggattacagg cgtgagccac cgtgcctggc ctgagactta 6241 aatccatctc ttttttcttc ttctttttga gacagagcct cattctgttc cccatgctgg 6301 agttcagtgg cgtgattttg gctcactgca accttggcca tctgggtttg agcaattctc 6361 gtgcctcagc ctcctgagta gctggcacta tagtcacatg ccaccacgcc cggctaactt 6421 ttttgtattt ttagtagaga cagggtttca ctatgttagc caggctggtc tcgaattc // LOCUS HSU17894 2115 bp DNA PRI 02-MAR-1995 DEFINITION Human alpha(1,2)fucosyltransferase (FUT2) gene, complete cds. ACCESSION U17894 NID g687618 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2115) AUTHORS Lowe,J.B. TITLE Sequence and expression of a candidate for the human Secretor blood group alpha(1,2)fucosyltransferase gene (FUT2); homozygosity for an enzyme-inactivating nonsense mutation commonly correlates with the non-secretor phenotype JOURNAL J. Biol. Chem. (1995) In press REFERENCE 2 (bases 1 to 2115) AUTHORS Lowe,J.B. TITLE Direct Submission JOURNAL Submitted (30-NOV-1994) John B Lowe, Pathology, Howard Hughes Medical Institute, University of Michigan, MSRBI, Room 3510, 1150 West Medical Center Drive, Ann Arbor, MI 48109-0650, USA FEATURES Location/Qualifiers source 1..2115 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" gene 64..1095 /gene="FUT2" CDS 64..1095 /gene="FUT2" /note="member of the secretor blood group" /codon_start=1 /product="alpha(1,2)fucosyltransferase" /db_xref="PID:g687619" /translation="MLVVQMPFSFPMAHFILFVFTVSTIFHVQQRLAKIQAMWELPVQ IPVLASTSKALGPSQLRGMWTINAIGRLGNQMGEYATLYALAKMNGRPAFIPAQMHST LAPIFRITLPVLHSATASRIPWQNYHLNDWMEEEYRHIPGEYVRFTGYPCSWTFYHHL RQEILQEFTLHDHVREEAQKFLRGLQVNGSRPGTFVGVHVRRGDYVHVMPKVWKGVVA DRRYLQQALDWFRARYSSLIFVVTSNGMAWCRENIDTSHGDVVFAGDGIEGSPAKDFA LLTQCNHTIMTIGTFGIWAAYLTGGDTIYLANYTLPDSPFLKIFKPEAAFLPEWTGIA ADLSPLLKH" BASE COUNT 505 a 606 c 552 g 452 t ORIGIN 1 ttcaccagcg ccccgggcct ccatctccca gctaacgtgt cccgttttcc tcccctgaca 61 gccatgctgg tcgttcagat gcctttctcc tttcccatgg cccacttcat cctctttgtc 121 tttacggttt ccactatatt tcacgttcag cagcggctag cgaagattca agccatgtgg 181 gagttaccgg tgcagatacc agtgctagcc tcaacatcaa aggcactggg acccagccag 241 ctcaggggga tgtggacgat caatgcaata ggccgcctgg ggaaccagat gggcgagtac 301 gccacactgt acgccctggc caagatgaac gggcggcccg ccttcatccc ggcccagatg 361 cacagcaccc tggcccccat cttcagaatc accctgccgg tgctgcacag cgccacggcc 421 agcaggatcc cctggcagaa ctaccacctg aacgactgga tggaggagga ataccgccac 481 atcccggggg agtacgtccg cttcaccggc tacccctgct cctggacctt ctaccaccac 541 ctccgccagg agatcctcca ggagttcacc ctgcacgacc acgtgcggga ggaggcccag 601 aagttcctgc ggggcctgca ggtgaacggg agccggccgg gcacctttgt aggggtccat 661 gttcgccgag gggactatgt ccatgtcatg ccaaaagtgt ggaagggggt ggtggccgac 721 cggcgatacc tacagcaggc cctggactgg ttccgagctc gctacagctc cctcatcttc 781 gtggtcacca gtaatggcat ggcctggtgt cgggagaaca ttgacacctc ccacggtgat 841 gtggtgtttg ctggcgatgg cattgagggc tcacctgcca aagattttgc tctactcaca 901 cagtgtaacc acaccatcat gaccattggg acgttcggga tctgggccgc atacctcacg 961 ggcggagaca ccatctacct ggccaattac accctccccg actccccttt cctcaaaatc 1021 tttaagccag aggcagcctt cctgccggag tggacaggga ttgccgcaga cctgtccccc 1081 ttactcaagc actaatgctg gcccgtcctt tgagaccttt tctccttctc tgcctccctc 1141 aagatgagtg cccgggcatg agaagcacat ggttccatga gcaggaccca tctctcttct 1201 gtgaagatgc gttgggctgc aagtaacaga aatctcagtg aacagtggcc tggcgtggtg 1261 gctcatgcct gtaatgctcg cactttggga ggccagggtg ggtggatcac ttgaggtcag 1321 gagttcaaga ctagcctggc caacatggtg aaaccccatc tcgactaaaa atacaaaaat 1381 tagccaggcg tggtggtgca cacttgtaat cccagctact cgggaggctg aggcaagaga 1441 atcacttgaa cccaggaggc ggaggttgca gtgagccaag atggtgccgc tgcactccag 1501 cctgggtgac acagcaagac tccatctcaa aaaaaaaaaa agaaaaagaa atgaacgggt 1561 tcaaagacca taatcatgca tatcacataa gaccagaagt ggcccaggtc cagggtcagt 1621 taatttagca gctccacaaa gtcatcagtc acctgagctc catccatctt cacatgctgt 1681 gctaccattt cttagctgta tcatcccatg gtcccaaaag ggctgctaca catccagcca 1741 tcacatgcag ataattcctt tcaaaaacag cagaaagagg ctcgttcttg tcttggtccc 1801 ttttgaagaa tgaatgaaac cttcctaagc cttccagcaa tttcccccca actccgatgg 1861 gtaggaattg tcacataccc atgtgacccg ataggaggca aaagaaatga gacttctggg 1921 attagtttag cctcagattc tgcagctgag aagttgatca gccacctctg aaggacatgc 1981 agcttgcaga aaattagggt ggtgttacca aggtgaaaag gggaaatggc tttagagtag 2041 acaacagaga tgccctgagg ggttgtgtag gttgttcact gcaggaagtc ccctggttaa 2101 gaaggcaagt ggggt // LOCUS HSU18548 1230 bp DNA PRI 08-MAR-1996 DEFINITION Human GPR12 G protein coupled-receptor gene, complete cds. ACCESSION U18548 NID g604499 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1230) AUTHORS Song,Z.H., Modi,W. and Bonner,T.I. TITLE Molecular cloning and chromosomal localization of human genes encoding three closely related G protein-coupled receptors JOURNAL Genomics 28 (2), 347-349 (1995) MEDLINE 96015070 REFERENCE 2 (bases 1 to 1230) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) Tom I. Bonner, Laboratory of Cell Biology, National Institute of Mental Health, Bldg 36, Room 3A-07, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..1230 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="h6-7 c1" /clone_lib="human genomic in pWE15 cosmid vector (Stratagene)" /chromosome="13" /map="13q12" exon 130..1230 /note="based on discontinuity in similarity to rat G protein coupled-receptor cDNA, GenBank Accession Number X61496, and mouse cDNA, GenBank Accession Number D21061, 5' of this point; another rat cDNA, GenBank Accession Number U12184, has similarity extending 5' of this point to base 1 as though it is not spliced" CDS 145..1149 /note="putative GPR12 G protein coupled-receptor, ligand unknown" /codon_start=1 /db_xref="PID:g604500" /translation="MNEDLKVNLSGLPRDYLDAAAAENISAAVSSRVPAVEPEPELVV NPWDIVLCTSGTLISCENAIVVLIIFHNPSLRAPMFLLIGSLALADLLAGIGLITNFV FAYLLQSEATKLVTIGLIVASFSASVCSLLAITVDRYLSLYYALTYHSERTVTFTYVM LVMLWGTSICLGLLPVMGWNCLRDESTCSVVRPLTKNNAAILSVSFLFMFALMLQLYI QICKIVMRHAHQIALQHHFLATSHYVTTRKGVSTLAIILGTFAACWMPFTLYSLIADY TYPSIYTYATLLPATYNSIINPVIYAFRNQEIQKALCLICCGCIPSSLAQRARSPSDV " BASE COUNT 234 a 379 c 284 g 333 t ORIGIN 1 aagcttgtgg catttggtac tggtatctga gcaggggctg gctttctgtt tgtctgtgtg 61 ttttttgcat gatcttggat tgtcaccctg ctgtatttaa acattaaaaa gcctgtcttt 121 tcgttgaaga ggacaggggt taaaatgaat gaagacctga aggtcaattt aagcgggctg 181 cctcgggatt atttagatgc cgctgctgcg gagaacatct cggctgctgt ctcctcccgg 241 gttcctgccg tagagccaga gcctgagctc gtagtcaacc cctgggacat tgtcttgtgt 301 acctcgggaa ccctcatctc ctgtgaaaat gccattgtgg tccttatcat cttccacaac 361 cccagcctgc gagcacccat gttcctgcta ataggcagcc tggctcttgc agacctgctg 421 gccggcattg gactcatcac caattttgtt tttgcctacc tgcttcagtc agaagccacc 481 aagctggtca cgatcggcct cattgtcgcc tctttctctg cctctgtctg cagcttgctg 541 gctatcactg ttgaccgcta cctctcactg tactacgctc tgacgtacca ttcggagagg 601 acggtcacgt ttacctatgt catgctcgtc atgctctggg ggacctccat ctgcctgggg 661 ctgctgcccg tcatgggctg gaactgcctc cgagacgagt ccacctgcag cgtggtcaga 721 ccgctcacca agaacaacgc ggccatcctc tcggtgtcct tcctcttcat gtttgcgctc 781 atgcttcagc tctacatcca gatctgtaag attgtgatga ggcacgccca tcagatagcc 841 ctgcagcacc acttcctggc cacgtcgcac tatgtgacca cccggaaagg ggtctccacc 901 ctggctatca tcctggggac gtttgctgct tgctggatgc ctttcaccct ctattccttg 961 atagcggatt acacctaccc ctccatctat acctacgcca ccctcctgcc cgccacctac 1021 aattccatca tcaaccctgt catatatgct ttcagaaacc aagagatcca gaaagcgctc 1081 tgtctcattt gctgcggctg catcccgtcc agtctcgccc agagagcgcg ctcgcccagt 1141 gatgtgtagc acccttgcac ccaggaggac tctgcattta ccaagcactt ccactgcctg 1201 gccaaggttt gagatgcttc ccttgaattc // LOCUS HSU18549 2699 bp DNA PRI 08-MAR-1996 DEFINITION Human GPR6 G protein-coupled receptor gene, complete cds. ACCESSION U18549 NID g604501 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2699) AUTHORS Song,Z.H., Modi,W. and Bonner,T.I. TITLE Molecular cloning and chromosomal localization of human genes encoding three closely related G protein-coupled receptors JOURNAL Genomics 28 (2), 347-349 (1995) MEDLINE 96015070 REFERENCE 2 (bases 1 to 2699) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) Tom I. Bonner, Laboratory of Cell Biology, National Institute of Mental Health, Bldg 36, Room 3A-07, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..2699 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HCN3A c1" /clone_lib="human genomic in pWE15 cosmid vector (Stratagene)" /chromosome="6" /map="6q21" exon 1..66 /note="similar to part of 5'UTR of rat G protein-coupled receptor cDNA, GenBank Accession Number U12006" exon 694..2307 CDS 712..1800 /note="putative GPR6 G protein-coupled receptor, ligand unknown" /codon_start=1 /db_xref="PID:g604502" /translation="MNASAASLNDSQVVVVAAEGAAAAATAAGGPDTGEWGPPAAAAL GAGGGANGSLELSSQLSAGPPGLLLPAVNPWDVLLCVSGTVIAGENALVVALIASTPA LRTPMFVLVGSLATADLLAGCGLILHFVFQYLVPSETVSLLTVGFLVASFAASVSSLL AITVDRYLSLYNALTYYSRRTLLGVHLLLAATWTVSLGLGLLPVLGWNCLAERAACSV VRPLARSHVALLSAAFFMVFGIMLHLYVRICQVVWRHAHQIALQQHCLAPPHLAATRK GVGTLAVVLGTFGASWLPFAIYCVVGSHEDPAVYTYATLLPATYNSMINPIIYAFRNQ EIQRALWLLLCGCFQSKVPFRSRSPSEV" polyA_signal 2296..2301 polyA_site 2308 /note="by similarity with rat cDNA, GenBank Accession Number U12006" BASE COUNT 528 a 770 c 731 g 670 t ORIGIN 1 gagctcagcc cggcgccccg cctcggcgcc catgaccagc gacttccaga agtcctgacg 61 ccccaggtga gtggccgagt tcccagggaa ctttgaggat gtggggagag gaggggagaa 121 agatatccag gggacaggtc tttccagaag aggagaaatg gagccaacct tgactcccac 181 cctctgctgc ccccatacac acaccctagt ctctcctccc agaccccctt cgcaggggtc 241 tctgggctcc aggactctcc aggcttcgca atgccagaaa cttcgctttc agcaccgcgg 301 acagcgtctc tgctgcccag ccccatgggg atgtcctggt cgcgccgcca caacccaggc 361 gggactctcc caggatgaca ctcctagctt ggtgcactcg ggtgcgtgtg caaatacagg 421 cagccttgga gaattgatcc ttgtgcatgt gagtgcatgt tgtcctggtg tatatctgca 481 aaaactcaga agtccgcgtg ggcatttcac ttactcactc aacaggtatt tattttgggt 541 tcgcacttaa cccatgattc cccgatttgc tctggggtcc cagggtgggg tgggcaactc 601 aagtggggca atgcgagagg aggctctacg cgtggggaag tttcctggca gcctgaagtg 661 tacacctgac gcctgcactc cctccctatg cagggtgcaa atccggccgc gatgaacgcg 721 agcgccgcct cgctcaacga ctcccaggtg gtggtagtgg cggccgaagg agcggcggcg 781 gcggccacag cagcaggggg gccggacacg ggcgaatggg gaccccctgc tgcggcggct 841 ctaggagccg gcggcggagc taatgggtct ctggagctgt cctcgcagct gtcggctggg 901 ccaccgggac tcctgctgcc agcggtgaat ccgtgggacg tgctcctgtg cgtgtcgggg 961 acagtgatcg ctggagaaaa cgcgctggtg gtggcgctca tcgcgtccac tccggcgctg 1021 cgcacgccca tgttcgtgct ggtaggcagc ctggccaccg ctgacctgtt ggcgggctgt 1081 ggcctcatct tgcactttgt gttccagtac ttggtgccct cggagactgt gagtctgctc 1141 acggtgggct tcctcgtggc ctccttcgcc gcctctgtca gcagcctgct ggccattacg 1201 gtggaccgct acctgtccct gtataacgcg ctcacctatt actcgcgccg gaccctgttg 1261 ggcgtgcacc tcctgcttgc cgccacttgg accgtgtccc taggcctggg gctgctgccc 1321 gtgctgggct ggaactgcct ggcagagcgc gccgcctgca gcgtggtgcg cccgctggcg 1381 cgcagccacg tggctctgct ctccgccgcc ttcttcatgg tcttcggcat catgctgcac 1441 ctgtacgtgc gcatctgcca ggtggtctgg cgccacgcgc accagatcgc gctgcagcag 1501 cactgcctgg cgccacccca tctcgctgcc accagaaagg gtgtgggtac actggctgtg 1561 gtgctgggca ctttcggcgc cagctggctg cccttcgcca tctattgcgt ggtgggcagc 1621 catgaggacc cggcggtcta cacttacgcc accctgctgc ccgccaccta caactccatg 1681 atcaatccca tcatctatgc cttccgcaac caggagatcc agcgcgccct gtggctcctg 1741 ctctgtggct gtttccagtc caaagtgccc tttcgttcca ggtctcccag cgaggtctga 1801 agggctcgcc ccgtgtcctc tcaccaacac cacaccccaa caagccagcc tttggtaagc 1861 tcggtgcctg ctgacgaact ctgagatccc aatggtgtga gtctgacttt ggaaagaaaa 1921 agggactaaa gagaaatgta acaaacttac aaggacaaag aggcttgttg gcactttaca 1981 tatacagtgt atacatgtgt acatatatat acaaatattt gtatcttctg gaggtgttca 2041 ggatgtggag cttcctgttc tgtgaaaaac caagaaaaag atatggttgt atactcaaat 2101 tgtacatcac gtttgtcaaa cgaagacatt ccaatactgc ttaattatag cactttattt 2161 ttagctgctg aactgccaaa acagtgttgc cattttcaag ggcagggaaa agggagtaaa 2221 aggtgtattt ttgtcgtatg tgatagaata ttttgctgca catgcatcaa caaattacaa 2281 catgttttgt acacgaataa acccattaca agaatgtaat ttggggtatg tcactgacta 2341 cagaattaca attagctgaa ttgtaagtgt atgagtgtct ttctttcctt tctttcttcc 2401 tttctttctt tcttgcttgc ttgctttctc ttgctttctt tctttctctt tcgtttgttc 2461 gagatagagt ctcactctgt cgcccaggcc tgaatgcagt ggcacaatca tagctcagtg 2521 cagccttgaa ctcctgggcc caagaaatcc tgctttagca tccctagtag ctgggactac 2581 aggcatgcca cagcactcac cttactttat ttattttttt aagtttttaa aatttcagta 2641 gttttgaggg tacaggtgcc ttttttggtt acatagatga gttctttagt agtgaattc // LOCUS HSU19107 3965 bp DNA PRI 02-OCT-1995 DEFINITION Human ZNF127 (ZNF127) gene, complete cds. ACCESSION U19107 NID g1001958 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3965) AUTHORS Jong,M.T., Carey,A.H., Glenn,C.C., Saitoh,S., Stewart,C.L., Rinchik,E.M., Driscoll,D.J. and Nicholls,R.D. TITLE A novel imprinted zinc-finger gene and overlapping antisense transcript identify multiple candidate genes for Prader-Willi syndrome and a mouse genetic model JOURNAL Unpublished REFERENCE 2 (bases 1 to 3965) AUTHORS Jong,M.T. TITLE Direct Submission JOURNAL Submitted (25-DEC-1994) Michelle T. Jong, Division of Pediatric Genetics, College of Medicine, University of Florida, 1600 S.W. Archer Road, Gainesville, FL 32610-0296, USA FEATURES Location/Qualifiers source 1..3965 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15q11-q13" /chromosome="15" promoter 1..968 5'UTR 969..1077 mRNA 969..>3770 gene 1078..2601 /gene="ZNF127" CDS 1078..2601 /gene="ZNF127" /codon_start=1 /product="ZNF127" /db_xref="PID:g1001959" /translation="MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAP DSALPHAARGWAPFPVAPVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICR YYIHGQCKEGENCRYSHDLSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAP PAASSLSLPVIGSAAERGFFEAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVA SAPEAPLQSSETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHP MDAAQREEHMRACIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSN CNHSFCIRCIRRWRSARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYK EAMSNKACRYFAEGRGNCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWHQLVEPVR MGEGNMLYKSIKKELVVLRLASLLFKRFLSLRDELPFSEDQWDLLHYELEEYFNLIL" polyA_signal 3765..3770 BASE COUNT 1168 a 800 c 1007 g 990 t ORIGIN 1 gaattcagta tgcagtcatg atacgtactc tcagcaaagt aggaatacaa aaaaatgatt 61 aatctactac atttatctgc aaaaatgaca aaacgctcat acatggaaat taaagaccat 121 tcctttgtgg ataatatggc acaggatatg ttatgaaagg ctgtcctgcc acaacacagc 181 acagatgggg ttgaaagaaa ggaagggaaa caaagggctg ccatgaatta aaccagtaga 241 gaaaataaaa gcctgggtca tgcaggggac agtgtcttat taggccagca tcaggactgg 301 ttgaagtctg gcgcttctaa cacagtggga aaatgagcat agtaatcaca ataaagaaag 361 acatctgaaa ggtgctaact tagcctcgga tcagaagtgc tctctgctaa tgccttgctg 421 gtgaaaagcg tatccagtgt aagccctcag aaatcccaga aaacaaagac caaaagaaac 481 ggcgccttgt caaccgaacg aattgaaaaa agcttcctgc cgcgatcggg cattaaaaga 541 aaaaaaccac agacataaga tggagtgaat aaaaaaagaa tttatataat tcatggatga 601 aatgtttcag aatctatagg attatatttt tttttaaggc agacagatac gaaaatacaa 661 cgagcgtgca tgaccgaaac cagaagagat taaagtaaaa cctcattctc ctgaggaaat 721 cgtgtgagaa gggacttagg gactgccgga acacagcgaa cgagaggcag aaggcagaca 781 aaaggggctg ggttggtccc cgcctgtgtg aacgaaaaaa tatgtcagat tggaaaattg 841 cggtaaaaac caggcagagc acgtacgttg cccccacagg aagtgtccgc catgctgcct 901 gtgcccggaa gtggtaggaa cacacagtca gagggaccca aaagcagggg ggaaggaaaa 961 agagatgcac acttccccca gagaagcctc cgagcgcggc cgccattccg ggcctcaagc 1021 ccataaagaa aaaataccgg agaggttctg gcaccatttc ggggtgccaa agcagccatg 1081 gaagagcctg cagctccctc agaagcccac gaggcagccg gggcccaggc aggtgctgag 1141 gcagcaaggg agggtgtgtc tgggccggac cttcccgtct gtgagccctc cggggaatct 1201 gctgctccag attcagccct gccacatgcg gcaaggggct gggccccctt ccctgtagct 1261 ccagtccctg cccacctccg cagaggaggc ctgaggcctg ccccagcctc aggaggagga 1321 gcctggccca gtccgttgcc aagccgaagc agcggcattt ggacaaagca gatcatctgc 1381 aggtattata tacatgggca gtgcaaggag ggggagaact gtcgctattc gcacgacctt 1441 tctggtcgga agatggccac tgagggtggc gtttcgccgc ctggggcctc tgcaggtgga 1501 ggccctagca cggctgcgca catcgagccc ccgactcagg aagtggcgga agcccccccg 1561 gctgcatcct ccctttcctt gcctgtgatt ggctcggctg ctgaaagggg tttctttgaa 1621 gccgagagag acaatgcaga ccgtggagct gctggaggag caggtgtaga aagctgggcg 1681 gatgccattg agtttgttcc agggcagccc taccggggcc gctgggttgc atctgccccc 1741 gaggctcctc tacagagctc agagactgag aggaagcaga tggctgtggg cagtgggttg 1801 cggttttgct attatgcttc caggggagtt tgctttcgtg gggagagctg tatgtacctc 1861 catggagaca tatgcgacat gtgtgggctg cagaccttgc accccatgga tgctgcccag 1921 agggaagaac atatgagggc ctgcattgaa gcacacgaga aagatatgga actctcgttt 1981 gctgtgcagc gtggtatgga caaggtgtgt ggcatctgca tggaggttgt ctatgagaag 2041 gccaacccca atgaccgccg ctttggcatt ctttccaatt gcaaccattc cttctgtatt 2101 aggtgtatcc gcaggtggag aagtgccaga cagtttgaga acaggatcgt caagtcttgc 2161 ccacagtgca gggtcacctc tgaattggtc attcccagtg agttctgggt ggaggaggag 2221 gaagagaagc agaaacttat tcagcaatac aaggaggcaa tgagcaacaa ggcctgcagg 2281 tattttgcgg aaggcagggg taactgccca tttggagaca catgctttta caagcatgaa 2341 taccctgagg gctggggaga tgagcctcct gggccaggtg gtgggtcatt cagcgcatac 2401 tggcatcaac ttgtggagcc tgtgcgaatg ggagagggca acatgctcta taaaagcatt 2461 aagaaggagc ttgtcgtgct tcggctggcc agtctgttgt ttaagcggtt tctttcactg 2521 agagatgagt tacccttctc tgaggaccag tgggacttgc ttcattatga gctggaagaa 2581 tatttcaatt tgattctgta gcatcgtgct gtggcatgtg gtctagtctg ctgaggttct 2641 gtcgtctgct attgcctgtt ttccctgtgt tgacactctt actgctttca ggggctgttg 2701 aggcagtgct tctgttttct tgtctattct gcatatcttt ccccctagga ttatggtgat 2761 tatctgtgtt aaaaaataag tccttaaagt tactgttttg gtgaaattaa tattaatgtc 2821 agcttatggc ttttttttgt catctctgtt gtcaacagga ttaactcagt tctagtgtag 2881 tgtttactga atttccacac ttattttgaa gaccctcaag agtaaatgtg gcagagtgaa 2941 aggagaagtt ttaattgaac tagtagcttt gtgctataat agccttaaca aatggaccct 3001 tgcagggctt tgcagctgct catctgtttg tttacagttt gttctttccc tccttcccct 3061 tcaagtgcac ttgttaaact gtgatgaact tgtgattttg tgttttatct gaccaaaacc 3121 aagtgtatat gtttacatgt ttttatcctg tttagcttga catgaaataa tttatatttg 3181 gaaatatata tttaagaatt atatatataa aaatatatat ggtataagga ggttatggta 3241 tttgaaaaaa atatataaaa gaatatacat cacaatataa tatttatgtt tatgtaataa 3301 agtaaataca gagctgaaag ctgaaggtca aagcctaaca ggactggctg ttgtgtggat 3361 gtgagttgtg tgaataatct ttctgtccct cgcacagaag ccagtaatta gcatctaatg 3421 aaaaggactg ttcaagtggg tctggccaaa tgtgacagat gcagatctta gaggacttac 3481 aaagcactat attggtaatt cttacaatgg cattagtagc ttactctata aatacagaga 3541 tggttttcct atgcagttta gccaccttct cattaattct ttgtaacagc aaatcctagg 3601 ctcagaggca cagtgctttg tatttgatat acaaagtctc tagactttcc caacaagggg 3661 cttttgacaa aagagttcaa cataaaagta acaagattat aaaagggaaa atagaacaaa 3721 aaaatattaa aaccaatgag aataaaagaa tggaaagata aaataataaa atatagaaag 3781 cactaaaaac taagtgtaag agtaatatag aaaatttaca tacaacttaa ttagattaaa 3841 tgaaagcagt ttaaagactg aaagtaaagg ctataagact tcattacaat cagaatgaaa 3901 catcctgttt aaaatacaca caacatgaaa atacagaaac actcaaagta gtaggatggg 3961 aaaag // LOCUS HSU20734 8628 bp DNA PRI 29-AUG-1995 DEFINITION Human transcription factor junB (junB) gene, 5' region and complete cds. ACCESSION U20734 NID g965002 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8628) AUTHORS Phinney,D.G., Tseng,S. and Ryder,K. TITLE Complex Genetic Organization of junB: Multiple Blocks of Flanking Evolutionary Conserved Sequence at the Murine and Human junB Loci JOURNAL Genomics (1995) In press REFERENCE 2 (bases 1 to 8628) AUTHORS Ryder,K. TITLE Direct Submission JOURNAL Submitted (07-FEB-1995) Kevin Ryder, Division of Basic Science, Fox Chase Cancer Center, 7701 Burholme Ave., Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..8628 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="placenta" /clone_lib="lamda phage library" misc_feature 1..8628 /note="FECS" misc_feature 201..247 /note="FECS VIII; flanking evolutionary conserved sequence; conserved between mouse and human junB" misc_feature 585..841 /note="FECS VII; flanking evolutionary conserved sequence; conserved between mouse and human junB" misc_feature 1056..1181 /note="FECS VI; flanking evolutionary conserved sequence; conserved between mouse and human junB" misc_feature 2609..2734 /note="FECS V; flanking evolutionary conserved sequence; conserved between mouse and human junB" misc_feature 2936..3234 /note="FECS IIII; flanking evolutionary conserved sequence; conserved between mouse and human junB" misc_feature 3920..4240 /note="FECS III; flanking evolutionary conserved sequence; conserved between mouse and human junB" misc_feature 4444..4646 /note="FECS II; flanking evolutionary conserved sequence; conserved between mouse and human junB" misc_feature 5501..5714 /note="FECS I; flanking evolutionary conserved sequence; conserved between mouse and human junB" TATA_signal 5681..5685 mRNA 5708 gene 5998..7041 /gene="junB" CDS 5998..7041 /gene="junB" /codon_start=1 /product="transcription factor junB" /db_xref="PID:g710346" /translation="MCTKMEQPFYHDDSYTATGYGRAPGGLSLHDYKLLKPSLAVNLA DPYRSLKAPGARGPGPEGGGGGSYFSGQGSDTGASLKLASSELERLIVPNSNGVITTT PTPPGQYFYPRGGGSGGGAGGAGGGVTEEQEGFADGFVKALDDLHKMNHVTPPNVSLG ATGGPPAGPGGVYAGPEPPPVYTNLSSYSPASASSGGAGAAVGTGSSYPTTTISYLPH APPFAGGHPAQLGLGRGASTFKEEPQTVPEARSRDATPPVSPINMEDQERIKVERKRL RNRLAATKCRKRKLERIARLEDKVKTLKAENAGLSSTAGLLREQVAQLKQKVMTHVSN GCQLLLGVKGHAF" misc_feature 7830..8324 /note="FECS IX; flanking evolutionary conserved sequence; conserved between mouse and human junB" BASE COUNT 1668 a 2702 c 2553 g 1698 t 7 others ORIGIN 1 ctacacagat agccctacat agacctttgc atccaggctg gcacagaagc ggcagaggct 61 actcctgtct ctcaggcctg ggtgtcaggt tggaccagca cacactctca catgcaccct 121 ataggccagt cacacgcttg ccccctggac cttgccaagg ttctgtgggg gcactgacac 181 cccagtctgt ctctgcccca cagagatcct ggcacttcct ctctccaaac atcaccccca 241 cttccctccc tctttgcttt tgccacttcc cctaggagcc tagctcacag cctctgctcc 301 tgctggggtg ggggcctgct ataaagaggt cacccagggt cttgtgatgc ccagtccccc 361 aaggcttcac cagcagtggc agggatggaa gcagcttcct gagaaactcc cctcatgccc 421 aaccatgtca gcctggagag agagaaagag agagactctc cttcctgctt ccaagaaaac 481 ctagcagctt ctctccagga gaggggcaaa ggacccccaa acacagaggt gggagaggga 541 ggcggctggt aggcagagtt cctgggggac tgccgggcct cagatccaca cccaggggca 601 gggacacggg agggaggctg ggtgctgtgt gctgcgattt gtgggttctc gccctggcct 661 gcccactgct gactcacagg cagcccccag cccagtttcc tagtgagggc cccactggtc 721 agtggggcca gcagggaggg gggtcaccgg gctatttata agaaggagaa gtccctcgtg 781 ccaagaccaa aagccccaag acacttgtgg gtgggggtgg gtgaacttcc tccttatcct 841 gtgccctagg ccccaggccc tggggcactg gggaacctgc gcagggactt cttagccagg 901 tgctggtctg gccagggcct ggcagtaccc agctgccagt ctggacgctc aggccagttg 961 ggggaggggg ctgttctcca ccctcctcgt ggggccggcc tggccagtgg ggcggcctgg 1021 agggggtggt ccaggcacaa gggtgacact gcagaggcct ggaaccgggt ataccagctt 1081 ggcgtnggnc ccccgccctc cctgtctggc ccaatctcca ccccgtgctc aaacacccat 1141 gcagagtgtt tgaggaaccc tactcccaca tgctggcctg tccacagaga ccgtgcccct 1201 ctgtctgtgc cccaccacat ttctccccca attaaaggag acctcagtgg ggtgcggtgg 1261 ctcacacctg taatcccaga ctttgggagc aggtggagga ttagggcagg acttccagac 1321 cagcctaggc aacagaatga gacctgtctc aataaatata aaaaaataaa acttagcagt 1381 tgtggtgtgc ttgtagtccc agttactctg gaggctgagg caggaggatc gcttttaccc 1441 aggagttcag ggctgcagcg agctatgatt gtgtagctga actctagcct gagcaacaga 1501 gtgagatggt cttgttttgt ggccaggctg gagtccagtg tcaagatcat ggtcatgtga 1561 tcctcctggc tcagcttcgg agtagtggaa ctacaggaca ccctcctgac ctcagataat 1621 ttttaaattt ttttgtagag atgggggtct tgctgtgttc cccaggctgg tctcaaaaat 1681 ttctgggtag ggctgggcga ggtagatcat gcctgtaatc ccagactttg ggaggtgagg 1741 gggcagataa cttcaggtca ggagttcgag accggttggc aacatggtga aaccccatct 1801 ctactaaaaa tacaaaaaat tagctggggt ggtgggatgg cctgtaatcc caactattcg 1861 ggagggtgag gagaagaata ggttgacctg ggaggtggag gttgtagtga ctaatatttc 1921 cccttcactc caccctggcg acagatgata ttttgtctcc accacccccc caacaataaa 1981 ctcctgggct caagtgatcc ctccccctcg ccctcccaag tctgagatta tagacgtgac 2041 cccctgtgtt tgccaaagtg agttctcttt aaaaaacaaa aaaaggccgg gcatgtgccc 2101 acgcctgtaa tctcagtact ttgggagact caggcggtga tcacaaggtc aggagttcga 2161 gaccagctga ccaatatggt gaaatcccgt ctgtactaaa aacaaaaatt agctgggcat 2221 ggtggcatgc gcctgtaatc ccacctactc aggagtctga gcaggagaat cgttgaaccc 2281 aggaggcgga ggttcagtga ccaacatcat gccattgcac tccagcctgg cgacagagca 2341 agactccatc tccaaaaaac aaacaacaac aaacaacaaa acacccaaaa aaagagccct 2401 tcatgagacg ccctgcctgc agctggtact gcatacatgc tctctggttg tcattattca 2461 caacacagtg gctaatgctc tgaagaccag acagcttgga ttatatctca actctagtga 2521 cttctagctg ttgacaccag acaagttcct taatctcctg tggctcagtt tctcatctgt 2581 tacatgggga tacaaataac atgcattttg ctggattgtt ggaagcagtc gatgacttat 2641 tttatgttaa ccacttagaa gagtgcctag cccacagtaa gagccatgta agtgttggct 2701 attaaaataa taattattat ttctcagcct atctatctta agtgtttctg agtcactgcg 2761 tgacactaac cttgaaccca tttgtctctc tgggcatctc tccctgccct gcgcccctgc 2821 ccccatacac ccactgttcc ttgccctcct cctgggatgg actctgtctg ggatggcctc 2881 acctctctct aggactccct tgcatggcca gccagcctgg ccagggctga attactgtgg 2941 cctcctggct tttgggacct gcaagttagc ttcccaaggt gctgccctct ggcctgccct 3001 gccctgccca accccccaca cgctgccccg tgcgccccgc ggggcccagc cagatgtcag 3061 ctgcagttat tagcctgggg gggagacaca ggcctggcag tgtgcctgcc agactcacag 3121 gctccggaat ctcccttgag cctggggagg gggaggggag ccattcaagc aaggggaaga 3181 agggaggggg catttagggg tcacagggac cctccaagga cgggacgccc ccagcttctg 3241 ctgggacctt cctccagcaa ggccactgca ctgctggact tccccctgcc cccacatagc 3301 tcagtttctt tttcgcctct ttcgttttcc ttctctttcc cacacttggt gagatctagg 3361 ctgccccgga atggctgttt aacccctatc ccttagcctc tttcttaccc taaagaagag 3421 agaggctctc ttcatcctcc agaagacaga gctgagctga gagagagaga gagagacaaa 3481 gagagagaga gagagacaga gacagagaca gagacactag gcttctcacc tcttggtcta 3541 gtgggttcac acactaactg ctcggaagtc ccaccttaag tctaactgaa gcctgaccgt 3601 ctgcacctcg ccgcagcctt cccgggaagg ggctctggtt ccctcatatt ctccccttag 3661 attttcaggt cttctctgac tcggggagga agctctcaga tgacagagct actctaggga 3721 aggagggggg ctccctctag gtattctgga tgatagtagg atgggacctg cagactaaaa 3781 cgagagtggg gcccctgact cacatgagtg agtggggggt gttggggacg gagctccaga 3841 tctcctggca gaagggaagg caccccaccc gagggaggag cagccgccgt tccagcccgc 3901 cgcagggggc gggcatcggc ccgccctctc cgcgccccgt cggccggggg cgaagtccga 3961 cccaggcccg cgggcaggcg ggacctgcgg gctgggttgg gccaggccgg gctcgaggac 4021 gcagccccaa agcggcgggg tggggcgggg ccgggccagg cggcgtgggg gcccgggggc 4081 ggcggtcggg gctggtcggg actagcagtc tgggggccca ggcgcgccct gggacctggc 4141 agatagtcgg gttccccgct tctggctcgg ggccccagga cagcgctggt cccacctctt 4201 cctgtgccct aatatgggcg gcggggagag acttcggggg ccggcccgcg ctcagtcctg 4261 cccgcggact cccgctgttg ccatggcgac caggccccgt cctggtggtc cctcggccaa 4321 tggaacactc ccaagccgcc tgcctccgcg gtcccctgca ggcaccgcgg gcggaaccat 4381 tgcttgaggg gagaggatgg tggctggagg ctgccgcggc tgggcgggct ccaagcaccc 4441 gggcctccgc gggggctcgc acgcccaggt tcctcttccg aggcgcagga tgcgacggga 4501 cccagggcgg ggcgcacggt cccgtagatc cgagtgacgg agacagggcc gggctccttc 4561 ccccggggct gttgccacac ttcctgcctc ttctttcccc agcctgtttc taaggaaggg 4621 agtggggttg ggcgaccgca cccagccgtc gggctccagg ctctaggagc gacacggtgt 4681 tgggcaaagc agggtcaata ggggagggtt ttaggatggg ggacagagaa tacagatgac 4741 taagaggtta ccatcgaggg ggagcagcag tcgtggaaga tccagcagtc ctggtgcgcg 4801 ggaccctcaa ggccccctcc tcacatgtca actgagtacc tcttattgtc ttctctgctc 4861 cgaagatgtg tcggcccctt ctaactctct ccttcatccg ggctagcaga tgaccccagt 4921 ggtccccaat ttctggcaga catgtctcca tcttctacct ggcatatttt acctgcctca 4981 gtgtacccca ggccgcttac tagctttctg catatctaga cttcccctaa tgcctccttc 5041 ccgcttacgg agagcctcag actctggact cagctcccat gagctcctgg acccctactc 5101 atttcttgca atttaatggg tcatgcagct ccacccactc accccttttg atctctcccc 5161 tcctccgtcc tgtgaaaatt ccagtcccgc atccttctga gcccgggacc cccagtcaat 5221 tcctgggtca ggtgtctcct taaccctccc gatttacagt gcttaaccct catttctgct 5281 ttttggggtc tccaatggat tgtcagtcct cctacccctc tcgtatctgg gtacctcagg 5341 ggtttcttcg cacatactgg gaccctcacc ccacttgctg cgtaccaggt cctggtattt 5401 gtccagtgga ctccagggaa atcatcctcc tcctgaaacc cctcactcat gtgcctgggc 5461 cccccagcac ctccttccat gcgtaccccg aggtcctttg agcccctccc cctgcagccc 5521 cgccgagcca ccgccccgtg gccgctgttt acaaggacac gcgcttcctg acagtgacgc 5581 gagccgcctc ctccccttcc ccacgctcga ggaggggggc gcgggggccc ggctccggcg 5641 acggccaatc ggagcgcact tccgtggctg actagcgcgg tataaaggcg tgtggctcag 5701 gctgagcggc tgggaccttg agagcggcca ggccagcctc ggagccagca gggagctggg 5761 agctggggga aacgacgcca ggaaagctat cgcgccagag agggcgacgg gggctcggga 5821 agcctgacag ggcttttgcg cacagctgcc ggctggctgc tacccgcccg cgccagcccc 5881 cgagaacgcg cgaccaggca cccagtccgg tcaccgcagc ggagagctcg ccgctcgctg 5941 cagcgaggcc cggagcggcc ccgcagggac cctccccaga ccgcctgggc cgcccggatg 6001 tgcactaaaa tggaacagcc cttctaccac gacgactcat acacagctac gggatacggc 6061 cgggcccctg gtggcctctc tctacacgac tacaaactcc tgaaaccgag cctggcggtc 6121 aacctggccg acccctaccg gagtctcaaa gcgcctgggg ctcgcggacc cggcccagag 6181 ggcggcggtg gcggcagcta cttttctggt cagggctcgg acaccggcgc gtctctcaag 6241 ctcgcctctt cggagctgga acgcctgatt gtccccaaca gcaacggcgt gatcacgacg 6301 acgcctacac ccccgggaca gtacttttac ccccgcgggg gtggcagcgg tggaggtgca 6361 gggggcgcag ggggcggcgt caccgaggag caggagggct tcgccgacgg ctttgtcaaa 6421 gccctggacg atctgcacaa gatgaaccac gtgacacccc ccaacgtgtc cctgggcgct 6481 accggggggc ccccggctgg gcccgggggc gtctacgccg gcccggagcc acctcccgtt 6541 tacaccaacc tcagcagcta ctccccagcc tctgcgtcct cgggaggcgc cggggctgcc 6601 gtcgggaccg ggagctcgta cccgacgacc accatcagct acctcccaca cgcgccgccc 6661 ttcgccggtg gccacccggc gcagctgggc ttgggccgcg gcgcctccac cttcaaggag 6721 gaaccgcaga ccgtgccgga ggcgcgcagc cgggacgcca cgccgccggt gtcccccatc 6781 aacatggaag accaagagcg catcaaagtg gagcgcaagc ggctgcggaa ccggctggcg 6841 gccaccaagt gccggaagcg gaagctggag cgcatcgcgc gcctggagga caaggtgaag 6901 acgctcaagg ccgagaacgc ggggctgtcg agtaccgccg gcctcctccg ggagcaggtg 6961 gcccagctca aacagaaggt catgacccac gtcagcaacg gctgtcagct gctgcttggg 7021 gtcaagggac acgccttctg aacgtcccct gcccctttac ggacaccccc tcgcttggac 7081 ggctgggcac acgcctccca ctggggtcca gggagcaggc ggtgggcacc caccctggga 7141 cctaggggcg ccgcaaacca cactggactc cggccctcct accctgcgcc cagtccttcc 7201 acctcgacgt ttacaagccc ccccttccac ttttttttgt atgttttttt tctgctggaa 7261 acagactcga ttcatattga atataatata tttgtgtatt taacagggag gggaagaggg 7321 ggcgatcgcg gcggagctgg ccccgccgcc tggtactcaa gcccgcgggg acattgggaa 7381 ggggaccccc gccccctgcc ctcccctctc tgcaccgtac tgtggaaaag aaacacgcac 7441 ttagtctcta aagagtttat tttaagacgt gtttgtgttt gtgtgtgttt gttcttttta 7501 ttgaatctat ttaagtaaaa aaaaaattgg ttctttatta atttctgttg tctttttttc 7561 caagctggga gggcgggggg aaaaaaaaag cactggtttg cccccagctc agtgctgttg 7621 gtggctcggt cctgtatgtg tccccctcgt cggttcggcg caggcatctt gtggtcccag 7681 cccaggagtc ccacccttcc cgcgtcccca gatctccagg gttggatggt tggggcgcgg 7741 gacgcggtcc agggacccag gagctgaagg cagggtgctc cggccgagac ttggagtgcg 7801 caggcgcgtc cccgcccagc cgcgccgccg gggctttccc cgctgacgca gcggaagcgc 7861 tgcccataca aggaccgatt ctgcccagtg acgcgaccgc ggtctctggg cagattccgg 7921 gaatcccctc ccccgctctg ccgggcaggg ccgcggcacc gggaaggggg ccgggatttt 7981 cccgggcagg cgactgcccg ggcacggaaa gccccaggcc tgggtccaga gcncccgcgg 8041 tgggcgggca gcgctcctgg gctccgggag ccacccctgt gccaccttca agacactggc 8101 gcctaggccc cgccgcgccc acgccccctc cgtctgacct gaccggggcg gagggttgct 8161 gctgcctccg ctgctttggc gacgccgtcc cctctacccc caccccctcc aggaagggcg 8221 gcgcccgcct gccagtttcc cctcacgggt gccaggccag gagcggggcg actgncaggt 8281 ctggtcctgc cggggcggnt gctcgggccg ctgggcgccg ncaccgncct cgcgctgggt 8341 ccctcccgcg cagctgggac acgtggaggt gggggtgggg gtgctgtgct gctggccgcc 8401 tagagggagt ctggcggttc ggggttggaa gcccccagag cagtggagaa aaacaggtag 8461 agaaacagac aggcagtagt tatggaacag acacttcatc aattcatttg tacaatacct 8521 aggaacaact agaccctgga gacacagctg gaccactcca gacaagacaa taaataaaca 8581 agcaaacatg gttctttggg ggtggtggta agtgctagta agaggaaa // LOCUS HSU21051 2932 bp DNA PRI 26-APR-1996 DEFINITION Human G protein-coupled receptor (GPR4) gene, complete cds. ACCESSION U21051 NID g687793 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2932) AUTHORS Mahadevan,M.S., Baird,S., Bailly,J.E., Shutler,G.G., Sabourin,L.A., Tsilfidis,C., Neville,C.E., Narang,M. and Korneluk,R.G. TITLE Isolation of a novel G protein-coupled receptor (GPR4) localized to chromosome 19q13.3 JOURNAL Genomics 30 (1), 84-88 (1995) MEDLINE 96129306 REFERENCE 2 (bases 1 to 2932) AUTHORS Baird,S. TITLE Direct Submission JOURNAL Submitted (14-FEB-1995) Stephen Baird, Molecular Genetics, Children's Hospital of Eastern Ontario, 401 Smyth Rd., Ottawa, Ontario, K1H 8L1, Canada FEATURES Location/Qualifiers source 1..2932 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" mRNA 237..2932 mRNA 237..2052 gene 830..1918 /gene="GPR4" CDS 830..1918 /gene="GPR4" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g687794" /translation="MGNHTWEGCHVDSRVDHLFPPSLYIFVIGVGLPTNCLALWAAYR QVQQRNELGVYLMNLSIADLLYICTLPLWVDYFLHHDNWIHGPGSCKLFGFIFYTNIY ISIAFLCCISVDRYLAVAHPLRFARLRRVKTAVAVSSVVWATELGANSAPLFHDELFR DRYNHTFCFEKFPMEGWVAWMNLYRVFVGFLFPWALMLLSYRGILRAVRGSVSTERQE KAKIKRLALSLIAIVLVCFAPYHVLLLSRSAIYLGRPWDCGFEERVFSAYHSSLAFTS LNCVADPILYCLVNEGARSDVAKALHNLLRFLASDKPQEMANASLTLETPLTSKRNST AKAMTGSWAATPPSQGDQVQLKMLPPAQ" BASE COUNT 619 a 933 c 690 g 689 t 1 others ORIGIN 1 ctgcagtcag gcggtgaact gacttcatcc caatccctca gcccccacca ggaccagtct 61 ggagtccctc ccctgccccc antgaaattt cccttccgtc cccaaactta cctctgatct 121 agaccttact cacctccttc ctgtttccta agactccttc ctgccgtcca cagaccgagc 181 cttttatctt tgtccaccct gtgccagaca cctccttttc cagaaccttc tccttactgg 241 tgaccttact tatctctgtt gctttctggg gtcctaggaa atgccagcac tcccacccac 301 attgcctgaa ctttccaaca ctccctagct gcgctgtgtc ctatctcaac acttcctcat 361 gtatttcttg tgtcttctag aacattcccc cgccattatt acttcaatat ggctacacat 421 acttcctaat tgccctgcaa accatctcct tctcaccatt gcccagcgat gctttcgtct 481 cctccataaa cactcccgga gaccaatttt tgtgtcaccc ccatactccc tcgttgacac 541 actgactcca tacataacct ccttgaaaaa cctctttatt aatctcacca tcctccagac 601 ttccctcctg tcataattcc atccctcctc caacttttcc ctctcaagct ctgcccttcc 661 cagcccagcc cagcctaccc aacctcatct cttccctgta gaccacatcc caccatgttc 721 ccctgagcct ccaaggaagg ggctcagggg gccccatggc ctcccgctcc ctgtggcccc 781 acagcccccg tgggccaggg gaagcgcccc agaagccgaa gtgcccacca tgggcaacca 841 cacgtgggag ggctgccacg tggactcgcg cgtggaccac ctctttccgc catccctcta 901 catctttgtc atcggcgtgg ggctgcccac caactgcctg gctctgtggg cggcctaccg 961 ccaggtgcaa cagcgcaacg agctgggcgt ctacctgatg aacctcagca tcgccgacct 1021 gctgtacatc tgcacgctgc cgctgtgggt ggactacttc ctgcaccacg acaactggat 1081 ccacggcccc gggtcctgca agctctttgg gttcatcttc tacaccaata tctacatcag 1141 catcgccttc ctgtgctgca tctcggtgga ccgctacctg gctgtggccc acccactccg 1201 cttcgcccgc ctgcgccgcg tcaagaccgc cgtggccgtg agctccgtgg tctgggccac 1261 ggagctgggc gccaactcgg cgcccctgtt ccatgacgag ctcttccgag accgctacaa 1321 ccacaccttc tgctttgaga agttccccat ggaaggctgg gtggcctgga tgaacctcta 1381 tcgggtgttc gtgggcttcc tcttcccgtg ggcgctcatg ctgctgtcgt accggggcat 1441 cctgcgggcc gtgcggggca gcgtgtccac cgagcgccag gagaaggcca agatcaagcg 1501 gctggccctc agcctcatcg ccatcgtgct ggtctgcttt gcgccctatc acgtgctctt 1561 gctgtcccgc agcgccatct acctgggccg cccctgggac tgcggcttcg aggagcgcgt 1621 cttttctgca taccacagct cactggcttt caccagcctc aactgtgtgg cggaccccat 1681 cctctactgc ctggtcaacg agggcgcccg cagcgatgtg gccaaggccc tgcacaacct 1741 gctccgcttt ctggccagcg acaagcccca ggagatggcc aatgcctcgc tcaccctgga 1801 gaccccactc acctccaaga ggaacagcac agccaaagcc atgactggca gctgggcggc 1861 cactccgccc tcccaggggg accaggtgca gctgaagatg ctgccgccag cacaatgaac 1921 cccgagtggc acagaatccc cagttttccc ctctcatccc acagtccctt ctctcctggt 1981 ctggtgtatg caaattgtat ggaaaaaggg ctgtgttaat attcataaga atacaagaac 2041 ttaggaagag tgaggttggt gtgtcactgg tcaacctttg tgctcccaga tcccatcaca 2101 gtttggcgat tgtggagggc ctcctgaagg aggagatgag taaatatatt tttttggaga 2161 cagggtctca ctgtgttgcc caggctggag tgcagtagtg cagtcgtggc tcactgcagc 2221 ctccacctcc tgggctctcc agcgatcttc ccacatcagc ctcccgagta gctgggacca 2281 caaatgtgag cccacccatg cctggctaat ttttgtactt tttgtataaa tggagtctca 2341 ctatgtttcc ccaggctgat cttgaactcc tgggctcaag agatcctcct gccttggcct 2401 cccaaagtgc tcagattaga gatgtgagcc gccatgtctg gccagataaa ttaagtcaaa 2461 catttggttt ccagaaaata aagacaaata gagaaggtta gatttttttt tttccaacaa 2521 gtggataaaa gtctgtgact cgggggaaag tggaaggaga aatgcagccg atatagagtc 2581 attatgtttg caaagcccct ggtcatacag gccagggaac ataagaccgc aattctaagt 2641 ttctagataa acagcgatct ccaagtcaag actgaggatg aagagggaga atgtcagaac 2701 tcaagtgaag ggcaatcagg gcagactgcc tggaggagtg atgccagaag gtttgggaag 2761 aaggtgtggg acaagaagaa agggtattta ttcattcatt caacagaggt ttatgtaggg 2821 cactgtgctg ggtggggctg gggacacaac aatgactgag gcagcctggc cttgccttca 2881 cagggctcac catacacaag taaataaaaa atatgtaatg tttggaattg ct // LOCUS HSU22491 1596 bp DNA PRI 06-SEP-1995 DEFINITION Human G protein-coupled receptor (GPR7), complete cds. ACCESSION U22491 NID g953232 KEYWORDS Opioid; somatostatin; intronless. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1596) AUTHORS O'Dowd,B.F., Scheideler,M.A., Nguyen,T., Cheng,R., Rasmussen,J.S., Zastawny,R., Marchese,A., Heng,H.H.Q., Tsui,L.-C., Shi,X., Asa,S., Puy,L. and George,S.R. TITLE The cloning and chromosomal mapping of two novel human opioid-somatostatin-like receptor genes, GPR7 and GPR8, expressed in discrete areas of the brain JOURNAL Genomics 28 (1), 84-91 (1995) MEDLINE 96070436 REFERENCE 2 (bases 1 to 1596) AUTHORS O'Dowd,B. TITLE Direct Submission JOURNAL Submitted (13-MAR-1995) Brian O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek, Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1596 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q11.2-q21.1" gene 526..1512 /gene="GPR7" CDS 526..1512 /gene="GPR7" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g953233" /translation="MDNASFSEPWPANASGPDPALSCSNASTLAPLPAPLAVAVPVVY AVICAVGLAGNSAVLYVLLRAPRMKTVTNLFILNLAIADELFTLVLPINIADFLLRQW PFGELMCKLIVAIDQYNTFSSLYFLTVMSADRYLVVLATAESRRVAGRTYSAARAVSL AVWGIVTLVVLPFAVFARLDDEQGRRQCVLVFPQPEAFWWRASRLYTLVLGFAIPVST ICVLYTTLLCRLHAMRLDSHAKALERAKKRVTFLVVAILAVCLLCWTPYHLSTVVALT TDLPQTPLVIAISYFITSLTYANSCLNPFLYAFLDASFRRNLRQLITCRAAA" BASE COUNT 253 a 550 c 477 g 316 t ORIGIN 1 attctgcaga tatccatcac actggcggcc gctcgagcat gcatctagag cttgccactg 61 cgggattctg tggggtaacc tgggtctacg gaagtttcct gaaagagggg agaagggttt 121 gcatttttcc tatggaggat tcttctctct ctagcatttc gtttgatgta ttcaactggt 181 agaagtgaga tttcaacagg tagcagagag cgctcacgtg gaggaggttt ggggcgccgc 241 ggcacccccc acccctcctc gggaccgcgc ctatttctaa agttacacgt cgacgaacta 301 acctatgctt taaattcctc tttccgaccc cgtgagtccg cggcgacatt gggccgtggg 361 gtggctggga acggtcccct cctccggaaa aaccagagaa cggcttggag agctggaaac 421 gagcgtccgc gagcaggtcc gtgcagaacc gggcttcagg accgctgagc tccgtagggc 481 gtccttgggg gacgccaggt cgccggctcc tctgccctcg ttgagatgga caacgcctcg 541 ttctcggagc cctggcccgc caacgcatcg ggcccggacc cggcgctgag ctgctccaac 601 gcgtcgactc tggcgccgct gccggcgccg ctggcggtgg ctgtaccagt tgtctacgcg 661 gtgatctgcg ccgtgggtct ggcgggcaac tccgccgtgc tgtacgtgtt gctgcgggcg 721 ccccgcatga agaccgtcac caacctgttc atcctcaacc tggccatcgc cgacgagctc 781 ttcacgctgg tgctgcccat caacatcgcc gacttcctgc tgcggcagtg gcccttcggg 841 gagctcatgt gcaagctcat cgtggctatc gaccagtaca acaccttctc cagcctctac 901 ttcctcaccg tcatgagcgc cgaccgctac ctggtggtgt tggccactgc ggagtcgcgc 961 cgggtggccg gccgcaccta cagcgccgcg cgcgcggtga gcctggccgt gtgggggatc 1021 gtcacactcg tcgtgctgcc cttcgcagtc ttcgcccggc tagacgacga gcagggccgg 1081 cgccagtgcg tgctagtctt tccgcagccc gaggccttct ggtggcgcgc gagccgcctc 1141 tacacgctcg tgctgggctt cgccatcccc gtgtccacca tctgtgtcct ctataccacc 1201 ctgctgtgcc ggctgcatgc catgcggctg gacagccacg ccaaggccct ggagcgcgcc 1261 aagaagcggg tgaccttcct ggtggtggca atcctggcgg tgtgcctcct ctgctggacg 1321 ccctaccacc tgagcaccgt ggtggcgctc accaccgacc tcccgcagac gccgctggtc 1381 atcgctatct cctacttcat caccagcctg acgtacgcca acagctgcct caaccccttc 1441 ctctacgcct tcctggacgc cagcttccgc aggaacctcc gccagctgat aacttgccgc 1501 gcggcagcct gactccccca gcgtccggct ccgcaactgc gcgccactcc tggccagcga 1561 gggaggagcc ggcgccagag tgcgggacca gacagg // LOCUS HSU22492 1518 bp DNA PRI 06-SEP-1995 DEFINITION Human G protein-coupled receptor gene (GPR8), complete cds. ACCESSION U22492 NID g953234 KEYWORDS Opioid, somatostatin, intronless. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1518) AUTHORS O'Dowd,B.F., Scheideler,M.A., Nguyen,T., Cheng,R., Rasmussen,J.S., Zastawny,R., Marchese,A., Heng,H.H.Q., Tsui,L.-C., Shi,X., Asa,S., Puy,L. and George,S.R. TITLE The cloning and chromosomal mapping of two novel human opioid-somatostatin-like receptor genes, GPR7 and GPR8, expressed in discrete areas of the brain JOURNAL Genomics 28 (1), 84-91 (1995) MEDLINE 96070436 REFERENCE 2 (bases 1 to 1518) AUTHORS O'Dowd,B.F., Scheideler,M.A., Nguyen,T., Cheng,R., Rasmussen,J.S., Zastawny,R., Marchese,A., Heng,H.H.Q., Tsui,L.-C., Shi,X., Asa,S., Puy,L. and George,S.R. TITLE Direct Submission JOURNAL Submitted (13-MAR-1995) Brian O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek, Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1518 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q13.3" gene 349..1350 /gene="GPR8" CDS 349..1350 /gene="GPR8" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g953235" /translation="MQAAGHPEPLDSRGSFSLPTMGANVSQDNGTGHNATFSEPLPFL YVLLPAVYSGICAVGLTGNTAVILVILRAPKMKTVTNVFILNLAVADGLFTLVLPVNI AEHLLQYWPFGELLCKLVLAVDHYNIFSSIYFLAVMSVDRYLVVLATVRSRHMPWRTY RGAKVASLCVWLGVTVLVLPFFSFAGVYSNELQVPSCGLSFPWPERVWFKASRVYTLV LGFVLPVCTICVLYTDLLRRLRAVRLRSGAKALGKARRKVTVLVLVVLAVCLLCWTPF HLASVVALTTDLPQTPLVISMSYVITSLTYANSCLNPFLYAFLDDNFRKNFRSILRC" BASE COUNT 269 a 554 c 382 g 313 t ORIGIN 1 tccactagta acggccgcca ggatccacat ctcttcccag gagggtggcc agcagctgct 61 ctctgcggga ggagggaact gatctgctga agtctcacca ggaagaggcg ggaaggcccc 121 cacacacccc accaggctcc ctctggcccc atgtccttga cctggcaaag tggccgcagt 181 ctctgccaga gaacctggag tggctgtgcc taacagacgg ctggatctca aagtctctgg 241 ttgtttttct ttcctagaat ccagcctaag gaggccccca accagatacc caactccaag 301 gcacctccca cctgcccagg gcgcaaatcg tcaacggtcc cagctacaat gcaggccgct 361 gggcacccag agccccttga cagcaggggc tccttctccc tccccacgat gggtgccaac 421 gtctctcagg acaatggcac tggccacaat gccaccttct ccgagccact gccgttcctc 481 tatgtgctcc tgcccgccgt gtactccggg atctgtgctg tggggctgac tggcaacacg 541 gccgtcatcc ttgtaatcct aagggcgccc aagatgaaga cggtgaccaa cgtgttcatc 601 ctgaacctgg ccgtcgccga cgggctcttc acgctggtac tgcccgtcaa catcgcggag 661 cacctgctgc agtactggcc cttcggggag ctgctctgca agctggtgct ggccgtcgac 721 cactacaaca tcttctccag catctacttc ctagccgtga tgagcgtgga ccgatacctg 781 gtggtgctgg ccaccgtgag gtcccgccac atgccctggc gcacctaccg gggggcgaag 841 gtcgccagcc tgtgtgtctg gctgggcgtc acggtcctgg ttctgccctt cttctctttc 901 gctggcgtct acagcaacga gctgcaggtc ccaagctgtg ggctgagctt cccgtggccc 961 gagcgggtct ggttcaaggc cagccgtgtc tacactttgg tcctgggctt cgtgctgccc 1021 gtgtgcacca tctgtgtgct ctacacagac ctcctgcgca ggctgcgggc cgtgcggctc 1081 cgctctggag ccaaggctct aggcaaggcc aggcggaagg tgaccgtcct ggtcctcgtc 1141 gtgctggccg tgtgcctcct ctgctggacg cccttccacc tggcctctgt cgtggccctg 1201 accacggacc tgccccagac cccactggtc atcagtatgt cctacgtcat caccagcctc 1261 acgtacgcca actcgtgcct gaaccccttc ctctacgcct ttctagatga caacttccgg 1321 aagaacttcc gcagcatatt gcggtgctga agggcctggg caccatcatc cccatcatca 1381 tcatcacccc catcatcatc acccccacca ttacccccat cgtcacgccc atcatcacgc 1441 ccatcatcac cccccatcat cacccccatc atcatgccca tcatcacccc ccatcatcat 1501 catgcccacc cctcatca // LOCUS HSU23052 873 bp DNA PRI 31-MAR-1995 DEFINITION Human arylamine N-acetyltransferase (NAT2), allele NAT2*14A, complete cds. ACCESSION U23052 NID g747646 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 873) AUTHORS Bell,D.A., Taylor,J.A., Butler,M.A., Stephens,E.A., Wiest,J., Brubaker,L.H., Kadlubar,F.F. and Lucier,G.W. TITLE Genotype/phenotype discordance for human arylamine N-acetyltransferase (NAT2) reveals a new slow-acetylator allele common in African-Americans JOURNAL Carcinogenesis 14 (8), 1689-1692 (1993) MEDLINE 93358376 REFERENCE 2 (bases 1 to 873) AUTHORS Ferguson,R.J., Doll,M.A., Rustan,T.D., Gray,K. and Hein,D.W. TITLE Cloning, expression, and functional characterization of two mutant (NAT2(191) and NAT2(341/803)) and wild-type human polymorphic N-acetyltransferase (NAT2) alleles JOURNAL Drug Metab. Dispos. 22 (3), 371-376 (1994) MEDLINE 94349811 REFERENCE 3 (bases 1 to 873) AUTHORS Hein,D.W. TITLE Direct Submission JOURNAL Submitted (17-MAR-1995) David W. Hein, Univ. of North Dakota Sch. of Medicine, Pharmacology-Toxicology, 501 N. Columbia Road., Grand Forks, ND 58202-9037, USA FEATURES Location/Qualifiers source 1..873 /organism="Homo sapiens" /note="human" /db_xref="taxon:9606" gene 1..873 /gene="NAT2" source 1..870 /organism="Homo sapiens" /map="8p21.3-23.1" /tissue_type="colon surgical samples" /chromosome="8" CDS 1..873 /gene="NAT2" /EC_number="2.3.1.5" /note="Allele: NAT2*14A" /codon_start=1 /product="arylamine N-acetyltransferase" /db_xref="PID:g727413" /translation="MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHC GQAMELGLEAIFDHIVRRNQGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYS TGMVHLLLQVTIDGRNYIVDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWY LDQIRREQYITNKEFLNSHLLPKKKHQKIYLFTLEPRTIEDFESMNTYLQTSPTSSFI TTSFCSLQTPEGVYCLVGFILTYRKFNYKDNTDLVEFKTLTEEEVEEVLKNIFKISLG RNLVPKPGDGSLTI" BASE COUNT 260 a 179 c 188 g 246 t ORIGIN 1 atggacattg aagcatattt tgaaagaatt ggctataaga actctaggaa caaattggac 61 ttggaaacat taactgacat tcttgagcac cagatccggg ctgttccctt tgagaacctt 121 aacatgcatt gtgggcaagc catggagttg ggcttagagg ctatttttga tcacattgta 181 agaagaaacc agggtgggtg gtgtctccag gtcaatcaac ttctgtactg ggctctgacc 241 acaatcggtt ttcagaccac aatgttagga gggtattttt acatccctcc agttaacaaa 301 tacagcactg gcatggttca ccttctcctg caggtgacca ttgacggcag gaattacatt 361 gtcgatgctg ggtctggaag ctcctcccag atgtggcagc ctctagaatt aatttctggg 421 aaggatcagc ctcaggtgcc ttgcattttc tgcttgacag aagagagagg aatctggtac 481 ctggaccaaa tcaggagaga gcagtatatt acaaacaaag aatttcttaa ttctcatctc 541 ctgccaaaga agaaacacca aaaaatatac ttatttacgc ttgaacctcg aacaattgaa 601 gattttgagt ctatgaatac atacctgcag acgtctccaa catcttcatt tataaccaca 661 tcattttgtt ccttgcagac cccagaaggg gtttactgtt tggtgggctt catcctcacc 721 tatagaaaat tcaattataa agacaataca gatctggtcg agtttaaaac tctcactgag 781 gaagaggttg aagaagtgct gaaaaatata tttaagattt ccttggggag aaatctcgtg 841 cccaaacctg gtgatggatc ccttactatt tag // LOCUS HSU24186 1565 bp DNA PRI 20-SEP-1996 DEFINITION Human replication protein A complex subunit homolog Rpa4 gene, complete cds. ACCESSION U24186 NID g887964 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1565) AUTHORS Keshav,K.F., Chen,C. and Dutta,A. TITLE Rpa4, a homolog of the 34-kilodalton subunit of the replication protein A complex JOURNAL Mol. Cell. Biol. 15 (6), 3119-3128 (1995) MEDLINE 95280910 REFERENCE 2 (bases 1 to 1565) AUTHORS Keshav,K.F. TITLE Direct Submission JOURNAL Submitted (05-APR-1995) Kylie F. Keshav, Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, 75 Francis St, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1565 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Acid activation-tagged HeLa cDNA library from R. Brent" CDS 405..1190 /note="replication protein A complex 34 kd subunit homolog" /codon_start=1 /product="Rpa4" /db_xref="PID:g887965" /translation="MSKSGFGSYASISAADGASGGSDQLCERDATPAIKTQRPKVRIQ DVVPCNVNQLLSSTVFDPVFKVRGIIVSQVSIVGVIRGAEKASNHICYKIDDMTAKPI EARQWFGREKVKQVTPLSVGVYVKVFGILKCPTGTKSLEVLKIHVLEDMNEFTVHILE TVNAHMMLDKARRDTTVESVPVSPSEVNDAGDNDESHRNFIQDEVLRLIHECPHQEGK SIHELRAQLCDLSVKAIKEAIDYLTVEGHIYPTVDREHFKSAD" BASE COUNT 441 a 320 c 403 g 401 t ORIGIN 1 tctagtaaaa atgcattttt atagagatgt tgggaaaggc ttcttgaaat tacacgtggg 61 acttttaata gataggcgct ttgaccagct aagcaacagg gctcccctcg tgtgggactt 121 ttagaatgta gcaaccactg acacgcaggg aaggattatg cgatcaggtg agaaggtggc 181 cgaccctgac tggctggaag cagatgcatt ctggtagttg attggtccac aggtagcgtg 241 acgcttgtca cgtcctcagc ctcccagcat tcaatcgtag cctttcggac agctcgaagc 301 ccttctgtgg agagctcgaa gccttctgtg gagaactcaa agccgtccgt ggagccccag 361 acgagccaaa gcccaccttc tcctcagcct gagctgtctt gaagatgagt aagagtgggt 421 ttgggagcta tgcgagcatt tctgctgctg atggagcgag tggaggcagt gaccaactgt 481 gtgagagaga tgcaactcct gctattaaga cccaaagacc taaggtccga attcaggacg 541 ttgtaccgtg taacgtgaac cagcttctca gctctactgt gtttgaccct gtgttcaagg 601 ttaggggaat tatagtttcc caggtctcca tcgtgggggt aatcagaggg gcagagaagg 661 cttcaaatca catttgttac aaaattgatg atatgaccgc gaaaccaatc gaggcccgac 721 agtggtttgg tagagagaaa gtcaagcagg tgactccatt gtcagtcgga gtatatgtca 781 aagtgtttgg tatcctcaaa tgtcccacgg gaacaaagag ccttgaggta ttgaaaattc 841 atgtcctaga ggacatgaac gagttcaccg tgcatattct ggaaacggtc aatgcacaca 901 tgatgctgga taaagcccgt cgtgatacca ctgtagaaag tgtgcctgtg tctccatcag 961 aagtgaatga tgctggggat aacgatgaga gtcaccgcaa tttcatccag gacgaagtgc 1021 tgcgtttgat tcatgagtgt cctcatcagg aagggaagag catccatgag ctccgggctc 1081 agctctgcga ccttagcgtc aaggccatca aggaagcgat tgattatctg accgttgagg 1141 gccacatcta tcccactgtg gatcgggagc attttaagtc tgctgattga ggcagggaaa 1201 acatcctttc atttttcgaa gacccttgca tccagctgtg agtaattttg acctgttgac 1261 tttttaggag taggactaaa aaaaaaaatc tcaagtggca ttctttgtca actcgctgct 1321 tttctaactg ctttgaactt ttcggatttt ctgtatttga agctcagaga gagacggtga 1381 tggataaatt gacaactctg taggatttac tagcaagcta atggaaacat gattttcggg 1441 gaagaaaaac tacagaaaat gtagaaattt attatttaat tgtgttggag cttctttttc 1501 caaaagaaaa actagttgca gtcagggagc cagcgaaaag acaaaaaaaa aaaaaaaaaa 1561 cacga // LOCUS HSU29589 3906 bp DNA PRI 24-JUL-1995 DEFINITION Human m3 muscarinic acetylcholine receptor (CHRM3) gene, complete cds. ACCESSION U29589 NID g903978 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3906) AUTHORS Bonner,T.I., Young,A.C., Brann,M.R. and Buckley,N.J. TITLE Cloning and expression of the human and rat m5 muscarinic acetylcholine receptor genes JOURNAL Neuron 1 (5), 403-410 (1988) MEDLINE 90166521 REFERENCE 2 (bases 1 to 3906) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (20-JUN-1995) Tom I. Bonner, Lab of Cell Biology, NIMH, National Institutes of Health, Bldg. 36, Room 3A-17, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..3906 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q43-44" /clone_lib="partial HaeIII-AluI genomic library of R.M. Lawn, E.F. Fritsch, R.C. Parker, G. Blake, and T. Maniatis, Cell 15, 1157-1174, 1978." intron <1..182 exon 183..3713 /note="based on comparison to the rat cDNA rather than a human cDNA" gene 202..1974 /gene="CHRM3" CDS 202..1974 /gene="CHRM3" /codon_start=1 /product="m3 muscarinic acetylcholine receptor" /db_xref="PID:g903979" /translation="MTLHNNSTTSPLFPNISSSWIHSPSDAGLPPGTVTHFGSYNVSR AAGNFSSPDGTTDDPLGGHTVWQVVFIAFLTGILALVTIIGNILVIVSFKVNKQLKTV NNYFLLSLACADLIIGVISMNLFTTYIIMNRWALGNLACDLWLAIDYVASNASVMNLL VISFDRYFSITRPLTYRAKRTTKRAGVMIGLAWVISFVLWAPAILFWQYFVGKRTVPP GECFIQFLSEPTITFGTAIAAFYMPVTIMTILYWRIYKETEKRTKELAGLQASGTEAE TENFVHPTGSSRSCSSYELQQQSMKRSNRRKYGRCHFWFTTKSWKPSSEQMDQDHSSS DSWNNNDAAASLENSASSDEEDIGSETRAIYSIVLKLPGHSTILNSTKLPSSDNLQVP EEELGMVDLERKADKLQAQKSVDDGGSFPKSFSKLPIQLESAVDTAKTSDVNSSVGKS TATLPLSFKEATLAKRFALKTRSQITKRKRMSLVKEKKAAQTLSAILLAFIITWTPYN IMVLVNTFCDSCIPKTFWNLGYWLCYINSTVNPVCYALCNKTFRTTFKMLLLCQCDKK KRRKQQYQQRQSVIFHKRAPEQAL" polyA_site 3714 /note="based on comparison to the rat cDNA rather than a human cDNA" BASE COUNT 1073 a 897 c 875 g 1061 t ORIGIN 1 atcgatgtct gtctgcccta gactccactt atttaaaata agagaatgaa cttgatgttt 61 ggcttcatag agattcagca ccctgtaata ggccttccat gtcttttaac gtatgtaatg 121 caaagaacaa acaaataaag gcagaaattt ttctaactct gtctcttctc tctttccccc 181 agactatgtc agagagtcac aatgaccttg cacaataaca gtacaacctc gcctttgttt 241 ccaaacatca gctcctcctg gatacacagc ccctccgatg cagggctgcc cccgggaacc 301 gtcactcatt tcggcagcta caatgtttct cgagcagctg gcaatttctc ctctccagac 361 ggtaccaccg atgaccctct gggaggtcat accgtctggc aagtggtctt catcgctttc 421 ttaacgggca tcctggcctt ggtgaccatc atcggcaaca tcctggtaat tgtgtcattt 481 aaggtcaaca agcagctgaa gacggtcaac aactacttcc tcttaagcct ggcctgtgcc 541 gatctgatta tcggggtcat ttcaatgaat ctgtttacga cctacatcat catgaatcga 601 tgggccttag ggaacttggc ctgtgacctc tggcttgcca ttgactacgt agccagcaat 661 gcctctgtta tgaatcttct ggtcatcagc tttgacagat acttttccat cacgaggccg 721 ctcacgtacc gagccaaacg aacaacaaag agagccggtg tgatgatcgg tctggcttgg 781 gtcatctcct ttgtcctttg ggctcctgcc atcttgttct ggcaatactt tgttggaaag 841 agaactgtgc ctccgggaga gtgcttcatt cagttcctca gtgagcccac cattactttt 901 ggcacagcca tcgctgcttt ttatatgcct gtcaccatta tgactatttt atactggagg 961 atctataagg aaactgaaaa gcgtaccaaa gagcttgctg gcctgcaagc ctctgggaca 1021 gaggcagaga cagaaaactt tgtccacccc acgggcagtt ctcgaagctg cagcagttac 1081 gaacttcaac agcaaagcat gaaacgctcc aacaggagga agtatggccg ctgccacttc 1141 tggttcacaa ccaagagctg gaaacccagc tccgagcaga tggaccaaga ccacagcagc 1201 agtgacagtt ggaacaacaa tgatgctgct gcctccctgg agaactccgc ctcctccgac 1261 gaggaggaca ttggctccga gacgagagcc atctactcca tcgtgctcaa gcttccgggt 1321 cacagcacca tcctcaactc caccaagtta ccctcatcgg acaacctgca ggtgcctgag 1381 gaggagctgg ggatggtgga cttggagagg aaagccgaca agctgcaggc ccagaagagc 1441 gtggacgatg gaggcagttt tccaaaaagc ttctccaagc ttcccatcca gctagagtca 1501 gccgtggaca cagctaagac ttctgacgtc aactcctcag tgggtaagag cacggccact 1561 ctacctctgt ccttcaagga agccactctg gccaagaggt ttgctctgaa gaccagaagt 1621 cagatcacta agcggaaaag gatgtccctg gtcaaggaga agaaagcggc ccagaccctc 1681 agtgcgatct tgcttgcctt catcatcact tggaccccat acaacatcat ggttctggtg 1741 aacacctttt gtgacagctg catacccaaa accttttgga atctgggcta ctggctgtgc 1801 tacatcaaca gcaccgtgaa ccccgtgtgc tatgctctgt gcaacaaaac attcagaacc 1861 actttcaaga tgctgctgct gtgccagtgt gacaaaaaaa agaggcgcaa gcagcagtac 1921 cagcagagac agtcggtcat ttttcacaag cgcgcacccg agcaggcctt gtagaatgag 1981 gttgtatcaa tagcagtgac aaaacgcaca catcaaccca cagaccttag gaggaggaag 2041 gcgagggcgg ggtgacttct ggtgatgata aaaatggttt tatcacccag atgtgaaaga 2101 agctgcctgt ttactgatcc attgaataaa cccattttaa tagaaaaagt caataccaat 2161 tcagcaaaaa gaaaaaaaaa acatactact gaatataaag aaatttattc tgaaatagac 2221 tttacgtgtt tttttcttaa agaggagaaa aatattgctt gacggcaatt atatacccaa 2281 agtgatttgc ctgggtcctt taattcccat tagctttgga atctcagatg agcatagctg 2341 acccagttcc cacattcttc ccaaggatcc aaaagtggga atccagaccc caagtggaac 2401 actgcaggct tacgaatctg tggttccaaa attatttcat acgttgcaaa gctgaatctt 2461 cttgtcccaa tagagcttcc tgtcttttct ttggtgtgtt gttaaactct atttgtggac 2521 ttgattcttg attcttgcaa agtactgttt tgtgcagttc aagtttcgta caaataaaat 2581 acttaagtat atatatatgt gtgagttctg cacgcacaca catagtgtat ataatatcat 2641 gggaaacact gaactggcaa attattcctg caacatacgc tttcagtact ttggtaactg 2701 aagttctcta ggatcctaat gcaacattaa cgtgaaataa gcccagtgta atgtttttgc 2761 aaaccagggc tgttttccac agagagcagc caggccttcc cagcaggtct gtgcagagcg 2821 gacaggctcg tgagtcagct gagcgccgtg gcttcgccag acttggtgtt aagcaacctc 2881 ctttgttgat gtctcaacag agctaaatcg gggcccctct gagctcaaag aatgaaccac 2941 atccacacgt ttgaatttaa tcatctaaat ctgaatgttt cagaacaaaa tttctgctat 3001 ctaaactgct tgaaactcaa taatagtgtc acgtttgaat gtcatacaca gcaatatata 3061 tatatgtgta tatatatata tatggcaaag caaaaaaaaa aacatggtaa gagagaatga 3121 aggagaacat tgtgtttgat tcttgctgaa tggcaccttc tcaaagaaaa tagggcttgc 3181 acctttgtta atcagctgtg gccagtgctt tctggtgttc attgtgtaac cttcacccag 3241 gaataggtga ggttttagga agttacatgt cctctgaaga aagaattaca ctctgaaaag 3301 taatgcttca aattgatttc cttacctttt gggaaaaaaa aaaaattgtt tttttgcatt 3361 ctcccttgaa ttgaccaaaa tgttaactgt ttcatttggg gaggggatgg ggtgctgcca 3421 tcattgtcgt tgttgttgct gctgtagctg ttggggtttc ttttcctgtt gccggggctg 3481 tttggggaga gggaggggag ggaggtggga gggccgcgga gatatcttcc cctttgtaca 3541 gggcattctg tgttgtgaac ccagagctgg gtagaagctg cttttgtatt cagtgtgagg 3601 tggtgtttac agacgacttt gacaacagta gaagtgtact cagtggtgtc tgtgtatctg 3661 aactatttaa tttcgtgtta tgtttatatg cagaaatatt tatggatact acaccaagtg 3721 tttatttatt gttgataaat atgactcttc agtcgtcagc catggtgtcc tttcaaatga 3781 ttctttaagg tccacttgag caatgaatag agtatattgg agctttcctg tggctaagaa 3841 gaagaaacat gtcatcctgt tgccatcacc aagcacctaa ctctttctag gtaataaaaa 3901 gtcaac // LOCUS HSU29727 2848 bp DNA PRI 05-DEC-1995 DEFINITION Human BMK1 gamma kinase mRNA, complete cds. ACCESSION U29727 NID g973310 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2848) AUTHORS Lee,J.D., Ulevitch,R.J. and Han,J. TITLE Primary structure of BMK1: a new mammalian map kinase JOURNAL Biochem. Biophys. Res. Commun. 213 (2), 715-724 (1995) MEDLINE 95374539 REFERENCE 2 (bases 1 to 2848) AUTHORS Lee,J.D., Ulevitch,R.J. and Han,J. TITLE Direct Submission JOURNAL Submitted (21-JUN-1995) Jiing-Dwan Lee, Immunology(IMM-12), TSRI, 10666 N. Torrey Pines Rd., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2848 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 90..2540 /codon_start=1 /function="MAP kinase" /product="BMK1 gamma kinase" /db_xref="PID:g973311" /translation="MAEPLKEEDGEDGSAEPPGPVKVEPAHTAASVAAKNLALLKARS FDVTFDVGDEYEIIETIGNGAYGVVSSARRRLTGQQVAIKKIPNAFDVVTNAKRTLRE LKILKHFKHDNIIAIKDILRPTVPYGEFKSVYVVLDLMESDLHQIIHSSQPLTLEHVR YFLYQLLRGLKYMHSAQVIHRDLKPSNLLVNENCELKIGDFGMARGLCTSPAEHQYFM TEYVATRWYRAPELMLSLHEYTQAIDLWSVGCIFGEMLARRQLFPGKNYVHQLQLIMM VLGTPSPAVIQAVGAERVRAYIQSLPPRQPVPWETVYPGADRQALSLLGRMLRFEPSA RISAAAALRHPFLAKYHDPDDEPDCAPPFDFAFDREALTRERIKEAIVAEIEDFHARR EGIRQQIRFQPSLQPVASEPGCPDVEMPSPWAPSGDCAMESPPPAPPPCPGPAPDTID LTLQPPPPVSEPAPPKKDGAISDNTKAALKAALLKSLRSRLRDGPSAPLEAPEPRKPV TAQERQREREEKRRRRQERAKEREKRRQERERKERGAGASGGPSTDPLAGLVLSDNDR SLLERWTRMARPAAPALTSVPAPAPAPTPTPTPVQPTSPPPGPVAQPTGPQPQSAGST SGPVPQPACPPPGPAPHPTGPPGPIPVPAPPQIATSTSLLAAQSLVPPPGLPGSSTPG VLPYFPPGLPPPDAGGAPQSSMSESPDVNLVTQQLSKSQVEDPLPPVFSGTPKGSGAG YGVGFDLEEFLNQSFDMGVADGPQDGQADSASLSASLLADWLEGHGMNPADIESLQRE IQMDSPMLLADLPDLQDP" polyA_site 2848 /note="22 A nucleotides" BASE COUNT 559 a 967 c 766 g 556 t ORIGIN 1 gtgagccacc ctcggagacc cccgcgctgg ggacgggagg ccggcgagcc tcgggacctc 61 tgaaagcctt gaggaggcgc ggggacacca tggccgagcc tctgaaggag gaagacggcg 121 aggacggctc tgcggagccc cccgggcccg tgaaggtcga acccgcccac accgctgcct 181 ctgtagcggc caagaacctg gccctgctta aagcccgctc cttcgatgtg acctttgacg 241 tgggcgacga gtacgagatc atcgagacca taggcaacgg ggcctatgga gtggtgtcct 301 ccgcccgccg ccgcctcacc ggccagcagg tggccatcaa gaagatccct aatgctttcg 361 atgtggtgac caatgccaag cggaccctca gggagctgaa gatcctcaag cactttaaac 421 acgacaacat catcgccatc aaggacatcc tgaggcccac cgtgccctat ggcgaattca 481 aatctgtcta cgtggtcctg gacctgatgg aaagcgacct gcaccagatc atccactcct 541 cacagcccct cacactggaa cacgtgcgct acttcctgta ccaactgctg cggggcctga 601 agtacatgca ctcggctcag gtcatccacc gtgacctgaa gccctccaac ctattggtga 661 atgagaactg tgagctcaag attggtgact ttggtatggc tcgtggcctg tgcacctcgc 721 ccgctgaaca tcagtacttc atgactgagt atgtggccac gcgctggtac cgtgcgcccg 781 agctcatgct ctctttgcat gagtatacac aggctattga cctctggtct gtgggctgca 841 tctttggtga gatgctggcc cggcgccagc tcttcccagg caaaaactat gtacaccagc 901 tacagctcat catgatggtg ctgggtaccc catcaccagc cgtgattcag gctgtggggg 961 ctgagagggt gcgggcctat atccagagct tgccaccacg ccagcctgtg ccctgggaga 1021 cagtgtaccc aggtgccgac cgccaggccc tatcactgct gggtcgcatg ctgcgttttg 1081 agcccagcgc tcgcatctca gcagctgctg cccttcgcca ccctttcctg gccaagtacc 1141 atgatcctga tgatgagcct gactgtgccc cgccctttga ctttgccttt gaccgcgaag 1201 ccctcactcg ggagcgcatt aaggaggcca ttgtggctga aattgaggac ttccatgcaa 1261 ggcgtgaggg catccgccaa cagatccgct tccagccttc tctacagcct gtggctagtg 1321 agcctggctg tccagatgtt gaaatgccca gtccctgggc tcccagtggg gactgtgcca 1381 tggagtctcc accaccagcc ccgccaccat gccccggccc tgcacctgac accattgatc 1441 tgaccctgca gccacctcca ccagtcagtg agcctgcccc accaaagaaa gatggtgcca 1501 tctcagacaa tactaaggct gcccttaaag ctgccctgct caagtctttg aggagccggc 1561 tcagagatgg ccccagcgca cccctggagg ctcctgagcc tcggaagccg gtgacagccc 1621 aggagcgcca gcgggagcgg gaggagaagc ggcggaggcg gcaagaacga gccaaggagc 1681 gggagaaacg gcggcaggag cgggagcgaa aggaacgggg ggctggggcc tctgggggcc 1741 cctccactga ccccttggct ggactagtgc tcagtgacaa tgacagaagc ctgttggaac 1801 gctggactcg aatggcccgg cccgcagccc cagccctcac ctctgtgccg gcccctgccc 1861 cagcgccaac gccaacccca accccagtcc aacctaccag tcctcctcct ggccctgtag 1921 cccagcccac tggcccgcaa ccacaatctg cgggctctac ctctggccct gtaccccagc 1981 ctgcctgccc accccctggc cctgcacccc accccactgg ccctcctggg cccatccctg 2041 tccccgcgcc accccagatt gccacctcca ccagcctcct ggctgcccag tcacttgtgc 2101 caccccctgg gctgcctggc tccagcaccc caggagtttt gccttacttc ccacctggcc 2161 tgccgccccc agacgccggg ggagcccctc agtcttccat gtcagagtca cctgatgtca 2221 accttgtgac ccagcagcta tctaagtcac aggtggagga ccccctgccc cctgtgttct 2281 caggcacacc aaagggcagt ggggctggct acggtgttgg ctttgacctg gaggaattct 2341 taaaccagtc tttcgacatg ggcgtggctg atgggccaca ggatggccag gcagattcag 2401 cctctctctc agcctccctg cttgctgact ggctcgaagg ccatggcatg aaccctgccg 2461 atattgagtc cctgcagcgt gagatccaga tggactcccc aatgctgctg gctgacctgc 2521 ctgacctcca ggacccctga ggcccccagc ctgtgccttg ctgccacagt agacctagtt 2581 ccaggatcca tgggagcatt ctcaaaggct ttagccctgg acccagcagg tgaggctcgg 2641 cttggattat tctgcaggtt catctcagac ccacctttca gccttaagca gccacctgag 2701 ccaccaccga gccatggcag gatcgggaga ccccaactcc ccctgaacaa tccttttcag 2761 tattatattt ttattattat tatgttatta ttacactgtc tttttgccat caaaatgagg 2821 cctgtgaaat acaaggttcc cttctgca // LOCUS HSU31202 1557 bp DNA PRI 14-DEC-1995 DEFINITION Human noggin (NOGGIN) gene, complete cds. ACCESSION U31202 NID g1117816 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1557) AUTHORS Valenzuela,D.M., Economides,A.N., Rojas,E., Lamb,T.M., Nunez,L., Jones,P., Ip,N.Y., Espinosa,R., Brannan,C.I., Gilbert,D.J., Copeland,N.G., Jenkins,N.A., LeBeau,M.M., Harland,R.M. and Yancopoulos,G.D. TITLE Identification of mammalian noggin and its expression in the adult nervous system JOURNAL J. Neurosci. 15 (9), 6077-6084 (1995) MEDLINE 95395592 REFERENCE 2 (bases 1 to 1557) AUTHORS Valenzuela,D.M. TITLE Direct Submission JOURNAL Submitted (07-JUL-1995) David M. Valenzuela, Discovery Group, Regeneron Pharmaceuticals, Inc., 777 Old Saw Mill River Rd., Tarrytown, NY 10591, USA FEATURES Location/Qualifiers source 1..1557 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q22" /chromosome="17" gene 812..1510 /gene="NOGGIN" CDS 812..1510 /gene="NOGGIN" /codon_start=1 /product="noggin" /db_xref="PID:g1117817" /translation="MERCPSLGVTLYALVVVLGLRATPAGGQHYLHIRPAPSDNLPLV DLIEHPDPIFDPKEKDLNETLLRSLLGGHYDPGFMATSPPEDRPGGGGGAAGGAEDLA ELDQLLRQRPSGAMPSEIKGLEFSEGLAQGKKQRLSKKLRRKLQMWLWSQTFCPVLYA WNDLGSRFWPRYVKVGSCFSKRSCSVPEGMVCKPSKSVHLTVLRWRCQRRGGQRCGWI PIQYPIISECKCSC" BASE COUNT 242 a 521 c 580 g 213 t 1 others ORIGIN 1 gagctccggc gggtcagccg gactgtcggc ttcccggggc atctgggtcc ggcggggcac 61 agccctgggc gctgccgaag ccgccgccgc cgcctccgcg gcgagtacag gcggcttccc 121 ccggagcctg tgcagctcca gctcctcggg ggtggagaag tggggggtgg gggtgatgta 181 tggggggaag aagggggagg ggccaacccc gagagagtca gtggtttcca tggtgatgga 241 gctgaaagtg caggaaattt aaaggcttgg accctgcgag acagacaaac cggtgccaac 301 gtgcgcggac gccgccgccg ccgccgccgc tggagtccgc cgggcagagc cggccgcgga 361 gcccggagca ggcggaggga agtgccccta gaaccagctc agccagcggc gcttgcacag 421 agcggccggn cgaagagcag cgagaggagg aggggagagc ggctcgtcca cgcgccctgc 481 gccgccgccg gcccgggaag gcagcgagga gccggcgcct cccgcgcccc gcggtcgccc 541 tggagtaatt tcggatgccc agccgcggcc gccttcccca gtagacccgg gagaggagtt 601 gcggccaact tgtgtgcctt tcttccgccc cggtgggagc cggcgctgcg cgaagggctc 661 tcccggcggc tcatgctgcc ggccctgcgc ctgcccagcc tcgggtgagc cgcctccgga 721 gagacggggg agcgcggcgg cgccgcgggc tcggcgtgct ctcctccggg gacgcgggac 781 gaagcagcag ccccgggcgc gcgccagagg catggagcgc tgccccagcc taggggtcac 841 cctctacgcc ctggtggtgg tcctggggct gcgggcgaca ccggccggcg gccagcacta 901 tctccacatc cgcccggcac ccagcgacaa cctgcccctg gtggacctca tcgaacaccc 961 agaccctatc tttgacccca aggaaaagga tctgaacgag acgctgctgc gctcgctgct 1021 cgggggccac tacgacccag gcttcatggc cacctcgccc cccgaggacc ggcccggcgg 1081 gggcgggggt gcagctgggg gcgcggagga cctggcggag ctggaccagc tgctgcggca 1141 gcggccgtcg ggggccatgc cgagcgagat caaagggcta gagttctccg agggcttggc 1201 ccagggcaag aagcagcgcc taagcaagaa gctgcggagg aagttacaga tgtggctgtg 1261 gtcgcagaca ttctgccccg tgctgtacgc gtggaacgac ctgggcagcc gcttttggcc 1321 gcgctacgtg aaggtgggca gctgcttcag taagcgctcg tgctccgtgc ccgagggcat 1381 ggtgtgcaag ccgtccaagt ccgtgcacct cacggtgctg cggtggcgct gtcagcggcg 1441 cgggggccag cgctgcggct ggattcccat ccagtacccc atcatttccg agtgcaagtg 1501 ctcgtgctag aactcggggg ccccctgccc gcacccggac acttgatcct cgagctc // LOCUS HSU32672 1535 bp DNA PRI 05-JUN-1996 DEFINITION Human orphan receptor GPR10 (GPR10) gene, complete cds. ACCESSION U32672 NID g1002738 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1535) AUTHORS Marchese,A., Heiber,M., Nguyen,T., Heng,H.H.Q., Saldivia,V.R., Cheng,R., Murphy,P.M., Tsui,L.-C., Shi,X., George,S.R., O'Dowd,B.F. and Docherty,J.M. TITLE Cloning and chromosomal mapping of three novel genes, GPR9, GPR10, and GPR14, encoding receptors related to interleukin 8, neuropeptide Y, and somatostatin receptors JOURNAL Genomics 29 (2), 335-344 (1995) MEDLINE 96115583 REFERENCE 2 (bases 1 to 1535) AUTHORS Marchese,A., Heiber,M., Nguyen,T., Heng,H.H.Q., Saldivia,V.R., Cheng,R., Murphy,P.M., Tsui,L.-C., Shi,X., George,S.R., O'Dowd,B.F. and Docherty,J.M. TITLE Direct Submission JOURNAL Submitted (31-JUL-1995) B.F. O'Dowd, Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1535 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q25.3-q26" gene 308..1417 /gene="GPR10" CDS 308..1417 /gene="GPR10" /note="orphan receptor; G protein-coupled receptor" /codon_start=1 /product="GPR10" /db_xref="PID:g1002739" /translation="MASSTTRGPRVSDLFSGLPPAVTTPANQSAEASAGNGSVAGADA PAVTPFQSLQLVHQLKGLIVLLYSVVVVVGLVGNCLLVLVIARVPRLHNVTNFLIGNL ALSDVLMCTACVPLTLAYAFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRY VVLVHPLRRASRCASAYAVLAIWALSAVLALPPAVHTYHVELKPHDVRLCEEFWGSQE RQRQLYAWGLLLVTYLLPLLVILLSYVRVSVKLRNRVVPGCVTQSQADWDRARRRRTF CLLVVVVVVFAVCWLPLHVFNLLRDLDPHAIDPYAFGLVQLLCHWLAMSSACYNPFIY AWLHDSFREELRKLLVAWPRKIAPHGQNMTVSVVI" BASE COUNT 228 a 493 c 479 g 335 t ORIGIN 1 gcgagtgctt tcccgctctc caaaccccac tcccaggtcg gatcgcgctc ctgagtctgc 61 ctgcgtggac tgcgaggacc gtaaatagag gcggaagcgt ttaaataaac cgtatgttct 121 aaccgtgctt aggtttattt tacagaggtg ataggacaac actttttgct acttttgctg 181 ttgttctgtg gccggttatt cccaagggag gactcaggcg atgggggagg ggcgcggctg 241 gtgtaggagg tcggatggag tatggcaagg gaagtgacgg actttgatta cctttgaaca 301 ggtggccatg gcctcatcga ccactcgggg ccccagggtt tctgacttat tttctgggct 361 gccgccggcg gtcacaactc ccgccaacca gagcgcagag gcctcggcgg gcaacgggtc 421 ggtggctggc gcggacgctc cagccgtcac gcccttccag agcctgcagc tggtgcatca 481 gctgaagggg ctgatcgtgc tgctctacag cgtcgtggtg gtcgtggggc tggtgggcaa 541 ctgcctgctg gtgctggtga tcgcgcgggt gccgcggctg cacaacgtga cgaacttcct 601 catcggcaac ctggccttgt ccgacgtgct catgtgcacc gcctgcgtgc cgctcacgct 661 ggcctatgcc ttcgagccac gcggctgggt gttcggcggc ggcctgtgcc acctggtctt 721 cttcctgcag ccggtcaccg tctatgtgtc ggtgttcacg ctcaccacca tcgcagtgga 781 ccgctacgtc gtgctggtgc acccgctgag gcgcgcatct cgctgcgcct cagcctacgc 841 tgtgctggcc atctgggcgc tgtccgcggt gctggcgctg ccgcccgccg tgcacaccta 901 tcacgtggag ctcaagccgc acgacgtgcg cctctgcgag gagttctggg gctcccagga 961 gcgccagcgc cagctctacg cctgggggct gctgctggtc acctacctgc tccctctgct 1021 ggtcatcctc ctgtcttacg tccgggtgtc agtgaagctc cgcaaccgcg tggtgccggg 1081 ctgcgtgacc cagagccagg ccgactggga ccgcgctcgg cgccggcgca ccttctgctt 1141 gctggtggtg gtcgtggtgg tgttcgccgt ctgctggctg ccgctgcacg tcttcaacct 1201 gctgcgggac ctcgaccccc acgccatcga cccttacgcc tttgggctgg tgcagctgct 1261 ctgccactgg ctcgccatga gttcggcctg ctacaacccc ttcatctacg cctggctgca 1321 cgacagcttc cgcgaggagc tgcgcaaact gttggtcgct tggccccgca agatagcccc 1381 ccatggccag aatatgaccg tcagcgtggt catctgatgc cacttagcca ggccttggtc 1441 aaggagctcc acttcaactg gcctcctagg gcaccactcg aggtcaatct ggtgcttatt 1501 cttcagcacc agagctagct aagccaacat agggc // LOCUS HSU33447 1900 bp DNA PRI 08-OCT-1996 DEFINITION Human putative G-protein-coupled receptor (GPR17) gene, complete cds. ACCESSION U33447 NID g992699 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1900) AUTHORS Raport,C.J., Schweickart,V.L., Chantry,D., Eddy,R.L. Jr., Shows,T.B., Godiska,R. and Gray,P.W. TITLE New members of the chemokine receptor gene family JOURNAL J. Leukoc. Biol. 59 (1), 18-23 (1996) MEDLINE 96145150 REFERENCE 2 (bases 1 to 1900) AUTHORS Godiska,R. and Gray,P.W. TITLE Direct Submission JOURNAL Submitted (08-AUG-1995) Ronald Godiska, ICOS Corporation, 22021 20th Avenue SE, Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..1900 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" gene 700..1719 /gene="GPR17" CDS 700..1719 /gene="GPR17" /note="R12" /codon_start=1 /product="putative G-protein-coupled receptor" /db_xref="PID:g992700" /translation="MNGLEVAPPGLITNFSLATAEQCGQETPLENMLFASFYLLDFIL ALVGNTLALWLFIRDHKSGTPANVFLMHLAVADLSCVLVLPTRLVYHFSGNHWPFGEI ACRLTGFLFYLNMYASIYFLTCISADRFLAIVHPVKSLKLRRPLYAHLACAFLWVVVA VAMAPLLVSPQTVQTNHTVVCLQLYREKASHHALVSLAVAFTFPFITTVTCYLLIIRS LRQGLRVEKRLKTKAVRMIAIVLAIFLVCFVPYHVNRSVYVLHYRSHGASCATQRILA LANRITSCLTSLNGALDPIMYFFVAEKFRHALCNLLCGKRLKGPPPSFEGKTNESSLS AKSEL" BASE COUNT 381 a 640 c 507 g 372 t ORIGIN 1 gatccagaaa gcccccaaga gagatgctga aactctcagg tgggtaaaaa gagtagacct 61 ctgacgtccc agggtacagc ccttgctgcc atcctggggg caccctccta agtgccaggg 121 gcaagccatg gtcaggggaa gcagaaagcg gtgacacccc ggccactgca cctgtgggca 181 ggtgggtcag ggagggtcca ggcactcagg atgaacagaa ctcacctgcc aaggcttggg 241 ctgaggagga gctggaatcc tggagacaca ctgcccccgc ccctcaccac ccctgtcact 301 cagacagcac acctcagagg cagaacagaa aacccagagc ctcacccagg caaggctcac 361 gtcccattcc ccgccatggc actgacccgg tcctcccagc tctgaggagc ctcagatctc 421 ctgggtggca ggggtgcagc tgcatagcgc cgaaattcca agccctggtt ctgcgtttgc 481 cttgtgctga agttcagaat gcctctgacg ctcacgcaca ccaaatggac aaggaggtcc 541 cctcagcagc cccgtgggcg gtgctgagct tgaaagtggg aggttctgaa ggcattggag 601 gcctgacttc tggacttcag agagcgtgaa gctgcctaga tcgcaagctc attgtgaact 661 gtttgcttgt tccctccagg ctctgactcc agccaaagca tgaatggcct tgaagtggct 721 cccccaggtc tgatcaccaa cttctccctg gccacggcag agcaatgtgg ccaggagacg 781 ccactggaga acatgctgtt cgcctccttc taccttctgg attttatcct ggctttagtt 841 ggcaataccc tggctctgtg gcttttcatc cgagaccaca agtccgggac cccggccaac 901 gtgttcctga tgcatctggc cgtggccgac ttgtcgtgcg tgctggtcct gcccacccgc 961 ctggtctacc acttctctgg gaaccactgg ccatttgggg aaatcgcatg ccgtctcacc 1021 ggcttcctct tctacctcaa catgtacgcc agcatctact tcctcacctg catcagcgcc 1081 gaccgtttcc tggccattgt gcacccggtc aagtccctca agctccgcag gcccctctac 1141 gcacacctgg cctgtgcctt cctgtgggtg gtggtggctg tggccatggc cccgctgctg 1201 gtgagcccac agaccgtgca gaccaaccac acggtggtct gcctgcagct gtaccgggag 1261 aaggcctccc accatgccct ggtgtccctg gcagtggcct tcaccttccc gttcatcacc 1321 acggtcacct gctacctgct gatcatccgc agcctgcggc agggcctgcg tgtggagaag 1381 cgcctcaaga ccaaggcagt gcgcatgatc gccatagtgc tggccatctt cctggtctgc 1441 ttcgtgccct accacgtcaa ccgctccgtc tacgtgctgc actaccgcag ccatggggcc 1501 tcctgcgcca cccagcgcat cctggccctg gcaaaccgca tcacctcctg cctcaccagc 1561 ctcaacgggg cactcgaccc catcatgtat ttcttcgtgg ctgagaagtt ccgccacgcc 1621 ctgtgcaact tgctctgtgg caaaaggctc aagggcccgc cccccagctt cgaagggaaa 1681 accaacgaga gctcgctgag tgccaagtca gagctgtgag cggggggcgc cgtccagcgc 1741 gagcgcagac tgtttaggac tcagcagacc cagcaagagg catctgccct ttccccagcc 1801 acctccccgg caagcaacct gaaatctcag cagatgccca ccatttctct agatcgccta 1861 gtctcaaccc ataaaaagga agaactgaca aaggggatcc // LOCUS HSU33448 2097 bp DNA PRI 08-OCT-1996 DEFINITION Human putative G-protein-coupled receptor (GPR16) gene, complete cds. ACCESSION U33448 NID g1613770 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2097) AUTHORS Raport,C.J., Schweickart,V.L., Chantry,D., Eddy,R.L. Jr., Shows,T.B., Godiska,R. and Gray,P.W. TITLE New members of the chemokine receptor gene family JOURNAL J. Leukoc. Biol. 59 (1), 18-23 (1996) MEDLINE 96145150 REFERENCE 2 (bases 1 to 2097) AUTHORS Godiska,R. and Gray,P.W. TITLE Direct Submission JOURNAL Submitted (08-AUG-1995) Ronald Godiska, ICOS Corporation, 22021 20th Avenue SE, Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..2097 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" gene 550..1683 /gene="GPR16" CDS 550..1683 /gene="GPR16" /note="R2" /codon_start=1 /product="putative G-protein-coupled receptor" /db_xref="PID:g1613771" /translation="MASGNPWSSTLMRVSALTLQVLPTAMNTTSSAAPPSLGVEFISL LAIILLSVALAVGLPGNSFVVWSILKRMQKRSVTALMVLNLALADLAVLLTAPFFLHF LAQGTWSFGLAGCRLCHYVCGVSMYASVLLITAMSLDRSLAVARPFVSQKLRTKAMAR RVLAGIWVLSFLLATPVLAYRTVVPWKTNMSLCFPRYPSEGHRAFHLIFEAVTGFLLP FLAVVASYSDIGRRLQARRFRRSRRTGRLVVLIILTFAAFWLPYHVVNLAEAGRALAG QAAGLGLVGKRLSLARNVLIALAFLSSSVNPVLYACAGGGLLRSAGVGFVAKLLEGTG SEASSTRRGGSLGQTARSGPAALEPGPSESLTASSPLKLNELN" BASE COUNT 370 a 592 c 673 g 462 t ORIGIN 1 gtttcctggg agcggccgtg gttgtggtgg tggtgggaca gtgtgcagcc atgaaagaag 61 gtgcaaagga atctccaaag aaagcctgac cagcgtaaaa agttgggagg ctttgtcctt 121 gtcacttgtc cactaaactc ctcccctccc tgttatcctg gttgaccctg ggcatctctg 181 gggacagtag gcaggtgatt gggaaagtta atgggattga ggggctgagg gctggcaggg 241 ggcaaaaaga ctggcctttc aaggggtgca gcattggtag gaactctgtt tggttctggg 301 ctttagggtc tcctaaggga ggagactgaa aaggtctgga aatgctgctg ctgctgtggt 361 cactgtatat tttgcaattg ggtctgtgga caggaagggg ccgcatgacc cagttaggaa 421 actagtcttt gtactcaacc agatcccttt aagttgtcag tctgcagcga tgggggcagt 481 atatttcagg gggacctctg atgctgctga ccctggagat agactagagt tctcagccta 541 ggtgtgtcca tggcgtcagg aaacccttgg tcctctactc tcatgcgtgt gtccgccctc 601 actctccagg tcctcccgac ggccatgaac actacatctt ctgcagcacc cccctcacta 661 ggtgtagagt tcatctctct gctggctatc atcctgctgt cagtggcgct ggctgtgggg 721 cttcccggca acagctttgt ggtgtggagt atcctgaaaa ggatgcagaa gcgctctgtc 781 actgccctga tggtgctgaa cctggccctg gccgacctgg ccgtattgct cactgctccc 841 tttttccttc acttcctggc ccaaggcacc tggagttttg gactggctgg ttgccgcctg 901 tgtcactatg tctgcggagt cagcatgtac gccagcgtcc tgcttatcac ggccatgagt 961 ctagaccgct cactggcggt ggcccgcccc tttgtgtccc agaagctacg caccaaggcg 1021 atggcccggc gggtgctggc aggcatctgg gtgttgtcct ttctgctggc cacacccgtc 1081 ctcgcgtacc gcacagtagt gccctggaaa acgaacatga gcctgtgctt cccgcggtac 1141 cccagcgaag ggcaccgggc cttccatcta atcttcgagg ctgtcacggg cttcctgctg 1201 cccttcctgg ctgtggtggc cagctactcg gacatagggc gtcggctaca ggcccggcgc 1261 ttccgccgca gccgccgcac cggccgcctg gtggtgctca tcatcctgac cttcgccgcc 1321 ttctggctgc cctaccacgt ggtgaacctg gctgaggcgg gccgcgcgct ggccggccag 1381 gccgccgggt tagggctcgt ggggaagcgg ctgagcctgg cccgcaacgt gctcatcgca 1441 ctcgccttcc tgagcagcag cgtgaacccc gtgctgtacg cgtgcgccgg cggcggcctg 1501 ctgcgctcgg cgggcgtggg cttcgtcgcc aagctgctgg agggcacggg ctccgaggcg 1561 tccagcacgc gccgcggggg cagcctgggc cagaccgcta ggagcggccc cgccgctctg 1621 gagcccggcc cttccgagag cctcactgcc tccagccctc tcaagttaaa cgaactgaac 1681 taggcctggt ggaaggaggc gcactttcct cctggcagaa tcgtagctct gagccagttc 1741 agtacctgga ggaggagcag gggcgtggag ggcgtggagg gcgtgggagc gtgggaggcg 1801 ggagtggagt ggaagaagag ggagagatgg agcaaagtga gggccgagtg agagcgtgct 1861 ccagcctggc tcccacaggc agctttaacc attaaaactg aagtctgaaa tttggtcaac 1921 cttgtgagtg gggtacatgt gctgtgggta tcggggtgct cgtgggcgcc ctggtggggc 1981 ccctctcggt agttgagagt cacgtccttt agttccccat gatttacaat tttggaaggg 2041 acacaaagaa acatagactg cccccatccc agatgattcc gagtacatag tctgcag // LOCUS HSU34070 3318 bp DNA PRI 22-OCT-1995 DEFINITION Human CCAAT/enhancer binding protein alpha gene, complete cds. ACCESSION U34070 NID g1041732 KEYWORDS Transcription factor; DNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3318) AUTHORS Antonson,P. and Xanthopoulos,K.G. TITLE Molecular cloning, sequence, and expression patterns of the human gene encoding CCAAT/enhancer binding protein alpha (C/EBP alpha) JOURNAL Biochem. Biophys. Res. Commun. 215 (1), 106-113 (1995) MEDLINE 96003748 REFERENCE 2 (bases 1 to 3318) AUTHORS Antonson,P. and Xanthopoulos,K.G. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) Per Antonson, Department of Bioscience at Novum, Karolinska Institute, Huddinge S-141 57, Sweden FEATURES Location/Qualifiers source 1..3318 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.1" /chromosome="19" /tissue_type="umbilical cord" promoter 1..471 TATA_signal 442..448 CDS 592..1668 /note="bZIP transcription factor; C/EBPa" /codon_start=1 /product="CCAAT/enhancer binding protein alpha" /db_xref="PID:g1041733" /translation="MESADFYEAEPRPPMSSHLQSPPHAPSSAAFGFPRGAGPPKPPA PPAAPEPLGGICEHETSIDISAYIDPAAFNDEFLADLFQHSRQQEKAKAAVGPTGGGG GGDFDYPGAPAGPGGAVMPGGAHGPPPGYGCAAAGYLDGRLEPLYERVGAPALRPLVI KQEPREEDEAKQLALAGLFPYQPPPPPPPSHPHPHPPPAHLAAPHLQFQIAHCGQTTM HLQPGHPTPPPTPVPSPHPAPALGAAGLPGPGSALKGLGAAHPDLRASGGTGAGKAKK SVDKNSNEYRVRRERNNIAVRKSRDKAKQRNVETQQKVLELTSDNDRLRKRVEQLSRE LDTLRGIFRQLPESSLVKAMGNCA" polyA_signal 3249..3254 BASE COUNT 566 a 1099 c 1096 g 557 t ORIGIN 1 ctgcagcctc cccgggacgc gggtccggga caggcctggt tctggctttg aaagagaatc 61 cgcgccccag cagctcaaga ccaagactcg ccctccgccc cccaccccta ccccgtgcag 121 cctcgggata ctcctgggct cccggccgtg gctggatacg ggcgcctagg gcaggcagga 181 ggagggggcc cccgctaccg accacgtggg cgcgggggcg acggccgggc cgggggcgga 241 gcttggagcg agcgccgcgg ctctgctggg cgcgctggag gcggtgggcg ttgcgccgcg 301 gcctgcctgg ggagcgcggc gctgtgccgc gtggttcgcc gccccatgcc ggccgcgcgc 361 taggacccag caggcgccgc gccgccgcag cccggggaca gaggccgcct cggactctag 421 ggggcgacgc ggcctgccgg gtataaaagc tgggccggcg cgggccgggc cattcgcgac 481 ccggaggtgc gcgggcgcgg gcgagcaggg tctccgggtg ggcggcggcg acgccccgcg 541 caggctggag gccgccgagg ctcgccatgc cgggagaact ctaactcccc catggagtcg 601 gccgacttct acgaggcgga gccgcggccc ccgatgagca gccacctgca gagccccccg 661 cacgcgccca gcagcgccgc cttcggcttt ccccggggcg cgggcccgcc gaagcctccc 721 gccccacctg ccgccccgga gccgctgggc ggcatctgcg agcacgagac gtccatcgac 781 atcagcgcct acatcgaccc ggccgccttc aacgacgagt tcctggccga cctgttccag 841 cacagccggc agcaggagaa ggccaaggcg gccgtgggcc ccacgggcgg cggcggcggc 901 ggcgactttg actacccggg cgcgcccgcg ggccccggcg gcgccgtcat gcccggggga 961 gcgcacgggc ccccgcccgg ctacggctgc gcggccgccg gctacctgga cggcaggctg 1021 gagcccctgt acgagcgcgt cggggcgccg gcgctgcggc cgctggtgat caagcaggag 1081 ccccgcgagg aggatgaagc caagcagctg gcgctggccg gcctcttccc ttaccagccg 1141 ccgccgccgc cgccgccctc gcacccgcac ccgcacccgc cgcccgcgca cctggccgcc 1201 ccgcacctgc agttccagat cgcgcactgc ggccagacca ccatgcacct gcagcccggt 1261 caccccacgc cgccgcccac gcccgtgccc agcccgcacc ccgcgcccgc gctcggtgcc 1321 gccggcctgc cgggccctgg cagcgcgctc aaggggctgg gcgccgcgca ccccgacctc 1381 cgcgcgagtg gcggcacggg cgcgggcaag gccaagaagt cggtggacaa gaacagcaac 1441 gagtaccggg tgcggcgcga gcgcaacaac atcgcggtgc gcaagagccg cgacaaggcc 1501 aagcagcgca acgtggagac gcagcagaag gtgctggagc tgaccagtga caatgaccgc 1561 ctgcgcaagc gggtggaaca gctgagccgc gaactggaca cgctgcgggg catcttccgc 1621 cagctgccag agagctcctt ggtcaaggcc atgggcaact gcgcgtgagg cgcgcggctg 1681 tgggaccgcc ctgggccagc ctccggcggg gacccaggga gtggtttggg gtcgccggat 1741 ctcgaggctt gcccgagccg tgcgagccag gactaggaga ttccggtgcc tcctgaaagc 1801 ctggcctgct ccgcgtgtcc cctcccttcc tctgcgccgg acttggtgcg tctaagatga 1861 gggggccagg cggtggcttc tccctgcgag gaggggagaa ttcttggggc tgagctggga 1921 gcccggcaac tctagtattt aggataacct tgtgccttgg aaatgcaaac tcaccgctcc 1981 aatgcctact gagtaggggg agcaaatcgt gccttgtcat tttatttgga ggtttcctgc 2041 ctccttcccg aggctacagc agacccccat gagagaagga aggggagcag gcccgtggca 2101 ggaggagggc tcagggagct gagatcccga caagcccgcc agccccagcc gctcctccac 2161 gcctgtcctt agaaaggggt ggaaacatag ggacttgggg cttggaacct aaggttgttc 2221 ccctagttct acatgaaggt ggagggtctc tagttccacg cctctcccac ctccctccgc 2281 acacacccca cccccagcct gctataggct gggcttccct tgggcggaac tcactgcgat 2341 gggggtcacc aggtgaccag tgggagcccc caccccgagt cacaccagaa agctaggtcg 2401 tgggtcagct ctgaggatgt atacccctgg tgggagaggg agacctagag atctggctgt 2461 ggggcgggca tggggggtga agggccactg ggaccctcag ccttgtttgt actgtatgcc 2521 ttcagcattg cctaggaaca cgaagcacga tcagtccatc ccagagggac cggagttatg 2581 acaagctttc caaatatttt gctttatcag ccgatatcaa cacttgtatc tggcctctgt 2641 gccccagcag tgccttgtgc aatgtgaatg tgcgcgtctc tgctaaacca ccattttatt 2701 tgggttttgt tttgttttgg ttttgctcgg atacttgcca aaatgagact ctccgtcggc 2761 agctggggga agggtctgag actccctttc cttttggttt tgggattact tttgatcctg 2821 ggggaccaat gaggtgaggg gggttctcct ttgccctcag ctttccccag cccctccggc 2881 ctgggctgcc cacaaggctt gtcccccaga ggccctggct cctggtcggg aagggaggtg 2941 gcctcccgcc aacgcatcac tggggctggg agcagggaag gacggcttgg ttctcttctt 3001 ttggggagaa cgtagagtct cactctagat gttttatgta ttatatctat aatataaaca 3061 tatcaaagtc aatgtcggtg tctttttaaa accagaaaga agctacttcc aaggttgtct 3121 gtgggccagg tcacatttgt aaataataca gcattttccc tggcggcaat cctgactttc 3181 atgagctctc catccatcct gagcccctct taccctaagg gggtgactta cttcccccag 3241 gcaagacaaa taaatagcag aggacaaggc tccaaatgga gtatgtccag agcctgaagg 3301 cagtctcttg gcgtcagg // LOCUS HSU34623 2817 bp DNA PRI 31-JAN-1996 DEFINITION Human T cell surface glycoprotein CD-6 mRNA, complete cds. ACCESSION U34623 NID g1015963 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2817) AUTHORS Robinson,W.H., Neuman de Vegvar,H.E., Prohaska,S.S., Rhee,J.W. and Parnes,J.R. TITLE Human CD6 possesses a large, alternatively spliced cytoplasmic domain JOURNAL Eur. J. Immunol. 25 (10), 2765-2769 (1995) MEDLINE 96062022 REFERENCE 2 (bases 1 to 2817) AUTHORS Parnes,J.R. TITLE Direct Submission JOURNAL Submitted (22-AUG-1995) Jane Parnes, Medicine, Stanford Universiy, MSLS Bldg. P-306, 1201 Welch Rd., Stanford, CA, 94305, USA COMMENT Aruffo, A. et al. J. Exp. Med. 174,949-952,1991. FEATURES Location/Qualifiers source 1..2817 /organism="Homo sapiens" /note="hybrid CD6-PB1" /db_xref="taxon:9606" CDS 121..2127 /codon_start=1 /product="T cell surface glycoprotein CD6" /db_xref="PID:g1015964" /translation="MWLFFGITGLLTAALSGHPSPAPPDQLNTSSAESELWEPGERLP VRLTNGSSSCSGTVEVRLEASWEPACGALWDSRAAEAVCRALGCGGAEAASQLAPPTP ELPPPPAAGNTSVAANATLAGAPALLCSGAEWRLCEVVEHACRSDGRRARVTCAENRA LRLVDGGGACAGRVEMLEHGEWGSVCDDTWDLEDAHVVCRQLGCGWAVQALPGLHFTP GRGPIHRDQVNCSGAEAYLWDCPGLPGQHYCGHKEDAGVVCSEHQSWRLTGGADRCEG QVEVHFRGVWNTVCDSEWYPSEAKVLCQSLGCGTAVERPKGLPHSLSGRMYYSCNGEE LTLSNCSWRFNNSNLCSQSLAARVLCSASRSLHNLSTPEVPASVQTVTIESSVTVKIE NKESRELMLLIPSIVLGILLLGSLIFIAFILLRIKGKYALPVMVNHQHLPTTIPAGSN SYQPVPITIPKEVFMLPIQVQAPPPEDSDSGSDSDYEHYDFSAQPPVALTTFYNSQRH RVTDEEVQQSRFQMPPLEEGLEELHASHIPTANPGHCITDPPSLGPQYHPRSNSESST SSGEDYCNSPKSKLPPWNPQVFSSERSSFLEQPPNLELAGTQPAFSAGPPADDSSSTS SGEWYQNFQPPPQPPSEEQFGCPGSPSPQPDSTDNDDYDDISAA" BASE COUNT 543 a 930 c 845 g 499 t ORIGIN 1 gaacagcaaa gggtagagca gacctgcgcc aggggcgcac aacggccgtg tccacctccc 61 ggccccaaga tggtgcttcc cacaggcagc cacgcgtagc agccagagac agctccagac 121 atgtggctct tcttcgggat cactggattg ctgacggcag ccctctcagg tcatccatct 181 ccagccccac ctgaccagct caacaccagc agtgcagaga gtgagctctg ggagccaggg 241 gagcggcttc cggtccgtct gacaaacggg agcagcagct gcagcgggac ggtggaggtg 301 cggctcgagg cgtcctggga gcccgcgtgc ggggcgctct gggacagccg cgccgccgag 361 gccgtgtgcc gagcactggg ctgcggcggg gcggaggccg cctctcagct cgccccgccg 421 acccctgagc tgccgccccc gcctgcagcc gggaacacca gcgtagcagc taatgccact 481 ctggccgggg cgcccgccct cctgtgcagc ggcgccgagt ggcggctctg cgaggtggtg 541 gagcacgcgt gccgcagcga cgggaggcgg gcccgtgtca cctgtgcaga gaaccgcgcg 601 ctgcgcctgg tggacggtgg cggcgcctgc gccggccgcg tggagatgct ggagcatggc 661 gagtggggat cagtgtgcga tgacacttgg gacctggagg acgcccacgt ggtgtgcagg 721 caactgggct gcggctgggc agtccaggcc ctgcccggct tgcacttcac gcccggccgc 781 gggcctatcc accgggacca ggtgaactgc tcgggggccg aagcttacct gtgggactgc 841 ccggggctgc caggacagca ctactgcggc cacaaagagg acgcgggcgt ggtgtgctca 901 gagcaccagt cctggcgcct gacagggggc gctgaccgct gcgaggggca ggtggaggta 961 cacttccgag gggtctggaa cacagtgtgt gacagtgagt ggtacccatc ggaggccaag 1021 gtgctctgcc agtccttggg ctgtggaact gcggttgaga ggcccaaggg gctgccccac 1081 tccttgtccg gcaggatgta ctactcatgc aatggggagg agctcaccct ctccaactgc 1141 tcctggcggt tcaacaactc caacctctgc agccagtcgc tggcagccag ggtcctctgc 1201 tcagcttccc ggagtttgca caatctgtcc actcccgaag tccctgcaag tgttcagaca 1261 gtcactatag aatcttctgt gacagtgaaa atagagaaca aggaatctcg ggagctaatg 1321 ctcctcatcc cctccatcgt tctgggaatt ctcctccttg gctccctcat cttcatagcc 1381 ttcatcctct tgagaattaa aggaaaatat gccctccccg taatggtgaa ccaccagcac 1441 ctacccacca ccatcccggc agggagcaat agctatcaac cggtccccat caccatcccc 1501 aaagaagttt tcatgctgcc catccaggtc caggccccgc cccctgagga ctcagactct 1561 ggctcggact cagactatga gcactatgac ttcagcgccc agcctcctgt ggccctgacc 1621 accttctaca attcccagcg gcatcgggtc acagatgagg aggtccagca aagcaggttc 1681 cagatgccac ccttggagga aggacttgaa gagttgcatg cctcccacat cccaactgcc 1741 aaccctggac actgcattac agacccgcca tccctgggcc ctcagtatca cccgaggagc 1801 aacagtgagt cgagcacctc ttcaggggag gattactgca atagtcccaa aagcaagctg 1861 cctccatgga acccccaggt gttttcttca gagaggagtt ccttcctgga gcagccccca 1921 aacttggagc tggccggcac ccagccagcc ttttcagcag ggcccccggc tgatgacagc 1981 tccagcacct catccgggga gtggtaccag aacttccagc caccacccca gcccccttcg 2041 gaggagcagt ttggctgtcc agggtccccc agccctcagc ctgactccac cgacaacgat 2101 gactacgatg acatcagcgc agcctaggcc ggggccagcc gaggctcctg gggtggctct 2161 gaccctctgg cctcctgctc tacctactcc ctttcccctt tcccaccctc ccagctcacc 2221 tccccatgga gctgagaggc ctcccttgga gagatggaag gaaacgttat accttgtacc 2281 cctcggtctc catccatcaa gccaaacctg ctgccacagc cctcccccgg ccccagatag 2341 cagccccagg gaggatgctg cctccaagag gtgtgagccc tctgtctcgg ggatgaacaa 2401 gcagagtctg ggctacctct tgacagctgg tggaggggag ttggggagct ggactggatg 2461 actctggagg ccccttccaa acctcaagtg tccggcgctt tgattgcctg agtttctgac 2521 acttcagggc ccagaggtcc tgcgaggggc agaactggac ccccatgcca gtgctgctgc 2581 aggagggccc atatactagg gtctgctgag ctgttgtcac tgatcggtgg gcgctggggg 2641 ggtagggtag cacaccagct gtcccaggct ttgctccggg tggtaactgc acttgggcag 2701 ggaatatagc cttcctgggc acaactagct gacaatgaca ggttgactgt gtacccccaa 2761 ccaaggagct ggggcccaag gccagtcctg ccccagagac actccaagtc cgccagg // LOCUS HSU34802 1299 bp DNA PRI 02-OCT-1995 DEFINITION Human intrinsic membrane protein MP70 (Cx50) gene, complete cds. ACCESSION U34802 NID g1002998 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1299) AUTHORS Church,R.L., Wang,J.H. and Steele,E. TITLE The human lens intrinsic membrane protein MP70 (Cx50) gene: clonal analysis and chromosome mapping JOURNAL Curr. Eye Res. 14 (3), 215-221 (1995) MEDLINE 95317073 REFERENCE 2 (bases 1 to 1299) AUTHORS Church,R.L., Wang,J.H. and Steele,E. TITLE Direct Submission JOURNAL Submitted (24-AUG-1995) Robert L. Church, Ophthalmology, Emory Eye Center, 1327 Clifton Road, N.E., Atlanta, GA 30322, USA FEATURES Location/Qualifiers source 1..1299 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" gene 1..1299 /gene="Cx50" CDS 1..1299 /gene="Cx50" /codon_start=1 /product="intrinsic membrane protein MP70" /db_xref="PID:g1002999" /translation="MGDWSFLGNILEEVNEHSTVIGRVWLTVLFIFRILILGTAAEFV WGDEQSDFVCNTQQPGCENVCYDEAFPISHIRLWVLQIIFVSTPSLMYVGHAVHYVRM EEKRKSRDEELGQQAGTNGGPDQGSVKKSSGSKGTKKFRLEGTLLRTYICHIIFKTLF EVGFIVGHYFLYGFRILPLYRCSRWPCPNVVDCFVSRPTEKTIFILFMLSVASVSLFL NVMELSHLGLKGIRSALKRPVEQPLGEIPEKSLHSIAVSSIQKAKGYQLLEEEKIVSH YFPLTEVGMVETSPLPAKPFNQFEEKISTGPLGDLSRGYQETLPSYAQVGAQEVEGEG PPAEEGAEPEVGEKKEEAERLTTEEQEKVAVPEGEKVETPGVDKEGEKEEPQSEKVSK QGLPAEKTPSLCPELTTDDARPLSRLSKASSRARSDDLTV" BASE COUNT 294 a 363 c 406 g 236 t ORIGIN 1 atgggcgact ggagtttcct ggggaacatc ttggaggagg tgaatgagca ctccaccgtc 61 atcggcagag tctggctcac cgtgcttttc atcttccgga tcctcatcct tggcacggcc 121 gcagagttcg tgtgggggga tgagcaatcc gacttcgtgt gcaacaccca gcagcctggc 181 tgcgagaacg tctgctacga cgaggccttt cccatctccc acattcgcct ctgggtgctg 241 cagatcatct tcgtctccac cccgtccctg atgtacgtgg ggcacgcggt gcactacgtc 301 cgcatggagg agaagcgcaa aagccgcgac gaggagctgg gccagcaggc ggggactaac 361 ggcggcccgg accagggcag cgtcaagaag agcagcggca gcaaaggcac taagaagttc 421 cggctggagg ggaccctgct gaggacctac atctgccaca tcatcttcaa gaccctcttt 481 gaagtgggct tcatcgtggg ccactacttc ctgtacgggt tccggatcct gcctctgtac 541 cgctgcagcc ggtggccctg ccccaatgtg gtggactgct tcgtgtcccg gcccacggag 601 aaaaccatct tcatcctgtt catgttgtct gtggcctctg tgtccctatt cctcaacgtg 661 atggagttga gccacctggg cctgaagggg atccggtctg ccttgaagag gcctgtagag 721 cagcccctgg gggagattcc tgagaaatcc ctccactcca ttgctgtctc ctccatccag 781 aaagccaagg gctatcagct tctagaagaa gagaaaatcg tttcccacta tttccccttg 841 accgaggttg ggatggtgga gaccagccca ctgcctgcca agcctttcaa tcagttcgag 901 gagaagatca gcacaggacc cctgggggac ttgtcccggg gctaccaaga gacactgcct 961 tcctacgctc aggtgggggc acaagaagtg gagggcgagg ggccgcctgc agaggaggga 1021 gccgaacccg aggtgggaga gaagaaggag gaagcagaga ggctgaccac ggaggagcag 1081 gagaaggtgg ccgtgccaga gggggagaaa gtagagaccc ccggagtgga taaggagggt 1141 gaaaaagaag agccgcagtc ggagaaggtg tcaaagcaag ggctgccagc tgagaagaca 1201 ccttcactct gtccagagct gacaacagat gatgccagac ccctgagcag gctaagcaaa 1261 gccagcagcc gagccaggtc agacgatcta accgtatga // LOCUS HSU34806 1232 bp DNA PRI 19-NOV-1996 DEFINITION Human G protein-coupled receptor (GPR15) gene, complete cds. ACCESSION U34806 NID g1171145 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1232) AUTHORS Heiber,M., Marchese,A., Nguyen,T., Heng,H.H., George,S.R. and O'Dowd,B.F. TITLE A novel human gene encoding a G-protein-coupled receptor (GPR15) is located on chromosome 3 JOURNAL Genomics 32 (3), 462-465 (1996) MEDLINE 96435926 REFERENCE 2 (bases 1 to 1232) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (24-AUG-1995) Brian F. O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1232 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q11.2-q13.1" /chromosome="3" gene 83..1165 /gene="GPR15" CDS 83..1165 /gene="GPR15" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g1171146" /translation="MDPEETSVYLDYYYATSPNSDIRETHSHVPYTSVFLPVFYTAVF LTGVLGNLVLMGALHFKPGSRRLIDIFIINLAASDFIFLVTLPLWVDKEASLGLWRTG SFLCKGSSYMISVNMHCSVLLLTCMSVDRYLAIVWPVVSRKFRRTDCAYVVCASIWFI SCLLGLPTLLSRELTLIDDKPYCAEKKATPIKLIWSLVALIFTFFVPLLSIVTCYCCI ARKLCAHYQQSGKHNKKLKKSIKIIFIVVAAFLVSWLPFNTFKFLAIVSGLRQEHYLP SAILQLGMEVSGPLAFANSCVNPFIYYIFDSYIRRAIVHCLCPCLKNYDFGSSTETSD SHLTKALSTFIHAEDFARRRKRSVSL" BASE COUNT 291 a 302 c 270 g 369 t ORIGIN 1 aagcttcctt gaggtttcta aaatttatac aaaaacatca tatgtaagta aactcaccag 61 atttggcatc tgctctttgg tgatggaccc agaagaaact tcagtttatt tggattatta 121 ctatgctacg agcccaaact ctgacatcag ggagacccac tcccatgttc cttacacctc 181 tgtcttcctt ccagtctttt acacagctgt gttcctgact ggagtgctgg ggaaccttgt 241 tctcatggga gcgttgcatt tcaaacccgg cagccgaaga ctgatcgaca tctttatcat 301 caatctggct gcctctgact tcatttttct tgtcacattg cctctctggg tggataaaga 361 agcatctcta ggactgtgga ggacgggctc cttcctgtgc aaagggagct cctacatgat 421 ctccgtcaat atgcactgca gtgtcctcct gctcacttgc atgagtgttg accgctacct 481 ggccattgtg tggccagtcg tatccaggaa attcagaagg acagactgtg catatgtagt 541 ctgtgccagc atctggttta tctcctgcct gctggggttg cctactcttc tgtccaggga 601 gctcacgctg attgatgata agccatactg tgcagagaaa aaggcaactc caattaaact 661 catatggtcc ctggtggcct taattttcac cttttttgtc cctttgttga gcattgtgac 721 ctgctactgt tgcattgcaa ggaagctgtg tgcccattac cagcaatcag gaaagcacaa 781 caaaaagctg aagaaatcta taaagatcat ctttattgtc gtggcagcct ttcttgtctc 841 ctggctgccc ttcaatactt tcaagttcct ggccattgtc tctgggttgc ggcaagaaca 901 ctatttaccc tcagctattc ttcagcttgg tatggaggtg agtggaccct tggcatttgc 961 caacagctgt gtcaaccctt tcatttacta tatcttcgac agctacatcc gccgggccat 1021 tgtccactgc ttgtgccctt gcctgaaaaa ctatgacttt gggagtagca ctgagacatc 1081 agatagtcac ctcactaagg ctctctccac cttcattcat gcagaagatt ttgccaggag 1141 gaggaagagg tctgtgtcac tctaaaggga actgtgacat ttcaagctct gttggtgggt 1201 ttaggagtta atttttgtca gcaacaaaga aa // LOCUS HSU35146 1993 bp DNA PRI 31-DEC-1996 DEFINITION Human p56 KKIAMRE protein kinase (KKIAMRE), complete cds. ACCESSION U35146 NID g1517819 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1993) AUTHORS Taglienti,C.A., Wysk,M. and Davis,R.J. TITLE Molecular cloning of the epidermal growth factor-stimulated protein kinase p56 KKIAMRE JOURNAL Oncogene 13 (12), 2563-2574 (1996) MEDLINE 97152547 REFERENCE 2 (bases 1 to 1993) AUTHORS Taglienti,C.A. TITLE Direct Submission JOURNAL Submitted (31-AUG-1995) Cherie A. Taglienti, Molecular Medicine, University of Massachusetts Medical School, 373 Plantation Street, Worcester, MA 01605, USA FEATURES Location/Qualifiers source 1..1993 /organism="Homo sapiens" /db_xref="taxon:9606" gene 376..1857 /gene="KKIAMRE" CDS 376..1857 /gene="KKIAMRE" /note="similar to human p42 KKIALRE gene, GenBank Accession Number X66358; these protein kinases have mutually exclusive expression in testis (p56 KKIAMRE) and ovary (p42 KKIALRE)" /codon_start=1 /product="p56 KKIAMRE protein kinase" /db_xref="PID:g1517820" /translation="MEKYENLGLVGEGSYGMVMKCRNKDTGRIVAIKKFLESDDDKMV KKIAMREIKLLKQLRHENLVNLLEVCKKKKRWYLVFEFVDHTILDDLELFPNGLDYQV VQKYLFQIINGIGFCHSHNIIHRDIKPENILVSQSGVVKLCDFGFARTLAAPGEVYTD YVATRWYRAPELLVGDVKYGKAVDVWAIGCLVTEMFMGEPLFPGDSDIDQLYHIMMCL GNLIPRHQELFNKNPVFAGVRLPEIKEREPLERRYPKLSEVVIDLAKKCLHIDPDKRP FCAELLHHDFFQMDGFAERFSQELQLKVQKDARNVSLSKKSQNRKKEKEKDDSLVEER KTLVVQDTNADPKIKDYKLFKIKGSKIDGEKAEKGNRASNASCLHDSRTSHNKIVPST SLKDCSNVSVDHTRNPSVAIPPLTHNLSAVAPSINSGMGTETIPIQGYRVDEKTKKCS IPFVKPNRHSPSGIYNINVTTLVSGPPLSDDSGADLPQMEHQH" BASE COUNT 668 a 371 c 439 g 515 t ORIGIN 1 caggtgttgg tgcctgccgt gaacgcattc tgacctgggc cgtatctgtc tcccaagact 61 ttgtgcctat ggttggggac agagtgaggt cgttgccttg acgacgacag catgcggccc 121 gtggtcctcc taagtgtgag cttgcggcgg accgaggccc acctgcctcc ctgcctgctt 181 cgccctggac tcgtgactgc gtccgcagaa gaaatcacaa cagcgctgga attgctagtt 241 tgctaggcag catcttttgg acctgcgaac catatgcatt tcacctcaaa tttgtttcca 301 agttgaaaac ctttgggtct ttctatgcga acggattgaa gaaacgcaaa aagtttctac 361 ggactttaaa ttaaaatgga aaaatatgaa aacctgggtt tggttggaga agggagttat 421 ggaatggtga tgaagtgtag gaataaagat actggaagaa ttgtggccat aaagaagttc 481 ttagaaagtg acgatgacaa aatggttaaa aagattgcaa tgcgagaaat caagttacta 541 aagcaactta ggcatgaaaa cttggtgaat ctcttggaag tgtgtaagaa aaaaaaacga 601 tggtacctag tctttgaatt tgttgaccac acaattcttg atgacttgga gctctttcca 661 aatggactag actaccaagt agttcaaaag tatttgtttc agattattaa tggaattgga 721 ttttgtcaca gtcacaatat catacacaga gatataaagc cagagaatat attagtctcc 781 cagtctggcg ttgtcaagct atgcgatttt ggatttgcgc gaacattggc agctcctggg 841 gaggtttata ctgattatgt ggcaacccga tggtacagag ctccagaact attggttggt 901 gatgtcaagt atggcaaggc tgttgatgtg tgggccattg gttgtctggt aactgaaatg 961 ttcatggggg aacccctatt tcctggagat tctgatattg atcagctata tcatattatg 1021 atgtgtttag gtaatctaat tccaaggcat caggagcttt ttaataaaaa tcctgtgttt 1081 gctggagtaa ggttgcctga aatcaaggaa agagaacctc ttgaaagacg ctatcctaag 1141 ctctctgaag tggtgataga tttagcaaag aaatgcttac atattgaccc cgacaaaaga 1201 cccttctgtg ctgagctcct acaccatgat ttctttcaaa tggatggatt tgctgagagg 1261 ttttcccaag aactacagtt aaaagtacag aaagatgcca gaaatgtttc tttatctaaa 1321 aaatcccaaa acagaaagaa ggaaaaagaa aaagatgatt ccttagttga agaaagaaaa 1381 acacttgtgg tacaggatac caatgctgat cccaaaatta aggattataa actatttaaa 1441 ataaaaggct caaaaattga tggagaaaaa gctgaaaaag gcaatagagc ttcaaatgcc 1501 agctgtctcc atgacagtag gacaagccac aacaaaatag tgccttcaac aagcctcaaa 1561 gactgcagca atgtcagcgt ggaccacaca aggaatccaa gcgtggcaat tcccccactt 1621 acacacaatc tttctgcagt tgctcccagc attaattctg gaatggggac tgagactata 1681 ccaattcagg gttacagagt ggatgagaaa actaagaagt gttctattcc atttgttaaa 1741 ccgaacagac attccccatc aggcatttat aacattaatg tgaccacatt agtatcagga 1801 cctcccctgt cagatgattc aggggctgat ttgcctcaaa tggaacacca gcactgagaa 1861 ccattttggt tctgaactgg atgatgctct tgcacttgag atgacatctt cttgcagcaa 1921 gaaaaaaaaa aaaaaaaaaa aaaaaaaaac aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1981 aaaaaaaaaa aaa // LOCUS HSU35232 1180 bp DNA PRI 15-NOV-1995 DEFINITION Human neuropeptide Y4 receptor protein gene, complete cds. ACCESSION U35232 NID g1063629 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1180) AUTHORS Bard,J.A., Walker,M.W., Branchek,T.A. and Weinshank,R.L. TITLE Cloning and functional expression of a human Y4 subtype receptor for pancreatic polypeptide, neuropeptide Y, and peptide YY JOURNAL J. Biol. Chem. 270 (45), 26762-26765 (1995) MEDLINE 96070761 REFERENCE 2 (bases 1 to 1180) AUTHORS Bard,J.A. TITLE Direct Submission JOURNAL Submitted (31-AUG-1995) Jonathan A. Bard, Molecular Biology, Synaptic Pharmaceutical Corporation, 215 College Rd., Paramus, NJ 07652, USA FEATURES Location/Qualifiers source 1..1180 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hp25a" /tissue_type="placenta" 5'UTR 1..27 CDS 28..1155 /codon_start=1 /product="neuropeptide Y4 receptor protein" /db_xref="PID:g1063630" /translation="MNTSHLLALLLPKSPQGENRSKPLGTPYNFSEHCQDSVDVMVFI VTSYSIETVVGVLGNLCLMCVTVRQKEKANVTNLLIANLAFSDFLMCLLCQPLTAVYT IMDYWIFGETLCKMSAFIQCMSVTVSILSLVLVALERHQLIINPTGWKPSISQAYLGI VLIWVIACVLSLPFLANSILENVFHKNHSKALEFLADKVVCTESWPLAHHRTIYTTFL LLFQYCLPLGFILVCYARIYRRLQRQGRVFHKGTYSLRAGHMKQVNVVLVVMVVAFAV LWLPLHVFNSLEDWHHEAIPICHGNLIFLVCHLLAMASTCVNPFIYGFLNTNFKKEIK ALVLTCQQSAPLEESEHLPLSTVHTEVSKGSLRLSGRSNPI" 3'UTR 1156..1180 BASE COUNT 234 a 382 c 285 g 279 t ORIGIN 1 gagtcctgga atcttttcac atccactatg aacacctctc acctcctggc cttgctgctc 61 ccaaaatctc cacaaggtga aaacagaagc aaacccctgg gcaccccata caacttctct 121 gaacattgcc aggattccgt ggacgtgatg gtcttcatcg tcacttccta cagcattgag 181 actgtcgtgg gggtcctggg taacctctgc ctgatgtgtg tgactgtgag gcagaaggag 241 aaagccaacg tgaccaacct gcttatcgcc aacctggcct tctctgactt cctcatgtgc 301 ctcctctgcc agccgctgac cgccgtctac accatcatgg actactggat ctttggagag 361 accctctgca agatgtcggc cttcatccag tgcatgtcgg tgacggtctc catcctctcg 421 ctcgtcctcg tggccctgga gaggcatcag ctcatcatca acccaacagg ctggaagccc 481 agcatctcac aggcctacct ggggattgtg ctcatctggg tcattgcctg tgtcctctcc 541 ctgcccttcc tggccaacag catcctggag aatgtcttcc acaagaacca ctccaaggct 601 ctggagttcc tggcagataa ggtggtctgt accgagtcct ggccactggc tcaccaccgc 661 accatctaca ccaccttcct gctcctcttc cagtactgcc tcccactggg cttcatcctg 721 gtctgttatg cacgcatcta ccggcgcctg cagaggcagg ggcgcgtgtt tcacaagggc 781 acctacagct tgcgagctgg gcacatgaag caggtcaatg tggtgctggt ggtgatggtg 841 gtggcctttg ccgtgctctg gctgcctctg catgtgttca acagcctgga agactggcac 901 catgaggcca tccccatctg ccacgggaac ctcatcttct tagtgtgcca cttgcttgcc 961 atggcctcca cctgcgtcaa cccattcatc tatggctttc tcaacaccaa cttcaagaag 1021 gagatcaagg ccctggtgct gacttgccag cagagcgccc ccctggagga gtcggagcat 1081 ctgcccctgt ccacagtaca tacggaagtc tccaaagggt ccctgaggct aagtggcagg 1141 tccaatccca tttaaccagg tctaggtctt ctccctgcca // LOCUS HSU38178 3339 bp DNA PRI 08-OCT-1996 DEFINITION Human cyclic nucleotide phophodiesterase (HSPDE3B) mRNA, complete cds. ACCESSION U38178 NID g1145301 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3339) AUTHORS Miki,T., Taira,M., Hockman,S., Shimada,F., Lieman,J., Napolitano,M., Ward,D., Taira,M., Makino,H. and Manganiello,V.C. TITLE Characterization of the cDNA and gene encoding human PDE3B, the cGIP1 isoform of the human cyclic GMP-inhibited cyclic nucleotide phosphodiesterase family JOURNAL Genomics 36 (3), 476-485 (1996) MEDLINE 97038690 REFERENCE 2 (bases 1 to 3339) AUTHORS Taira,M., Hockman,S. and Manganiello,V. TITLE Direct Submission JOURNAL Submitted (10-OCT-1995) Vincent Manganiello, PCCMB, NHLBI, NIH, 9000 Rockville Pike, Bldg. 10 5N307, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3339 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fat" gene 1..3339 /gene="HSPDE3B" CDS 1..3339 /gene="HSPDE3B" /codon_start=1 /product="cyclic nucleotide phophodiesterase" /db_xref="PID:g1145302" /translation="MRRDERDAKAMRSLQPPDGAGSPPESLRNGYVKSCVSPLRQDPP RGFFFHLCRFCNVELRPPPASPQQPRRCSPFCRARLSLGDLAAFVLALLLGAEPESWA AGAAWLRTLLSVCSHSLSPLFSIACAFFFLTCFLTRTKRGPGPGRSCGSWWLLALPAC CYLGDFLVWQWWSWPWGDGDAGSAAPHTPPEAAAGRLLLVLSCVGLLLTLAHPLRLRH CVLVLLLASFVWWVSFTSLGSLPSALRPLLSGLVGGAGCLLALGLDHFFQIREAPLHP RLSSAAEEKVPVIRPRRRSSCVSLGETAASYYGSCKIFRRPSLPCISREQMILWDWDL KQWYKPHYQNSGGGNGVDLSVLNEARNMVSDLLTDPSLPPQVISSLRSISSLMGAFSG SCRPKINPLTPFPGFYPCSEIEDPAEKGDRKLNKGLNRNSLPTPQLRRSSGTSGLLPV EQSSRWDRNNGKRPHQEFGISSQGCYLNGPFNSNLLTIPKQRSSSVSLTHHVGLRRAG VLSSLSPVNSSNHGPVSTGSLTNRSPIEFPDTADFLNKPSVILQRSLGNAPNTPDFYQ QLRNSDSNLCNSCGHQMLKYVSTSESDGTDCCSGKSGEEENIFSKESFKLMETQQEEE TEKKDSRKLFQEGDKWLTEEAQSEQQTNIEQEVSLDLILVEEYDSLIEKMSNWNFPIF ELVEKMGEKSGRILSQVMYTLFQDTGLLEIFKIPTQQFMNYFRALENGYRDIPYHNRI HATDVLHAVWYLTTRPVPGLQQIHNGCGTGNETDSDGRINHGRIAYISSKSCSNPDES YGCLSSNIPALELMALYVAAAMHDYDHPGRTNAFLVATNAPQAVLYNDRSVLENHHAA SAWNLYLSRPEYNFLLHLDHVEFKRFRFLVIEAILATDLKKHFDFLAEFNAKANDVNS NGIEWSNENDRLLVCQVCIKLADINGPAKVRDLHLKWTEGIVNEFYEQGDEEANLGLP ISPFMDRSSPQLAKLQESFITHIVGPLCNSYDAAGLLPGQWLEAEEDNDTESGDDEDG EELDTEDEEMENNLNPKPPRRKSRRRIFCQLMHHLTENHKIWKEIVEEEEKCKADGNK LQVENSSLPQADEIQVIEEADEEE" BASE COUNT 882 a 782 c 838 g 837 t ORIGIN 1 atgaggaggg acgagcgaga cgccaaagcc atgcggtccc tgcagccgcc ggatggggcc 61 ggctcgcccc ccgagagtct gaggaacggc tacgtgaaga gctgcgtgag ccccttgcgg 121 caggaccctc cgcgcggctt cttcttccac ctctgccgct tctgcaacgt ggagctgcgg 181 ccgccgccgg cctctcccca gcagccgcgg cgctgctccc ccttctgccg ggcgcgcctc 241 tcgctgggcg acctggctgc ctttgtcctc gccctgctgc tgggagcgga acccgagagc 301 tgggctgccg gggccgcctg gctgcggacg ctgctgagcg tgtgttcgca cagcttgagc 361 cccctcttca gcatcgcctg tgccttcttc ttcctcacct gcttcctcac ccggaccaag 421 cggggacccg gcccgggccg gagctgcggc tcctggtggc tgctggcgct gcccgcctgc 481 tgttacctgg gggacttctt ggtgtggcag tggtggtctt ggccttgggg ggatggcgac 541 gcagggtccg cggccccgca cacgcccccg gaggcggcag cgggcaggtt gctgctggtg 601 ctgagctgcg tagggctgct gctgacgctc gcgcacccgc tgcggctccg gcactgcgtt 661 ctggtgctgc tcctggccag cttcgtctgg tgggtctcct tcaccagcct cgggtcgctg 721 ccctccgccc tcaggccgct gctctccggc ctggtggggg gcgctggctg cctgctggcc 781 ctggggttgg atcacttctt tcaaatcagg gaagcgcctc ttcatcctcg actgtccagt 841 gccgccgaag aaaaagtgcc tgtgatccga ccccggagga ggtccagctg cgtgtcgtta 901 ggagaaactg cagccagtta ctatggcagt tgcaaaatat tcaggagacc gtcgttgcct 961 tgtatttcca gagaacagat gattctttgg gattgggact taaaacaatg gtataagcct 1021 cattatcaaa attctggagg tggaaatgga gttgatcttt cagtgctaaa tgaggctcgc 1081 aatatggtgt cagatcttct gactgatcca agccttccac cacaagtcat ttcctctcta 1141 cggagtatta gtagcttaat gggtgctttc tcaggttcct gtaggccaaa gattaatcct 1201 ctcacaccat ttcctggatt ttacccctgt tctgaaatag aggacccagc tgagaaaggg 1261 gatagaaaac ttaacaaggg actaaatagg aatagtttgc caactccaca gctgaggaga 1321 agctcaggaa cttcaggatt gctacctgtt gaacagtctt caaggtggga tcgtaataat 1381 ggcaaaaggc ctcaccaaga atttggcatt tcaagtcaag gatgctatct aaatgggcct 1441 tttaattcaa atctactgac tatcccgaag caaaggtcat cttctgtatc actgactcac 1501 catgtaggtc tcagaagagc tggtgttttg tccagtctga gtcctgtgaa ttcttccaac 1561 catggaccag tgtctactgg ctctctaact aatcgatcac ccatagaatt tcctgatact 1621 gctgattttc ttaataagcc aagcgttatc ttgcagagat ctctgggcaa tgcacctaat 1681 actccagatt tttatcagca acttagaaat tctgatagca atctgtgtaa cagctgtgga 1741 catcaaatgc tgaaatatgt ttcaacatct gaatcagatg gtacagattg ctgcagtgga 1801 aaatcaggtg aagaagaaaa cattttctcg aaagaatcat tcaaacttat ggaaactcaa 1861 caagaagagg aaacagagaa gaaagacagc agaaaattat ttcaggaagg tgataagtgg 1921 ctaacagaag aggcacagag tgaacagcaa acaaatattg aacaggaagt atcactggac 1981 ctgattttag tagaagagta tgactcatta atagaaaaga tgagcaactg gaattttcca 2041 atttttgaac ttgtagaaaa gatgggagag aaatcaggaa ggattctcag tcaggttatg 2101 tataccttat ttcaagacac tggtttattg gaaatattta aaattcccac tcaacaattt 2161 atgaactatt ttcgtgcatt agaaaatggc tatcgagaca ttccttatca caatcgtata 2221 catgccacag atgtgctaca tgcagtttgg tatctgacaa cacggccagt tcctggctta 2281 cagcagatcc acaatggttg tggaacagga aatgaaacag attctgatgg tagaattaac 2341 catgggcgaa ttgcttatat ttcttcgaag agctgctcta atcctgatga gagttatggc 2401 tgcctgtctt caaacattcc tgcattagaa ttgatggctc tatacgtggc agctgccatg 2461 catgattatg atcacccagg gaggacaaat gcatttctag tggctacaaa tgcccctcag 2521 gcagttttat acaatgacag atctgttctg gaaaatcatc atgctgcgtc agcttggaat 2581 ctatatcttt ctcgcccaga atacaacttc cttcttcatc ttgatcatgt ggaattcaag 2641 cgctttcgtt ttttagtcat tgaagcaatc cttgctacgg atcttaaaaa gcattttgat 2701 tttctcgcag aattcaatgc caaggcaaat gatgtaaata gtaatggcat agaatggagt 2761 aatgaaaatg atcgcctctt ggtatgccag gtgtgcatca aactggcaga tataaatggc 2821 ccagcaaaag ttcgagactt gcatttgaaa tggacagaag gcattgtcaa tgaattttat 2881 gagcagggag atgaagaagc aaatcttggt ctgcccatca gtccattcat ggatcgttct 2941 tctcctcaac tagcaaaact ccaagaatct tttatcaccc acatagtggg tcccctgtgt 3001 aactcctatg atgctgctgg tttgctacca ggtcagtggt tagaagcaga agaggataat 3061 gatactgaaa gtggtgatga tgaagacggt gaagaattag atacagaaga tgaagaaatg 3121 gaaaacaatc taaatccaaa accaccaaga aggaaaagca gacggcgaat attttgtcag 3181 ctaatgcacc acctcactga aaaccacaag atatggaagg aaatcgtaga ggaagaagaa 3241 aaatgtaaag ctgatgggaa taaactgcag gtggagaatt cctccttacc tcaagcagat 3301 gagattcagg taattgaaga ggcagatgaa gaggaatag // LOCUS HSU40223 1651 bp DNA PRI 19-JAN-1996 DEFINITION Human uridine nucleotide receptor (UNR) gene, complete cds. ACCESSION U40223 NID g1117912 KEYWORDS G protein-coupled receptor; purinoceptor; PCR; intronless; UTP. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1651) AUTHORS Nguyen,T., Erb,L., Weisman,G.A., Marchese,A., Heng,H.H., Garrad,R.C., George,S.R., Turner,J.T. and O'Dowd,B.F. TITLE Cloning, expression, and chromosomal localization of the human uridine nucleotide receptor gene JOURNAL J. Biol. Chem. 270 (52), 30845-30848 (1995) MEDLINE 96125054 REFERENCE 2 (bases 1 to 1651) AUTHORS O'Dowd,B.F., Nguyen,T., Marchese,A. and George,S.R. TITLE Direct Submission JOURNAL Submitted (07-NOV-1995) Brian F. O'Dowd, Department of Pharmacology, University of Toronto, 8 Taddle Creek Road, Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1651 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq13" gene 391..1488 /gene="UNR" CDS 391..1488 /gene="UNR" /note="intronless coding region; Purinoceptor" /codon_start=1 /product="uridine nucleotide receptor" /db_xref="PID:g1117913" /translation="MASTESSLLRSLGLSPGPGSSEVELDCWFDEDFKFILLPVSYAV VFVLGLGLNAPTLWLFIFRLRPWDATATYMFHLALSDTLYVVSLPTLIYYYAAHNHWP FGTEICKFVRFLFYWNLYCSVLFLTCISVHRYLGICHPLRALRWGRPRLAGLLCLAVW LVVAGCLVPNLFFVTTSTKGTTVLCHDTTRPEEFDHYVHFSSAVMGLLFGVPCLVTLV CYGLMARRLYQPLPGAAQSSSRLRSLRTIAVVLTVFAVCFVPFHITRTIYYLARLLEA DCRVLNIVNVVYKVTRPLASANSCLDPVLYLLTGDKYRRQLRQLCGGGKPQPRTAASS LALVSLPEDSSCRWAATPQDSSCSTPRADRL" BASE COUNT 290 a 522 c 429 g 410 t ORIGIN 1 tcctcttcca ggatatagct gtgatgacga gtcagaagac acttggtctg gtatcttccc 61 acttgatagt gctgggaggc ctccaccctc ttcagccagc caggctctta gggacagagt 121 gagctgcaga gtcagtacaa cccaaataca cgggctgcct gcctgagccc cagcactgcc 181 tgctgcccac caacttccca agctggacca agggaggctt gggtaggggc caggctagcc 241 tgagtgcacc cagatgcgct tctgtcagct ctccctagtg cttcaaccac tgctctccct 301 gctctacttt ttttgctcca gctcagggat gggggtgggc agggaaatcc tgccaccctc 361 acttctcccc ttcccatctc caggggggcc atggccagta cagagtcctc cctgttgaga 421 tccctaggcc tcagcccagg tcctggcagc agtgaggtgg agctggactg ttggtttgat 481 gaggatttca agttcatcct gctgcctgtg agctatgcag ttgtctttgt gctgggcttg 541 ggccttaacg ccccaaccct atggctcttc atcttccgcc tccgaccctg ggatgcaacg 601 gccacctaca tgttccacct ggcattgtca gacaccttgt atgtcgtgtc gctgcccacc 661 ctcatctact attatgcagc ccacaaccac tggccctttg gcactgagat ctgcaagttc 721 gtccgctttc ttttctattg gaacctctac tgcagtgtcc ttttcctcac ctgcatcagc 781 gtgcaccgct acctgggcat ctgccaccca cttcgggcac tacgctgggg ccgccctcgc 841 ctcgcaggcc ttctctgcct ggcagtttgg ttggtcgtag ccggctgcct cgtgcccaac 901 ctgttctttg tcacaaccag caccaaaggg accaccgtcc tgtgccatga caccactcgg 961 cctgaagagt ttgaccacta tgtgcacttc agctcggcgg tcatggggct gctctttggc 1021 gtgccctgcc tggtcactct tgtttgctat ggactcatgg ctcgtcgcct gtatcagccc 1081 ttgccaggcg ctgcacagtc gtcttctcgc ctccgatctc tccgcaccat agctgtggtg 1141 ctgactgtct ttgctgtctg cttcgtgcct ttccacatca cccgcaccat ttactacctg 1201 gccaggctgt tggaagctga ctgccgagta ctgaacattg tcaacgtggt ctataaagtg 1261 actcggcccc tggccagtgc caacagctgc ctggatcctg tgctctactt gctcactggg 1321 gacaaatatc gacgtcagct ccgtcagctc tgtggtggtg gcaagcccca gccccgcacg 1381 gctgcctctt ccctggcact agtgtccctg cctgaggata gcagctgcag gtgggcggcc 1441 accccccagg acagtagctg ctctactcct agggcagata gattgtaaca cgggaagccg 1501 ggaagtgaga gaaaagggga tgagtgcagg gcagaggtga gggaacccaa tagtgatacc 1561 tggtaaggtg cttcttccct cttttcccag ggctcctgga gagaagccct caccctgagg 1621 ttgcatttat tgatttatat catgggtgac c // LOCUS HSU40574 1693 bp DNA PRI 08-OCT-1997 DEFINITION Homo sapiens glutaredoxin (GLRX) gene, complete cds. ACCESSION U40574 NID g1172130 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Park,J.B. and Levine,M. TITLE The human glutaredoxin gene: determination of its organization, transcription start point, and promoter analysis JOURNAL Gene 197 (1-2), 189-193 (1997) MEDLINE 97473512 REFERENCE 2 (bases 1 to 1693) AUTHORS Park,J.B. TITLE Direct Submission JOURNAL Submitted (13-NOV-1995) Jae B. Park, NIDDK, NIH, Bldg. 8, Rm. 419, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1693 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ggrx3" /cell_type="neutrophil" gene 1365..1685 /gene="GLRX" CDS 1365..1685 /gene="GLRX" /codon_start=1 /product="glutaredoxin" /db_xref="PID:g1172131" /translation="MAQEFVNCKIQPGKVVVFIKPTCPYCRRAQEILSQLPIKQGLLE FVDITATNHTNEIQDYLQQLTGARTVPRVFIGKDCIGGCSDLVSLQQSGELLTRLKQI GALQ" BASE COUNT 475 a 394 c 379 g 445 t ORIGIN 1 cttctcattt gccattttat tctagtaacg atgagtcaca gtaaaaagct gcttctggag 61 tggattttct tcccttccct tgatcgcagt atgcttggat tcctcttacg tggcagtcgt 121 ggtgtccagc tctgccttgc cccgcctctt tgaagatact ctggtttcac cataaaactt 181 cctggggaaa ggatgattca cgtttaaagg acgatagttt ttccttgatt tgtttattta 241 agctaccata cactagaggg ggccggaact agacccagga gtagaggact ttgctttgcg 301 ggataaggag aatgggcacc ctgggaagag cgaaagattt cctactgtta caacccgtgt 361 cttataacac ttgaagtgaa tgtggtgtct gagttacaac actacttccc ctcacaaaca 421 gttaaaagca gagggtcaca aacataggct caaccagaaa aaaagattct catttcatct 481 gtagttttgt atcaaggatt gtgttggaat ttgcctccat attaaactgc tgttcaaggg 541 ttaccaactc aaatggttat ggggtgaagc cggtgtctca ggcatgggcc tgtcgttggg 601 acctgtggca aaccaaagcc aaagccacta cttgccctat acaaaggggg caacagctgt 661 cagccccagc agcttccact aaaaggtgca agctctgtat tgcccgacct gacttttcag 721 agagaagttt taagtccgaa ttgttttgtg agacctccag atttttaaat actgagaatg 781 aattccatta aaaagataca caacaacaaa cacccaccat acagtccaag caaaatgctg 841 tgagagcaac cttcatgagg agtgcaaagt cacagtttca atttcacttc ttctaaagag 901 atttccaagt gttctgtttc ttaagtgatt ctgtttgcat tttaaaatac aactgtatct 961 cctagaggga aaacaagaat ttgctctaag aagtctctgc ctcctcagcc cctttgtgag 1021 taagcacttt atgcttcccc agaggtgact aaactctgat cattgccaat gggcaggcac 1081 tccccaaatg tccaaggaca acaaagatac ccagagtgtc tttcatagct accaatgatt 1141 aaatagcaag tattgcattc ctgggcattg ctaactagtg aagtatacca gatggaaatg 1201 tcttcgaagc tgtcccttta aaactcgagc aagctaccag gcaaactccg cctccaggga 1261 ggttccttat taaataggag ccaactggct gggtcggggc tcaatagccc caagcaatac 1321 ctgcaactga ggaattcttc ccggggagac cgcagcccat cggcatggct caagagtttg 1381 tgaactgcaa aatccagcct gggaaggtgg ttgtgttcat caagcccacc tgcccgtact 1441 gcaggagggc ccaagagatc ctcagtcaat tgcccatcaa acaagggctt ctggaatttg 1501 tcgatatcac agccaccaac cacactaacg agattcaaga ttatttgcaa cagctcacgg 1561 gagcaagaac ggtgcctcga gtctttattg gtaaagattg tataggcgga tgcagtgatc 1621 tagtctcttt gcaacagagt ggggaactgc tgacgcggct aaagcagatt ggagctctgc 1681 agtaaccaca gat // LOCUS HSU41315 9358 bp DNA PRI 02-MAY-1996 DEFINITION Human ring zinc-finger protein (ZNF127-Xp) gene and 5' flanking sequence. ACCESSION U41315 NID g1304598 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9358) AUTHORS Hendrich,B.D., Longstreet,M., Gustashaw,K., Nicholls,R.D. and Willard,H.F. TITLE An X-linked homologue of the autosomal imprinted gene ZNF127 escapes X chromosome inactivation JOURNAL Unpublished REFERENCE 2 (bases 1 to 9358) AUTHORS Hendrich,B.D., Longstreet,M., Gustashaw,K., Nicholls,R.D. and Willard,H.F. TITLE Direct Submission JOURNAL Submitted (27-NOV-1995) B.D. Hendrich, Genetics, Case Western Reserve University, 2109 Adelbert Road, Cleveland, OH 44106-4955, USA FEATURES Location/Qualifiers source 1..9358 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp11.4" repeat_region 1070..1380 /rpt_family="Alu" misc_feature 3500..3800 /note="CpG island" repeat_region 3935..4295 /rpt_family="Alu" mRNA 4765..8450 /gene="ZNF127-Xp" gene 4765..8450 /gene="ZNF127-Xp" mRNA 4805..8450 /gene="ZNF127-Xp" /note="alternate transcription initiation site" misc_feature 4877..5373 /gene="ZNF127-Xp" /note="CpG Island" CDS 5107..6564 /gene="ZNF127-Xp" /note="ring zinc-finger protein; escapes X chromosome inactivation" /codon_start=1 /product="ZNF127-Xp" /db_xref="PID:g1304599" /translation="MAEAAAPGTTVTTSGAGAAAAEAAETAEAVSPTPIPTVTAPSPR AGGGVGGSDGSDGSGGRGDSGAYDGSGACGGSDACDGSGDSSGDSWTKQVTCRYFKYG ICKEGDNCRYSHDLSDRLCGVVCKYFQRGCCVYGDRCRCEHSKPLKQEEATATELTTK SSLAASSSLSSIVGPLVEMNTNEAESRNSNFATVVAGSEDWANAIEFVPGQPYCGRTV PSCTEAPLQGSVTKEESEEEQTAVETKKQLCPYAAVGQCRYGENCVYLHGDLCDMCGL QVLHPMDAAQRSQHIQACIEAHEKDMEFSFAVQRSKDKVCGICMEVVYEKANPNEHRF GILSNCNHTFCLKCIRKWRSAKEFESRIVKSCPQCRITSNFVIPSEYWVEEKEEKQKL IQKYKEAMSNKACKYFDEGRGSCPFGENCFYKHMYPDGRREEPQRQQVGTSSRNPGQQ RNHFWEFFEEGANSNPFDDEEEAVTFELGEMLLML" misc_feature 5392..5448 /gene="ZNF127-Xp" /note="C3H zinc finger motif (XRYFKYGICKEGDNCRYSH)" misc_feature 5479..5535 /gene="ZNF127-Xp" /note="encodes C3H zinc finger motif (XKYFQRGCCVYGDRCRCEH)" misc_feature 5849..5907 /gene="ZNF127-Xp" /note="encodes C3H zincfFinger motif (XAPMLQWDSADMGRTVCIST)" misc_feature 5917..6000 /gene="ZNF127-Xp" /note="encodes C2H2CH zinc finger motif (XDMCGLQVLHPMDAAQRSQHIQACIEAH)" misc_feature 6050..6216 /gene="ZNF127-Xp" /note="encodes RING zinc finger motif (XVGSAWRWSMRKPTPTSTASGSSPTATTPSVSSAFASGGVLRNL RAGSSSPAHNA)" misc_feature 6319..6381 /gene="ZNF127-Xp" /note="encodes C3H zinc finger motif (XKYFDEGRGSCPFGENCFYKH)" repeat_region 6870..7200 /rpt_family="Alu" repeat_region 7865..8165 /rpt_family="Alu" repeat_region 8415..8555 /rpt_family="Alu" repeat_region 9055..9270 /rpt_family="Alu" BASE COUNT 2229 a 2349 c 2316 g 2464 t ORIGIN 1 aagcttatgg tgcaaccaaa aatctttcag tgttttttcc tactcacact ttatactgac 61 tgaatagaga tcctgatggt tgacactacc aatttcagct ttggagaccc cttcactcag 121 tcatagctgc atgttttata gctgaagcta ttgggtttgt gacccctagc tgacttgacc 181 caggcttgca cagcagcaac cttctttgag aattccactt taattcaggg cctctgctgc 241 cccatcacca agataacatc tcttatgttc tcacctggtt tcaacacaag cgaagttttc 301 cagggcagca gcatctacct aggtttgatg gggtcagctt ctcattctcc agtgggtcac 361 ctttgccagc actgtaaacc caaattcaag gcagtgagag cctgaatact actcaatgcc 421 tcatctacgg tttcccatag gagtttgtct gtttcaggga aatctccctg aaaggcccac 481 agctccctta ttgcagccag gatctattgc aaaaaaaaaa aaccaaaaaa aaaaaaaaac 541 aaacaaaaaa aacctctaag accttctact gtctctccta gtggatctaa ggcagacacg 601 atggaccgca ggaggggctg agccataaga cccagtttac gttgccattc ttccttgaca 661 agcattacta gccgcacttc ctcatctccc aagtgaacca gccaaactcc taatgatgtg 721 cctggtttct gcccatagtg gtcatgaatt tccctccttc cacacatctc tgcaattgtt 781 gtctaaaaga gggcagaatc ttttatgtgc accagacaat acctagtact agttgttaca 841 gtggggcaga cctgagggta actccaaatt gttcaggaag ttctacacct gcattagagt 901 tagcaagaac cagggcacca tttgagcagt ggtctttgaa tctgcttgct ggtactcagc 961 ctgcatccac ctcttcaatc cccatcatga acaggcaata aagtgaagaa ctatttatgg 1021 cctagttgag cagataattt ggtcaacata tttcagcgac tttcttcaaa tttgttttct 1081 tttttttttg aaacagggtc tcgctctgtt gcccaggctg gagtgcagtg gcgcgctctt 1141 gctcactgca gcctcgactt cccaggttcg agcgattctc ccacctcagc ctcccgagca 1201 gctgggaaca caggcacaca tcaccatgcc cagctaattt ttgtattttt cgtagaggca 1261 aggttttgcc acgttgccca ggctggtctc aaactcctga gctcaagaaa tctgcttgcc 1321 ttggcctcac aaattgctgg gattacaggt gtgagccatc acgcccggcc ctcaaatttc 1381 attaaattta cccacaatat cctcatcctt ctccttctag aaagccatgt ctctgtcagt 1441 atcccgccct cctcactaaa gctgtcaaga actataaaag gtctgagatt ttgtcctact 1501 caacaagcta acaagtcagc ccactacagt ttcatggaat ctggtagaaa acatgacact 1561 cagagatggg cagtgttttg ctgtagaaag gtcagtcacc agatatcaac attttggcac 1621 cagtttcttg agccccgatt atgtagggga actcctctgt gctggtgccc tgaaaaccgt 1681 gcatatatcc tcataggcca tgtaagcaaa gctcaactgc aagctgatcc atgccgtggc 1741 agaaggctct gtgatcctcc ccaagcatga gaggataggc agtcttgtga ggagctgcca 1801 taccgccatt tctcacttgc ctttctattt ttttttcaca gtagcaagga catttattat 1861 ataaataggt ggataaaata agtgtgcttt aacttctgga tatatggcaa atgctactgt 1921 gatgcagttt ttcagtgcag acatgtgact cttcaatgat ggggatgggc cccctagtgt 1981 gatgtgcagc cactatactc catatggcaa acctcctggt tcaaggcaga gaggagacac 2041 cttattctat agaacaggat ggggcttcca ccctggggac catacaagtg gctcccatag 2101 gcagccacgc ctttctggca agctgtggca tacccaggtg caggctgggg gaggggctgg 2161 cacagcagcc ccagcgtgct ctggcttcag cctgatggag aagcaaaact aggacctccc 2221 tctcccagat gttgctgacc ctatcgccac ctccaccctc taccctgcgc tgggagttcg 2281 ctgtgacgca gcctgttcca tagccatagg gagaccaaag tgaacgatca cagtgcccct 2341 cacctattct aggggacttg gatgctcagt agagcttaat gccatggcat ctgagagcac 2401 ccaccctgtc cctgagggca gctgtgccgg gccagggggc cgcaagggca tgggtggaat 2461 cttggtcact ggcagcactc agccacgtgc ctgaggagct cttcaccttg ttcgtcactg 2521 aagcactgca ggcagtgagg gcactgaagt tccccctggc ctctctgggc tgcgccagga 2581 tgcccgccct ctgcaggggg ttccaaccgc ctgggaccca gtcccaggcc tccagccacc 2641 aggcacgata agctccactg catcggtggc caagtacttg gcagttttgc tcccagcgtg 2701 aatccagccg gcgtctggct cttgagaatc ctgtctccag gacacctggt gcagcaaaga 2761 ggtgaccttt tcctccagtt cctgaatcct accttgagcc tgttcccgat ctgccctttc 2821 tgacatgaag tcatccttgt aagcgagaat ctgctattcc agtatctgca cccattccag 2881 tgcagcatcc cgtgccgtcc tcgaggctgc cagcttctac gtcaattctg cacagtcgtt 2941 tattttctct tccaactatc tgttgagccg ggagatctcc ttcctcatca gctcgggctc 3001 atgggggggg gggggtctgc agccccctga gctgtgcatg gagccccctc atgtattcgt 3061 ccctgctgac atcgtatagc tgccacttgg catagaggtc ttcaacgtga gtcaccttct 3121 gttttaacag tcgattttct tcctcagtaa cactctggac agaggtgtcc ctacctgtat 3181 gttctgactt tcaggacttc tctcccccac gttcctttgt gcatgctgtc gttcatccag 3241 acacttggcc agatgctgac acatgtgggc agtggaggtc agcgtcctcc gcagctggtg 3301 ggtctcattg gccaaggagc tgcacagaac gtcactggcg gggtggggcg gttcgccctc 3361 cgccatgctc ctccgcatca gggcgacttc cttctctcgc tggtgctgtg gctggctcag 3421 cagctgctgc atctccctct ctttcgcttc tagtcgctca gtaagcctct caatttcctg 3481 gcgcatctgg gcctccgcgg cgccgttctc ctgccgctgc agctgctccc ggaagtgcgc 3541 cacctgctcc agcagcgcat ccaccaggga cggcgcagtg tacccctcca gcgcggccag 3601 cctggcgtgg aggcacgcga tgagggagtc gcgggcggcc agctggtcct gcaggcgccg 3661 cagccgctac ccgacctggt agcacaggcc gcagagctct gcagatgcgc gcggggcccc 3721 ctcccggccg tccgaccccg ggtccccgca cgtgactgtg ggcctgcctg ggaggccgca 3781 cggccgccgg caacttcggt gcccggaccc tgtcggctct ctatcttgag ggagtttctc 3841 atcgcctctt ggttccaatg aggtgtggga ctttccagct tgtcccctct cccttaggga 3901 ggctaagctg caggctataa aactacttag cagcagtata gaaatggctc ctctggcgcg 3961 gtggctcatg cctataatcc cagcactttg ggaggccgca gcgggtggat cacttgaggt 4021 caggagttcc agaccagcct ggccaacatg gtgaaaccct gtctctacta aaaatacaaa 4081 aattagcttg gcgtggtgat gtgtgcctgt aatcccagct actcgggaag ctgaggcagg 4141 agaatcactt gaacccggga ggtggaggtt acagtgagcc aagatcatgc cccactacac 4201 tgcagcctgg gcaacagagc gagactgtct cccaaaaaaa aaaaatatat agaaagaaag 4261 aaagagagaa agatatagaa agaaagaaaa atgaaatgac tcctcagcaa cagggtgccc 4321 caacactcat gtttcttgtc atttggcctc ttttgtttgg gtgtcctatt gtgggacaag 4381 gaatgcaggg agctgacaac atgctgttgt ttctttttct gtctatgtaa gtaataaact 4441 gtctgaatct aatcagtgct tgttgtcctc ttaccagcca aatctgtaag cctgctcaac 4501 acattctcac agggcaactc taaagaggac caggtaacac ctgcacgtgc agtggattac 4561 attataggag agaaaccctg gacttaggga atctgaacct ttcataatgg gcagtaatca 4621 ttcctgccct tcactccaga gggaggctat ttttattata gtggacagta agcatgcttg 4681 cccttttctt tggagggaga cactacctct gtctttcaag gatgcactgt atacaaatat 4741 cctggaaaag atagtccgga acaaaggaca gtacagtgcc ttacttgcaa gatgtgcaag 4801 cacacaagac ctatgaagaa ctgtctccca acaaaggata catgcagaag gagaaaaaac 4861 tctcttaaac aaaaagaggc agagccggca gggaccgagc gggtgcctca gtctccttcc 4921 cctcccctcg cctgtcctcg ccatcttctt ctcacagccg gaccggaact atgtgatccc 4981 ggaagttccg ggtcctttgg ccctatatga tcccggaagt tccggggctt ttggacctct 5041 gtgattccgg aagttccggg gcgttccggg gcgttctggg gcctttggag cgtgggataa 5101 gcagtaatgg cggaggctgc agctcccgga acaacagtca caacatcggg agcaggagca 5161 gcagcggcgg aggcggcgga gacggcggaa gcagtctccc cgactccgat ccccacagtc 5221 accgccccgt ccccgagggc gggcggaggg gtcggcggca gcgacggcag cgacggtagt 5281 ggcggcaggg gcgacagtgg cgcgtatgac ggcagcggtg cgtgcggcgg cagcgacgcg 5341 tgcgatggca gcggcgacag cagcggcgac agctggacta aacaggtcac ttgtagatat 5401 tttaagtatg ggatttgtaa ggaaggagat aactgtcgct actcgcatga cctctctgac 5461 cgtctgtgtg gtgtagtgtg caagtatttt cagcgagggt gctgtgttta tggagaccgc 5521 tgcagatgtg aacatagcaa gccattgaaa caggaagaag caactgctac agagctaact 5581 acaaagtcat cccttgctgc ttcctcaagt ctctcatcaa tagttggacc acttgttgaa 5641 atgaatacaa acgaagctga gtcaagaaat tcaaattttg caactgtagt agcaggttca 5701 gaggactggg cgaatgccat tgagtttgtt cctgggcaac cctactgtgg ccgtactgtg 5761 ccttcctgca ctgaagcacc cctgcagggc tcagtgacca aggaagaatc agaggaagag 5821 caaaccgccg tggaaacaaa gaagcagctg tgcccctatg ctgcagtggg acagtgccga 5881 tatggggaga actgtgtgta tctccacgga gatttatgtg acatgtgtgg gctgcaggtc 5941 ctgcatccga tggatgctgc ccagagatca cagcatatac aagcgtgcat tgaagcccat 6001 gagaaagaca tggagttctc atttgctgtg cagcgcagca aggacaaggt gtgtgggatc 6061 tgcatggagg tggtctatga gaaagccaac cccaacgagc accgcttcgg gatcctctcc 6121 aactgcaacc acaccttctg tctcaagtgc attcgcaagt ggaggagtgc taaggaattt 6181 gagagcagga tcgtcaagtc ctgcccacaa tgccgaatca catctaactt tgtcattcca 6241 agtgagtact gggtggagga gaaagaagag aagcagaaac tcattcagaa atacaaggag 6301 gcaatgagca acaaggcatg caagtatttt gatgaaggac gtgggagctg cccatttgga 6361 gagaactgtt tttacaagca tatgtaccct gatggccgca gagaggagcc acagagacag 6421 caagtgggaa catcaagcag aaacccaggc caacaaagga accacttctg ggaattcttt 6481 gaggaaggag cgaacagcaa cccctttgac gatgaagaag aggctgtcac ctttgagctg 6541 ggtgagatgt tgcttatgct ttaggctgca ggtggggacg acaaactgac agactctgaa 6601 aatgagtggg acttgttttg tgatgaagaa ttttatgtct tagatctata gaaaccttgc 6661 gtagtgtgtg agctggtctg ctgaccccag atagcagctg tcccctgtgg tggtgtggca 6721 gtgcctatgt tctctcctag gcagacctat caactccagg tgctgcggtt aagaatatgt 6781 acccagggcc tgtcttgtca acccctcacc tttccccaag gagtgtgttg ttttccctgt 6841 tggaaaaagt tacaaaaata aatgttaaag gttttttttg ttttttgttt tattgagaca 6901 gagtcccact ctgtcaccca ggctggagtg cagtggtgca atcttggctc actgcaacct 6961 ccgtcttccg ggttcaagcc attctcctgt ctcagcctcc caagttgctg ggactacagg 7021 tgcatgctac aatgcccagc taatttttct catttttagt agaaacgggg tttcaccata 7081 ttggtcaggc tggtctcgaa ctcctgacct caggtgatcc acctgccact accttccaaa 7141 gtgctgggat tacaggcgtc agccaccatg cccagcctta aagttagttt tttgtaacag 7201 gcatgagcca ccgcgcctgg ctttaaagtt cgttttttgt aacaggcata agccactgtg 7261 cctggcctta aagttagttt ttttgtaaca cgaatttaac tgttggacag ttagtttaga 7321 tgtgttgcat catctgtttt caaccagatt gtgtttatgg acttttcaca cactaatttt 7381 gaggacccca ggttcaaaag taaaagcagt ggccctgctt tggggtccaa taataggagt 7441 gatgggtgaa gggacctaag ctggccagta gccttctgct ccagacatgg gacgcggatc 7501 cttgaggttt ctggttaaat ctgcacatct gtgtttttat atctgttccc tacccctgta 7561 atccctaccg catgcactag ttctgtagtt ttggtctctc gtttaattgt atgcaagtag 7621 tactactggg taaccagagc caagcgtgaa tgtgttcaga tttctactgt tttgcatgat 7681 aggaaaattg agaaagaata catataaaag atatagaggc ataacatcaa tgcagagttg 7741 gaagttgacc tccacggggt gacatggtgt gtgtgagtgt gggtgtgtga gtgtgggtgt 7801 gtgataagct tctcaaacct gcatagatgc agtattcttg gctttggtag aaagccttgg 7861 tttaaggctg ggcgcggtgg ctcacgcctg taatcccagc actttgggag gctgaggcgg 7921 ccggatcaca aggccaggag atcgagacca tcctgtgaat ggcgaaaccc tgtctctact 7981 aaaaatacaa aaaattagcc gggcgtagtg gcgggcgcct gtagtcccag ctactctgga 8041 ggctgaggcg ggagaatggt gtgaacctgg gaggcggagc ttgcaatgaa ctgagaatgt 8101 gccactggac tcccagcctg ggtgacagag cgagactctg tctcaaacaa aaaaaaacaa 8161 aaacggaaac ccttggttta ggggtttaag tcttatgtgg tggttaagat cttaaaggac 8221 aaagcagtat attggtagtt atcaatatag cagtactagc tctgtttata taaatagaga 8281 aatggagtta gccatagagg ttaaaactac ctggttatcc catatattaa cccaaactgg 8341 gtcttggata cacagttgta tttaatgttt tacgatctag cctttccagt ataggcactt 8401 cctgaaaaac ctttgtcctc atttggggca ttttgttgtt gggtttcgcc atgttggcca 8461 ggctggtctc gaactcctga cctcaggtga tccacctgcc tcagcctccc aaagtgctgg 8521 gattacgggc atgagccacc acacccggcc aggaaaaggt attttatatt tatcattgct 8581 gtgctgttta ttcatttgta taaattcaag ttcccatctg gtattgtttt ccttcagcat 8641 gaataacttc tttgaacatt tcttgtggtt catgtctgct ggcaacaaat tctctcagct 8701 atttatctgg aaaagtcttt atttcacctt catatttcac ctgtattttc tttgtgtaca 8761 gaattctagg ttgacatttg ttcctgagtg ttttaaagat gtcatttcat attcttctgg 8821 tttgcataat ttctgatgag aagtctgcag tttattttct ttctatttct ctctcttttt 8881 gctcctctgc aaagccaatt cctcttgcta tttttaatat tttctcatta tcattgtttc 8941 ataggaattt gattatgatt ttccttggta tgatttcctt aatatttatt ctatttggga 9001 ccattgagct tctcatagct gcatatctgt atgtttataa ttttcatcca gtttaaaaaa 9061 aaagttttga gaaagagtct cactctgtca cccagtctaa agtacagtga cacgatcttg 9121 gctcactgca acctccgcct ccctggttca agtgattctt gtgcctcagt ctcctgagta 9181 gctggaatta caagtgcatg ccatcatgcc ctggtacttt ttgtttttac agtagagagg 9241 gcgttttggc cactactttc gtcctcaagt gtagccagct acctccggtt tcccaggcga 9301 tatcctttgg agagtttggg tgtgcagtgt ctctagtctc atttgcttca tcgagatc // LOCUS HSU42604 1244 bp DNA PRI 02-FEB-1996 DEFINITION Human UDP-glucuronosyltransferase (UGT1H), exon 1. ACCESSION U42604 NID g1174043 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1244) AUTHORS Cho,J.W., Gholami,N. and Owens,I.S. TITLE Extension of UGT1 Locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 1244) AUTHORS Owens,I.S., Cho,J.W. and Gholami,N. TITLE Direct Submission JOURNAL Submitted (08-DEC-1995) Ida S. Owens, HDB, NICHD, 9000 Rockville Pike, Bldg. 10/8D43, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1244 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" TATA_signal 136..151 gene 264..1196 /gene="UGT1H" exon 264..1193 /gene="UGT1H" /number=1 CDS 264..1196 /gene="UGT1H" /codon_start=1 /product="UDP-glucuronosyltransferase" /db_xref="PID:g1174044" /translation="MARTGWTSPIPLCVSLLLTCGFAEAGKLLVVPMDGSHWFTMQSV VEKLILRGHEVVVVMPEVSWQLGKSLNCTVKTYSTSYTLEDLDREFMDFADAQWKAQV RSLFSLFLSSSNGFFNLFFSHCRSLFNDRKLVEYLKESSFDAVFLDPFDACALIVAKY FSLPSVVFARGIGCHYLEEGAQCPAPLSYVPRILLGFSDAMTFKERVRNHIMHLEEHL FCQYFSKNALEIASEILQTPVTAYDLYSHTSIWLLRTDFVLDYPKPVMPNMIFIGGIN CHQGKPLPMVSHLSFSTLGIILALEIKKRFLTEL" intron 1194..1244 /number=1 BASE COUNT 302 a 268 c 274 g 400 t ORIGIN 1 gggcatgatc tgtccaaggc agagactata agctactctt atagtactct tatgagatac 61 atacaagtag gtatctcaaa aaatgatact catgtattcc tgttcttatg agtaaatcat 121 tggcagtgag tgtgattttt ttttttttta tgacaggatc cctacacgcc ctctattggg 181 gtcaggtttt gtgcctgtag ttcttccgcc tacgtatcat agcagttaga atcccagctg 241 ctggctcggg ctgcagttct ctcatggctc gcacagggtg gaccagcccc attcccctat 301 gtgtttctct gctgctgacc tgtggctttg ctgaggcagg gaagctgctg gtagtgccca 361 tggatgggag tcactggttc accatgcagt cggtggtgga gaaacttatc ctcagggggc 421 atgaggtggt tgtagtcatg ccagaggtga gttggcaact gggaaaatca ctgaattgca 481 cagtgaagac ttactcaacc tcatacactc tggaggatct ggaccgggaa ttcatggatt 541 tcgccgatgc tcaatggaaa gcacaagtac gaagtttgtt ttctctattt ctgagttcat 601 ccaatggttt ttttaactta tttttttcgc attgcaggag tttgtttaat gaccgaaaat 661 tagtagaata cttaaaggag agttcttttg atgcggtgtt tcttgatcct tttgatgcct 721 gtgcgttaat tgttgccaaa tatttctccc tcccctctgt ggtcttcgcc aggggaatag 781 gttgccacta tcttgaagaa ggtgcacagt gccctgctcc tctttcctat gtccccagaa 841 ttctcttagg gttctcagat gccatgactt tcaaggagag agtacggaac cacatcatgc 901 acttggagga acatttattt tgccagtatt tttccaaaaa tgccctagaa atagcctctg 961 aaattctcca aacacctgtc acagcatatg atctctacag ccacacatca atttggttgt 1021 tgcgaacaga ctttgttttg gactatccca aacccgtgat gcccaatatg atcttcattg 1081 gtggtatcaa ctgccatcag ggaaagccat tgcctatggt aagtcacctc tcctttagca 1141 cattaggaat aatcttggct ttggaaatta aaaaaagatt ccttactgaa ttgtgatttg 1201 acattttcat ttgttgcatt tcaaatttct ttccagttta caga // LOCUS HSU43177 542 bp DNA PRI 16-OCT-1996 DEFINITION Human urocortin gene, complete cds. ACCESSION U43177 NID g1292909 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 542) AUTHORS Donaldson,C.J., Sutton,S.W., Perrin,M.H., Corrigan,A.Z., Lewis,K.A., Rivier,J.E., Vaughan,J.M. and Vale,W.W. TITLE Cloning and characterization of human urocortin JOURNAL Endocrinology 137 (5), 2167-2170 (1996) MEDLINE 96198824 REMARK Erratum:[Endocrinology 1996;137(9):3896] REFERENCE 2 (bases 1 to 542) AUTHORS Donaldson,C. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) Cynthia Donaldson, Peptide Biology Laboratory, The Salk Institute, 10010 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..542 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" CDS 41..415 /codon_start=1 /product="urocortin" /db_xref="PID:g1292910" /translation="MRQAGRAALLAALLLLVQLCPGSSQRSPEAAGVQDPSLRWSPGA RNQGGGARALLLLLAERFPRRAGPGRLGLGTAGERPRRDNPSLSIDLTFHLLRTLLEL ARTQSQRERAEQNRIIFDSVGK" BASE COUNT 81 a 181 c 195 g 85 t ORIGIN 1 caccctgcgc tgcccctgtg tgtccagggc cggcggcacc atgaggcagg cgggacgcgc 61 agcgctgctg gccgcgctgc tgctcctggt acagctgtgc cctgggagca gccagaggag 121 ccccgaggcg gccggggtcc aggacccgag tctgcgctgg agccccgggg cacggaacca 181 gggtggcggg gcccgcgcgc tcctcttgct gctggcggag cgcttcccgc gccgcgcggg 241 gcccggccga ttgggactcg ggacggcagg cgagcggccg cggcgggaca acccttctct 301 gtccattgac ctcacctttc acctgctgcg gaccctgctg gagctggcgc ggacgcagag 361 ccagcgggag cgcgccgagc agaaccgcat catattcgac tcggtgggca agtgatggcc 421 cggtttgggg ctgcgaaaac gttgacccct ttcccccacc ccagagttgg gatgcggggc 481 agagccacca gggcactgtc tgcgtgacta ttttttaata aaagtactga agacccgttg 541 gc // LOCUS HSU45982 2577 bp DNA PRI 02-APR-1996 DEFINITION Human G protein-coupled receptor GPR-9-6 gene, complete cds. ACCESSION U45982 NID g1245054 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2577) AUTHORS Lautens,L.L., Tiffany,H.L., Gao,J.-L., Modi,W., Murphy,P.M. and Bonner,T.I. TITLE Cloning, Tissue Distribution and Chromosomal Localization of two potential G-Protein-Linked Chemokine Receptors JOURNAL Unpublished REFERENCE 2 (bases 1 to 2577) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (16-JAN-1996) Tom I. Bonner, Lab of Cell Biology, NIMH, Bldg 36, Rm 3A-17, MSC 4090, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..2577 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21.3-22" CDS 58..1131 /note="G protein-coupled receptor" /codon_start=1 /product="GPR-9-6" /db_xref="PID:g1245055" /translation="MADDYGSESTSSMEDYVNFNFTDFYCEKNNVRQFASHFLPPLYW LVFIVGALGNSLVILVYWYCTRVKTMTDMFLLNLAIADLLFLVTLPFWAIAAADQWKF QTFMCKVVNSMYKMNFYSCVLLIMCISVDRYIAIAQAMRAHTWREKRLLYSKMVCFTI WVLAAALCIPEILYSQIKEESGIAICTMVYPSDESTKLKSAVLTLKVILGFFLPFVVM ACCYTIIIHTLIQAKKSSKHKALKVTITVLTVFVLSQFPYNCILLVQTIDAYAMFISN CAVSTNIDICFQVTQTIAFFHSCLNPVLYVFVGERFRRDLVKTLKNLGCISQAQWVSF TRREGSLKLSSMLLETTSGALSL" BASE COUNT 628 a 613 c 574 g 762 t ORIGIN 1 aatattttcc ttgacctaat gccatcttgt gtccccttgc agagccctat tcctaacatg 61 gctgatgact atggctctga atccacatct tccatggaag actacgttaa cttcaacttc 121 actgacttct actgtgagaa aaacaatgtc aggcagtttg cgagccattt cctcccaccc 181 ttgtactggc tcgtgttcat cgtgggtgcc ttgggcaaca gtcttgttat ccttgtctac 241 tggtactgca caagagtgaa gaccatgacc gacatgttcc ttttgaattt ggcaattgct 301 gacctcctct ttcttgtcac tcttcccttc tgggccattg ctgctgctga ccagtggaag 361 ttccagacct tcatgtgcaa ggtggtcaac agcatgtaca agatgaactt ctacagctgt 421 gtgttgctga tcatgtgcat cagcgtggac aggtacattg ccattgccca ggccatgaga 481 gcacatactt ggagggagaa aaggcttttg tacagcaaaa tggtttgctt taccatctgg 541 gtattggcag ctgctctctg catcccagaa atcttataca gccaaatcaa ggaggaatcc 601 ggcattgcta tctgcaccat ggtttaccct agcgatgaga gcaccaaact gaagtcagct 661 gtcttgaccc tgaaggtcat tctggggttc ttccttccct tcgtggtcat ggcttgctgc 721 tataccatca tcattcacac cctgatacaa gccaagaagt cttccaagca caaagcccta 781 aaagtgacca tcactgtcct gaccgtcttt gtcttgtctc agtttcccta caactgcatt 841 ttgttggtgc agaccattga cgcctatgcc atgttcatct ccaactgtgc cgtttccacc 901 aacattgaca tctgcttcca ggtcacccag accatcgcct tcttccacag ttgcctgaac 961 cctgttctct atgtttttgt gggtgagaga ttccgccggg atctcgtgaa aaccctgaag 1021 aacttgggtt gcatcagcca ggcccagtgg gtttcattta caaggagaga gggaagcttg 1081 aagctgtcgt ctatgttgct ggagacaacc tcaggagcac tctccctctg aggggtcttc 1141 tctgaggtgc atggttcttt tggaagaaat gagaaataca tgaaacagtt tccccactga 1201 tgggaccaga gagagtgaaa gagaaaagaa aactcagaaa gggatgaatc tgaactatat 1261 gattacttgt agtcagaatt tgccaaagca aatatttcaa aatcaactga ctagtgcagg 1321 aggctgttga ttggctcttg actgtgatgc ccgcaattct caaaggagga ctaaggaccg 1381 gcactgtgga gcaccctggc tttgccactc gccggagcat caatgccgct gcctctggag 1441 gagcccttgg attttctcca tgcactgtga acttctgtgg cttcagttct catgctgcct 1501 cttccaaaag gggacacaga agcactggct gctgctacag accgcaaaag cagaaagttt 1561 cgtgaaaatg tccatctttg ggaaattttc taccctgctc ttgagcctga taacccatgc 1621 caggtcttat agattcctga tctagaacct ttccaggcaa tctcagacct aatttccttc 1681 tgttctcctt gttctgttct gggccagtga aggtccttgt tctgattttg aaacgatctg 1741 caggtcttgc cagtgaaccc ctggacaact gaccacaccc acaaggcatc caaagtctgt 1801 tggcttccaa tccatttctg tgtcctgctg gaggttttaa cctagacaag gattccgctt 1861 attccttggt atggtgacag tgtctctcca tggcctgagc agggagatta taacagctgg 1921 gttcgcagga gccagccttg gccctgttgt aggcttgttc tgttgagtgg cacttgcttt 1981 gggtccaccg tctgtctgct ccctagaaaa tgggctggtt cttttggccc tcttctttct 2041 gaggcccact ttattctgag gaatacagtg agcagatatg ggcagcagcc aggtagggca 2101 aaggggtgaa gcgcaggcct tgctggaagg ctatttactt ccatgcttct ccttttctta 2161 ctctatagtg gcaacatttt aaaagctttt aacttagaga ttaggctgaa aaaaataagt 2221 aatggaattc acctttgcat cttttgtgtc tttcttatca tgatttggca aaatgcatca 2281 cctttgaaaa tatttcacat attggaaaag tgctttttaa tgtgtatatg aagcattaat 2341 tacttgtcac tttctttacc ctgtctcaat attttaagtg tgtgcaatta aagatcaaat 2401 agatacatta agagtgtgaa ggctggtctg aaggtagtga gctatctcaa tcggattgtt 2461 cacactcagt tacagattga actccttgtt ctacttccct gcttctctct actgcaattg 2521 actagtcttt aaaaaaaagt gtgaagagta agcaataggg ataaggaaat aagatct // LOCUS HSU46025 2898 bp DNA PRI 11-DEC-1996 DEFINITION Human translation initiation factor eIF-3 p110 subunit gene, complete cds. ACCESSION U46025 NID g1718196 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2898) AUTHORS Asano,K., Kinzy,T.G., Merrick,W.C. and Hershey,J.W.B. TITLE Conservation and diversity of eukaryotic translation initiation factor eIF3 JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 2898) AUTHORS Asano,K. and Hershey,J.W.B. TITLE Direct Submission JOURNAL Submitted (12-JAN-1996) Katsura Asano, Biological Chemistry, University of California at Davis, School of Medicine, Building MS1A, Davis, CA 95616, USA FEATURES Location/Qualifiers source 1..2898 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 50..2791 /codon_start=1 /product="translation intiation factor eIF-3 p110 subunit" /db_xref="PID:g1718197" /translation="MSRFFTTGSDSESESSLSGEELVTKPVGGNYGKQPLLLSEDEED TKRVVRSAKDKRFEELTNLIRTIRNAMKIRDVTKCLEEFELLGKAYGKAKSIVDKEGV PRFYIRILADLEDYLNELWEDKEGKKKMNKNNAKALSTLRQKIRKYNRDFESHITSYK QNPEQSADEDAEKNEEDSEGSSDEDEDEDGVSAATFLKKKSEAPSGESRKFLKKMDDE DEDSEDSEDDEDWDTGSTSSDSDSEEEEGKQTALASRFLKKAPTTDEDKKAAEKKRED KAKKKHDRKSKRLDEEEEDNEGGEWERVRGGVPLVKEKPKMFAKGTEITHAVVIKKLN EILQARGKKGTDRAAQIELLQLLVQIAAENNLGEGVIVKIKFNIIASLYDYNPNLATY MKPEMWGKCLDCINELMDILFANPNIFVGENILEESENLHNADQPLRVRGCILTLVER MDEEFTKIMQNTDPHSQEYVEHLKDEAQVCAIIERVQRYLEEKGTTEEVCRIYLLRIL HTYYKFDYKAHQRQLTPPEGSSKSEQDQAENEGEDSAVLMERLCKYIYAKDRTDRIRT CAILCHIYHHALHSRWYQARDLMLMSHLQDNIQHADPPVQILYNRTMVQLGICAFRQG LTKDAHNALLDIQSSGRAKELLGQGLLLRSLQERNQEQEKVERRRQVPFHLHINLELL ECVYLVSAMLLEIPYMAAHESDARRRMISKQFHHQLRVGERQPLLGPPESMREHVVAA SKAMKMGDWKTCHSFIINEKMNGKVWDLFPEADKVRTMLVRKIQEESLRTYLFTYSSV YDSISMETLSDMFELDLPTVHSIISKMIINEELMASLDQPTQTVVMHRTEPTAQQNLA LQLAEKLGSLVENNERVFDHKQGTYGGYFRDQKDGYRKNEGYMRRGGYRQQQSQTAY" polyA_site 2898 /note="47 A nucleotides" BASE COUNT 772 a 762 c 819 g 545 t ORIGIN 1 tgactcgcgg gctcagctgg tccggccgta gcacctccgc gccgtcgcca tgtcgcggtt 61 tttcaccacc ggttcggaca gcgagtccga gtcgtccttg tccggggagg agctcgtcac 121 caaacctgtc ggaggcaact atggcaaaca gccattgttg ctgagcgagg atgaagaaga 181 taccaagaga gttgtccgca gtgccaagga caagaggttt gaggagctga ccaaccttat 241 ccggaccatc cgtaatgcca tgaagattcg tgatgtcacc aagtgcctgg aagagtttga 301 gctcctggga aaagcatatg ggaaggccaa aagcattgtg gacaaagaag gtgtcccccg 361 gttctatatc cgcatcctgg ctgacctaga ggactatctt aatgagcttt gggaagataa 421 ggaagggaag aagaagatga acaagaacaa tgccaaggct ctgagcacct tgcgtcagaa 481 gatccgaaaa tacaaccgtg atttcgagtc ccatatcaca agctacaagc agaaccccga 541 gcagtctgcg gatgaagatg ctgagaaaaa tgaggaggat tcagaaggct cttcagatga 601 ggatgaggat gaggacggag tcagtgctgc aactttcttg aagaagaaat cagaagctcc 661 ttctggggag agtcgcaagt tcctcaaaaa gatggatgat gaagatgagg actcagaaga 721 ttccgaagat gatgaagact gggacacagg ttccacatct tccgattccg actcagagga 781 ggaagaaggg aaacaaaccg cgctggcctc aagatttctt aaaaaggcac ccaccacaga 841 tgaggacaag aaggcagccg agaagaaacg ggaggacaaa gctaagaaga agcacgacag 901 gaaatccaag cgcctggatg aggaggagga ggacaatgaa ggcggggagt gggaaagggt 961 ccggggcgga gtgccgttgg ttaaggagaa gccaaaaatg tttgccaagg gaactgagat 1021 cacccatgct gttgttatca agaaactgaa tgagatccta caggcacgag gcaagaaggg 1081 aactgatcgt gctgcccaga ttgagctgct gcaactgctg gttcagattg cagcggaaaa 1141 caacctggga gagggcgtca ttgtcaagat caagttcaat atcatcgcct ctctctatga 1201 ctacaacccc aacctggcaa cctacatgaa gccagagatg tgggggaagt gcctggactg 1261 catcaatgag ctgatggata tcctgtttgc aaatcccaac atttttgttg gagagaatat 1321 tctggaagag agtgagaacc tgcacaacgc tgaccagcca ctgcgtgtcc gtggctgcat 1381 cctaactctg gtggaacgaa tggatgaaga atttaccaaa ataatgcaaa atactgaccc 1441 tcactcccaa gagtacgtgg agcacttgaa ggatgaggcc caggtgtgtg ccatcatcga 1501 gcgtgtgcag cgctacctgg aggagaaggg cactaccgag gaggtctgcc gcatctacct 1561 gctgcgcatc ctgcacacct actacaagtt tgattacaag gcccatcagc gacagctgac 1621 cccgcctgag ggctcctcaa agtctgagca agaccaggca gaaaatgagg gcgaggactc 1681 ggctgtgttg atggagagac tgtgcaagta catctacgcc aaggaccgca cagaccggat 1741 ccgcacatgt gccatcctct gccacatcta ccaccatgct ctgcactcgc gctggtacca 1801 ggcccgcgac ctcatgctca tgagccactt gcaggacaac attcagcatg cagacccgcc 1861 agtgcagatc ctttacaacc gcaccatggt gcagctgggc atctgtgcct tccgccaagg 1921 cctgaccaag gacgcacaca acgccctgct ggacatccag tcgagtggcc gagccaagga 1981 gcttctgggc cagggcctgc tgctgcgcag cctgcaggag cgcaaccagg agcaggagaa 2041 ggtggagcgg cgccgtcagg tccccttcca cctgcacatc aacctggagc tgctggagtg 2101 tgtctacctg gtgtctgcca tgctcctgga gatcccctac atggccgccc atgagagcga 2161 tgcccgccga cgcatgatca gcaagcagtt ccaccaccag ctgcgcgtgg gcgagcgaca 2221 gcccctgctg ggtccccctg agtccatgcg ggaacatgtg gtcgctgcct ccaaggccat 2281 gaagatgggt gactggaaga cctgtcacag ttttatcatc aatgagaaga tgaatgggaa 2341 agtgtgggac cttttccccg aggctgacaa agtccgcacc atgctggtta ggaagatcca 2401 ggaagagtca ctgaggacct acctcttcac ctacagcagt gtctatgact ccatcagcat 2461 ggagacgctg tcagacatgt ttgagctgga tctgcccact gtgcactcca tcatcagcaa 2521 aatgatcatt aatgaggagc tgatggcctc cctggaccag ccaacacaga cagtggtgat 2581 gcaccgcact gagcccactg cccagcagaa cctggctctg cagctggccg agaagctggg 2641 cagcctggtg gagaacaacg aacgggtgtt tgaccacaag cagggcacct acgggggcta 2701 cttccgagac cagaaggacg gctaccgcaa aaacgagggc tacatgcgcc gcggtggcta 2761 ccgccagcag cagtctcaga cggcctactg agctctccac tctgtttccc gcctgggcca 2821 tccaaccttg aagtcctaaa ccacacctca gtcactaaag gtctgtttaa agttgttctg 2881 gttgattgct tgttgcca // LOCUS HSU48405 1697 bp DNA PRI 25-JUL-1996 DEFINITION Human G protein coupled receptor OGR1 gene, complete cds. ACCESSION U48405 NID g1457938 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1697) AUTHORS Xu,Y. and Casey,G. TITLE Identification of human OGR1, a novel G protein-coupled receptor that maps to chromosome 14 JOURNAL Genomics 35 (2), 397-402 (1996) MEDLINE 96299795 REFERENCE 2 (bases 1 to 1697) AUTHORS Xu,Y. and Casey,G. TITLE Direct Submission JOURNAL Submitted (05-FEB-1996) Yan Xu, Cancer Biology, Cleveland Clinic Foundation, 9500 Euclid Ave., Cleveland, OH 44195, USA FEATURES Location/Qualifiers source 1..1697 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q31" /tissue_type="ovarian carcinoma" CDS 324..1421 /codon_start=1 /product="G protein coupled receptor OGR1" /db_xref="PID:g1457939" /translation="MGNITADNSSMSCTIDHTIHQTLAPVVYVTVLVVGFPANCLSLY FGYLQIKARNELGVYLCNLTVADLFYICSLPFWLQYVLQHDNWSHGDLSCQVCGILLY ENIYISVGFLCCISVDRYLAVAHPFRFHQFRTLKAAVGVSVVIWAKELLTSIYFLMHE EVIEDENQHRVCFEHYPIQAWQRAINYYRFLVGFLFPICLLLASYQGILRAVRRSHGT QKSRKDQIQRLVLSTVVIFLACFLPYHVLLLVRSVWEASCDFAKGVFNAYHFSLLLTS FNCVADPVLYCFVSETTHRDLARLRGACLAFLTCSRTGRAREAYPLGAPEASGKSGAQ GEEPELLTKLHPAFQTPNSPGSGGFPTGRLA" BASE COUNT 287 a 575 c 488 g 347 t ORIGIN 1 actcccaaag tgctgggctt acaggtgtaa gccatcatgt ccagccgttc agatattcta 61 gttgaattgg agttggtggg ctagtacacc ttctaaatta aatgagtaaa ggatttagaa 121 tggtgcctga cacacagtag gtgctacatt catgttagct actattataa acctttcctg 181 cctctgactt tcagggtctt gcccaccacc agcgatgccc agcccttggt agagcttgaa 241 ccaccttcta taaacaggat ggcggtggag agacaggccc agtccctgag cccatgagga 301 gtgtggcccc ttcaggccca aagatgggga acatcactgc agacaactcc tcgatgagct 361 gtaccatcga ccataccatc caccagacgc tggccccggt ggtctatgtt accgtgctgg 421 tggtgggctt cccggccaac tgcctgtccc tctacttcgg ctacctgcag atcaaggccc 481 ggaacgagct gggcgtgtac ctgtgcaacc tgacggtggc cgacctcttc tacatctgct 541 cgctgccctt ctggctgcag tacgtgctgc agcacgacaa ctggtctcac ggcgacctgt 601 cctgccaggt gtgcggcatc ctcctgtacg agaacatcta catcagcgtg ggcttcctct 661 gctgcatctc cgtggaccgc tacctggctg tggcccatcc cttccgcttc caccagttcc 721 ggaccctgaa ggcggccgtc ggcgtcagcg tggtcatctg ggccaaggag ctgctgacca 781 gcatctactt cctgatgcac gaggaggtca tcgaggacga gaaccagcac cgcgtgtgct 841 ttgagcacta ccccatccag gcatggcagc gcgccatcaa ctactaccgc ttcctggtgg 901 gcttcctctt ccccatctgc ctgctgctgg cgtcctacca gggcatcctg cgcgccgtgc 961 gccggagcca cggcacccag aagagccgca aggaccagat ccagcggctg gtgctcagca 1021 ccgtggtcat cttcctggcc tgcttcctgc cctaccacgt gttgctgctg gtgcgcagcg 1081 tctgggaggc cagctgcgac ttcgccaagg gcgttttcaa cgcctaccac ttctccctcc 1141 tgctcaccag cttcaactgc gtcgccgacc ccgtgctcta ctgcttcgtc agcgagacca 1201 cccaccggga cctggcccgc ctccgcgggg cctgcctggc cttcctcacc tgctccagga 1261 ccggccgggc cagggaggcc tacccgctgg gtgcccccga ggcctccggg aaaagcgggg 1321 cccagggtga ggagcccgag ctgttgacca agctccaccc ggccttccag acccctaact 1381 cgccagggtc gggcgggttc cccacgggca ggttggccta gcctgggtcc tccgcgggtg 1441 gctccacgtg aggcctgagc cttcagccca cgggcctcag ggcctgccgc ctcctgcttc 1501 cctcgctgcg gaggcaggga agcccctgta actccggaag cctgctctcg cttgctgagc 1561 ccgctgggac cgccgagggt gggaataagc cccggttggc tcgtgggaat aagccgtgtc 1621 ctctgccgcg gctgcgatgt ggccacgctg gggctgctgg tcgggggaaa acagtgaact 1681 gcgtcccctg gcctgct // LOCUS HSU49727 1689 bp DNA PRI 04-OCT-1996 DEFINITION Human C-C chemokine receptor 3 (CKR-3) gene, complete cds. ACCESSION U49727 NID g1477560 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1689) AUTHORS Ponath,P.D., Qin,S., Post,T.W., Wang,J., Wu,L., Gerard,N.P., Newman,W., Gerard,C. and Mackay,C.R. TITLE Molecular cloning and characterization of a human eotaxin receptor expressed selectively on eosinophils JOURNAL J. Exp. Med. 183 (6), 2437-2448 (1996) MEDLINE 96281895 REFERENCE 2 (bases 1 to 1689) AUTHORS Ponath,P.D. TITLE Direct Submission JOURNAL Submitted (21-FEB-1996) Paul D. Ponath, Molecular Biology, LeukoSite, Inc., 215 First St., Cambridge, MA 02118, USA FEATURES Location/Qualifiers source 1..1689 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1689 /gene="CKR-3" CDS 181..1248 /gene="CKR-3" /codon_start=1 /function="G-protein coupled receptor for eotaxin, RANTES and MCP-3" /product="C-C chemokine receptor 3" /db_xref="PID:g1477561" /translation="MTTSLDTVETFGTTSYYDDVGLLCEKADTRALMAQFVPPLYSLV FTVGLLGNVVVVMILIKYRRLRIMTNIYLLNLAISDLLFLVTLPFWIHYVRGHNWVFG HGMCKLLSGFYHTGLYSEIFFIILLTIDRYLAIVHAVFALRARTVTFGVITSIVTWGL AVLAALPEFIFYETEELFEETLCSALYPEDTVYSWRHFHTLRMTIFCLVLPLLVMAIC YTGIIKTLLRCPSKKKYKAIRLIFVIMAVFFIFWTPYNVAILLSSYQSILFGNDCERT KHLDLVMLVTEVIAYSHCCMNPVIYAFVGERFRKYLRHFFHRHLLMHLGRYIPFLPSE KLERTSSVSPSTAEPELSIVF" BASE COUNT 430 a 416 c 345 g 497 t 1 others ORIGIN 1 aatccttttc ctggcacctc tgatatcctt ttgaaattca tgttaaagaa tccctaggct 61 gctatcacat gtggcatctt tgttgagtac atgaataaat caactggtgt gttttacgga 121 ggatgattat gcttcattgt gggattgtat ttttcttctt ctatcacagg gagaagtgaa 181 atgacaacct cactagatac agttgagacc tttggtacca catcctacta tgatgacgtg 241 ggcctgctct gtgaaaaagc tgataccaga gcactgatgg cccagtttgt gcccccgctg 301 tactccctgg tgttcactgt gggcctcttg ggcaatgtgg tggtggtgat gatcctcata 361 aaatacagga ggctccgaat tatgaccaac atctacctgc tcaacctggc catttcggac 421 ctgctcttcc tcgtcaccct tccattctgg atccactatg tcagggggca taactgggtt 481 tttggccatg gcatgtgtaa gctcctctca gggttttatc acacaggctt gtacagcgag 541 atctttttca taatcctgct gacaatcgac aggtacctgg ccattgtcca tgctgtgttt 601 gcccttcgag cccggactgt cacttttggt gtcatcacca gcatcgtcac ctggggcctg 661 gcagtgctag cagctcttcc tgaatttatc ttctatgaga ctgaagagtt gtttgaagag 721 actctttgca gtgctcttta cccagaggat acagtatata gctggaggca tttccacact 781 ctgagaatga ccatcttctg tctcgttctc cctctgctcg ttatggccat ctgctacaca 841 ggaatcatca aaacgctgct gaggtgcccc agtaaaaaaa agtacaaggc catccggctc 901 atttttgtca tcatggcggt gtttttcatt ttctggacac cctacaatgt ggctatcctt 961 ctctcttcct atcaatccat cttatttgga aatgactgtg agcggacgaa gcatctggac 1021 ctggtcatgc tggtgacaga ggtgatcgcc tactcccact gctgcatgaa cccggtgatc 1081 tacgcctttg ttggagagag gttccggaag tacctgcgcc acttcttcca caggcacttg 1141 ctcatgcacc tgggcagata catcccattc cttcctagtg agaagctgga aagaaccagc 1201 tctgtctctc catccacagc agagccggaa ctctctattg tgttttaggt agatgcagaa 1261 aattgcctaa agaggaagga ccaaggagat naagcaaaca cattaagcct tccacactca 1321 cctctaaaac agtccttcaa accttccagt gcaacactga agctcttaag acactgaaat 1381 atacacacag cagtagcagt agatgcatgt accctaaggt cattaccaca ggccagggct 1441 gggcagcgta ctcatcatca acctaaaaag cagagctttg cttctctctc taaaatgagt 1501 tacctatatt ttaatgcacc tgaatgttag atagttacta tatgccgcta caaaaaggta 1561 aaacttttta tattttatac attaacttca gccagctatt atataaataa aacattttca 1621 cacaatacaa taagttaact attttatttt ctaatgtgcc tagttctttc cctgcttaat 1681 gaaaagctt // LOCUS HSU49974 1301 bp DNA PRI 26-JAN-1998 DEFINITION Human mariner2 transposable element, complete consensus sequence. ACCESSION U49974 NID g1698454 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1301) AUTHORS Robertson,H.M., Zumpano,K.L., Lohe,A.R. and Hartl,D.L. TITLE Reconstructing the ancient mariners of humans JOURNAL Nature Genet. 12 (4), 360-361 (1996) MEDLINE 96224393 REFERENCE 2 (bases 1 to 1301) AUTHORS Robertson,H.M. and Martos,R. TITLE Molecular evolution of the second ancient human mariner transposon, Hsmar2, illustrates patterns of neutral evolution in the human genome lineage JOURNAL Gene 205, 219-228 (1997) REFERENCE 3 (bases 1 to 1301) AUTHORS Robertson,H.M. TITLE Direct Submission JOURNAL Submitted (22-FEB-1996) Hugh M. Robertson, Entomology, University of Illinois at Urbana-Champaign, 505 S. Goodwin, Urbana, IL 61801, USA FEATURES Location/Qualifiers source 1..1301 /organism="Homo sapiens" /transposon="Hsmar2" /note="consensus sequence based on 20 unique long genomic copy sequences and 18 unique cDNAs of variable length; consensus with recognized 25 CpG hypermutable base pairs" /db_xref="taxon:9606" repeat_region 1..31 /rpt_type=inverted CDS 183..1238 /codon_start=1 /product="mariner transposase" /db_xref="PID:g1698455" /translation="MNSAKIEARTNIKFMVKLGWKNGEITDALRKVYGDNAPKKSAVY KWITRFKKGRDDVEDEARSGRPSTSICEEKINLVRALIEEDRRLTAETIANTTDISIG SAYTILTEKLKLSKLSTRWVPKPLRPDQLQTRAELSMEILNKWDQDPEAFLRRIVTGD ETWLYQYDPEDKAQSKQWLPRGGSGPVKAKADWSRAKVMATVFWDAQGILLVDFLEGQ RTITSAYYESVLRKLAKALAEKRPGKLHQRVLLHHDNAPAHSSHQTRAILREFRWEII RHPPYSPDLAPSDFFLFPNLKKSLKGTHFSSVNNVKKTALTWLNSQDPQFFRDGLNGW YHRLQKCLELDGAYVEK" repeat_region 1270..1301 /rpt_type=inverted BASE COUNT 428 a 250 c 272 g 351 t ORIGIN 1 cgaggggtct tcaaaaagtt catggaaaat gcgtattatg aaaaaactat gcatggattt 61 caaaaatttt ttgcaccaaa ataaactcgt actaacttgt tataacatgt ctgaacagga 121 tctagtttga ggcactaaga aggataagac atcagtttga aaagagcccc tatcagagca 181 acatgaattc tgctaaaatt gaagcaagaa caaacatcaa atttatggtg aagcttgggt 241 ggaagaatgg tgaaatcact gatgctttac gaaaagttta tggggacaat gccccaaaga 301 aatcagcagt ttacaaatgg ataactcgtt ttaagaaggg acgagacgat gttgaagatg 361 aagcccgcag cggcagacca tccacatcaa tttgtgagga aaaaattaat cttgttcgtg 421 ccctaattga agaggaccga cgattaacag cagaaacaat agccaacacc acggacatct 481 caattggttc agcttacaca attctgactg aaaaattaaa gttgagcaaa ctttccactc 541 gatgggtgcc aaaaccgttg cgcccagatc agctgcagac aagagcagag ctttcaatgg 601 aaattttaaa caagtgggat caagatcctg aagcatttct tcgaagaatt gtaacaggag 661 atgaaacgtg gctttaccag tacgatcctg aagacaaagc acaatcaaag caatggctac 721 caagaggtgg aagtggtcca gtcaaagcaa aagcggactg gtcaagagca aaggtcatgg 781 caacagtttt ttgggatgct caaggcattt tgcttgttga ctttctggag ggccaaagaa 841 cgataacatc tgcttattat gagagtgttt tgagaaagtt agccaaagct ttagcagaaa 901 aacgcccggg aaagcttcac cagagagtcc ttctccacca cgacaatgct cctgctcatt 961 cctctcatca aacaagggca attttgcgag agtttcgatg ggaaatcatt aggcatccac 1021 cttacagtcc tgatttggct ccttctgact tctttttgtt tcctaatctt aaaaaatctt 1081 taaagggcac ccatttttct tcagttaata atgtaaaaaa gactgcattg acatggttaa 1141 attcccagga ccctcagttc tttagggatg gactaaatgg ctggtatcat cgcttacaaa 1201 agtgtcttga acttgatgga gcttatgttg agaaataaag tttatatttt taatttttat 1261 cttttaattc cattttccac gaactttttg aagtcccctc g // LOCUS HSU50062 2617 bp DNA PRI 25-APR-1996 DEFINITION Human RIP protein kinase gene, complete cds. ACCESSION U50062 NID g1236942 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2617) AUTHORS Hsu,H., Huang,J., Shu,H.B., Baichwal,V. and Goeddel,D.V. TITLE TNF-dependent recruitment of the protein kinase RIP to the TNF receptor-1 signaling complex JOURNAL Immunity 4 (4), 387-396 (1996) MEDLINE 96200892 REFERENCE 2 (bases 1 to 2617) AUTHORS Huang,J., Hsu,H., Baichwal,V.R. and Goeddel,D.V. TITLE Direct Submission JOURNAL Submitted (26-FEB-1996) Vijay R. Baichwal, Biology, Tularik Inc., 270 East Grand Avenue, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..2617 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein endothelium" CDS 1..2016 /note="Ser/Thr protein kinase; protein has death domain sequence at the carboxyl terminus" /codon_start=1 /product="RIP protein kinase" /db_xref="PID:g1236943" /translation="MQPDMSLNVIKMKSSDFLESAELDSGGFGKVSLCFHRTQGLMIM KTVYKGPNCIEHNEALLEEAKMMNRLRHSRVVKLLGVIIEEGKYSLVMEYMEKGNLMH VLKAEMSTPLSVKGRIIWEIIEGMCYLHGKGVIHKDLKPENILVDNDFHIKIADLGLA SFKMWSKLNNEEHNELREVDGTAKKNGGTLYYMAPEHLNDVNAKPTEKSDVYSFAVVL WAIFANKEPYENAICEQQLIMCIKSGNRPDVDDITEYCPREIISLMKLCWEANPEARP TFPGIEEKFRPFYLSQLEESVEEDVKSLKKEYSNENAVVKRMQSLQLDCVAVPSSRSN SATEQPGSLHSSQGLGMGPVEESWFAPSLEHPQEENEPSLQSKLQDEANYHLYGSRMD RQTKQQPRQNVAYNREEERRRRVSHDPFAQQRPYENFQNTEGKGTVYSSAASHGNAVH QPSGLTSQPQVLYQNNGLYSSHGFGTRPLDPGTAGPRVWYRPIPSHMPSLHNIPVPET NYLGNTPTMPFSSLPPTDESIKYTIYNSTGIQIGAYNYMEIGGTSSSLLDSTNTNFKE EPAAKYQAIFDNTTSLTDKHLDPIRENLGKHWKNCARKLGFTQSQIDEIDHDYERDGL KEKVYQMLQKWVMREGIKGATVGKLAQALHQCSRIDLLSSLIYVSQN" BASE COUNT 794 a 586 c 660 g 573 t 4 others ORIGIN 1 atgcaaccag acatgtcctt gaatgtcatt aagatgaaat ccagtgactt cctggagagt 61 gcagaactgg acagcggagg ctttgggaag gtgtctctgt gtttccacag aacccaggga 121 ctcatgatca tgaaaacagt gtacaagggg cccaactgca ttgagcacaa cgaggccctc 181 ttggaggagg cgaagatgat gaacagactg agacacagcc gggtggtgaa gctcctgggc 241 gtcatcatag aggaagggaa gtactccctg gtgatggagt acatggagaa gggcaacctg 301 atgcacgtgc tgaaagccga gatgagtact ccgctttctg taaaaggaag gataatttgg 361 gaaatcattg aaggaatgtg ctacttacat ggaaaaggcg tgatacacaa ggacctgaag 421 cctgaaaata tccttgttga taatgacttc cacattaaga tcgcagacct cggccttgcc 481 tcctttaaga tgtggagcaa actgaataat gaagagcaca atgagctgag ggaagtggac 541 ggcaccgcta agaagaatgg cggcaccctc tactacatgg cgcccgagca cctgaatgac 601 gtcaacgcaa agcccacaga gaagtcggat gtgtacagct ttgctgtagt actctgggcg 661 atatttgcaa ataaggagcc atatgaaaat gctatctgtg agcagcagtt gataatgtgc 721 ataaaatctg ggaacaggcc agatgtggat gacatcactg agtactgccc aagagaaatt 781 atcagtctca tgaagctctg ctgggaagcg aatccggaag ctcggccgac atttcctggc 841 attgaagaaa aatttaggcc tttttattta agtcaattag aagaaagtgt agaagaggac 901 gtgaagagtt taaagaaaga gtattcaaac gaaaatgcag ttgtgaagag aatgcagtct 961 cttcaacttg attgtgtggc agtaccttca agccggtcaa attcagccac agaacagcct 1021 ggttcactgc acagttccca gggacttggg atgggtcctg tggaggagtc ctggtttgct 1081 ccttccctgg agcacccaca agaagagaat gagcccagcc tgcagagtaa actccaagac 1141 gaagccaact accatcttta tggcagccgc atggacaggc agacgaaaca gcagcccaga 1201 cagaatgtgg cttacaacag agaggaggaa aggagacgca gggtctccca tgaccctttt 1261 gcacagcaaa gaccttacga gaattttcag aatacagagg gaaaaggcac tgtttattcc 1321 agtgcagcca gtcatggtaa tgcagtgcac cagccctcag ggctcaccag ccaacctcaa 1381 gtactgtatc agaacaatgg attatatagc tcacatggct ttggaacaag accactggat 1441 ccaggaacag caggtcccag agtttggtac aggccaattc caagtcatat gcctagtctg 1501 cataatatcc cagtgcctga gaccaactat ctaggaaata cacccaccat gccattcagc 1561 tccttgccac caacagatga atctataaaa tataccatat acaatagtac tggcattcag 1621 attggagcct acaattatat ggagattggt gggacgagtt catcactact agacagcaca 1681 aatacgaact tcaaagaaga gccagctgct aagtaccaag ctatctttga taataccact 1741 agtctgacgg ataaacacct ggacccaatc agggaaaatc tgggaaagca ctggaaaaac 1801 tgtgcccgta aactgggctt cacacagtct cagattgatg aaattgacca tgactatgag 1861 cgagatggac tgaaagaaaa ggtttaccag atgctccaaa agtgggtgat gagggaaggc 1921 ataaagggag ccacggtggg gaagctggcc caggcgctcc accagtgttc caggatcgac 1981 cttctgagca gcttgattta cgtcagccag aactaaccct ggatgggcta cggcagctga 2041 agtggacgcc tcacttagcg gataacccca gaaagttggc tgcctcagag cattcagaat 2101 tctgtcctca ctgatagggg ttctgtgtct gcagaaattt ngtttcctgt acttcatagc 2161 tggagaatgg ggaaagaaat ctgcagcaaa ggggtctcac tctgttgcca ggctggtctc 2221 aaacttctgg actcaagtga tcctcccgcc tcggccttcc aaagtgctgg gatatcaggc 2281 actgagccac tgcgcccagt caacaatccg ntctgaggaa agcgtaagca ggaagacctc 2341 ttaatggcat agcaccaata aaaaaatgac tcctagttgt gtttggaaag ggagagaaga 2401 gatgtctgag gaaggtcatg ttctttcagc ttatggcatt tcctagagtt tngttgaagc 2461 aagaagaaaa actcagagaa tataaaatca actttnaaaa ttgtgtgctc tcttcttcac 2521 gtaggctcct gttaaaaaca aagtgcagtc agattctaag ccctgttcag agacttcgcg 2581 gatcacagct gcagctcacc gccacatcac aggatcc // LOCUS HSU50822 1676 bp DNA PRI 02-APR-1996 DEFINITION Human neurogenic helix-loop-helix protein NEUROD (neurod) gene, complete cds. ACCESSION U50822 NID g1245454 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1676) AUTHORS Tamimi,R., Steingrimsson,E., Copeland,N.G., Dyer-Montgomery,K., Lee,J.E., Hernandez,R., Jenkins,N.A. and Tapscott,S.J. TITLE The neurod gene maps to human chromosome 2q32 and mouse chromosome 2 JOURNAL Genomics (1996) In press REFERENCE 2 (bases 1 to 1676) AUTHORS Tapscott,S.J., Tamimi,R., Lee,J.E. and Hernandez,R. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) Stephen J. Tapscott, Clinical Research, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 98104, USA FEATURES Location/Qualifiers source 1..1676 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q32" mRNA 160..1676 /gene="neurod" gene 160..1676 /gene="neurod" CDS 173..1243 /gene="neurod" /note="neurogenic basic helix-loop-helix protein" /codon_start=1 /product="NEUROD" /db_xref="PID:g1245455" /translation="MTKSYSESGLMGEPQPQGPPSWTDECLSSQDEEHEADKKEDDLE AMNAEEDSLRNGGEEEDEDEDLEEEEEEEEEDDDQKPKRRGPKKKKMTKARLERFKLR RMKANARERNRMHGLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEISRSG KSPDLVSFVQTLCKGLSQPTTNLVAGCLQLNPRTFLPEQNQDMPPHLPTASASFPVHP YSYQSPGLPSPPYGTMDSSHVFHVKPPPHAYSAALEPFFESPLTDCTSPSFDGPLSPP LSINGNFSFKHEPSAEFEKNYAFTMHYPAATLAGAQSHGSIFSGTAAPRCEIPIDNIM SFDSHSHHERVMSAQLNAIFHD" BASE COUNT 452 a 435 c 387 g 402 t ORIGIN 1 acatcgatta actttttctc agaggcattc attttgtaat gggcaggtac ttttcgcaag 61 catttgtaca ggtttaggga gtggaagctg aaggcgatct ttcttttgat atagcgtttt 121 tctgcttttc tttctgtttg cctctccctt gttgaatgta ggaaatcgaa acatgaccaa 181 atcgtacagc gagagtgggc tgatgggcga gcctcagccc caaggtcctc caagctggac 241 agacgagtgt ctcagttctc aggacgagga gcacgaggca gacaagaagg aggacgacct 301 cgaagccatg aacgcagagg aggactcact gaggaacggg ggagaggagg aggacgaaga 361 tgaggacctg gaagaggagg aagaagagga agaggaggat gacgatcaaa agcccaagag 421 acgcggcccc aaaaagaaga agatgactaa ggctcgcctg gagcgtttta aattgagacg 481 catgaaggct aacgcccggg agcggaaccg catgcacgga ctgaacgcgg cgctagacaa 541 cctgcgcaag gtggtgcctt gctattctaa gacgcagaag ctgtccaaaa tcgagactct 601 gcgcttggcc aagaactaca tctgggctct gtcggagatc tcgcgctcag gcaaaagccc 661 agacctggtc tccttcgttc agacgctttg caagggctta tcccaaccca ccaccaacct 721 ggttgcgggc tgcctgcaac tcaatcctcg gacttttctg cctgagcaga accaggacat 781 gcccccgcac ctgccgacgg ccagcgcttc cttccctgta cacccctact cctaccagtc 841 gcctgggctg cccagtccgc cttacggtac catggacagc tcccatgtct tccacgttaa 901 gcctccgccg cacgcctaca gcgcagcgct ggagcccttc tttgaaagcc ctctgactga 961 ttgcaccagc ccttcctttg atggacccct cagcccgccg ctcagcatca atggcaactt 1021 ctctttcaaa cacgaaccgt ccgccgagtt tgagaaaaat tatgccttta ccatgcacta 1081 tcctgcagcg acactggcag gggcccaaag ccacggatca atcttctcag gcaccgctgc 1141 ccctcgctgc gagatcccca tagacaatat tatgtccttc gatagccatt cacatcatga 1201 gcgagtcatg agtgcccagc tcaatgccat atttcatgat tagaggcacg ccagtttcac 1261 catttccggg aaacgaaccc actgtgctta cagtgactgt cgtgtttaca aaaggcagcc 1321 ctttggtact actgctgcaa agtgcaaata ctccaagctt caagtgatat atgtatttat 1381 tgtcattact gcctttggaa gaaacagggg atcaaagttc ctgttcacct tatgtattat 1441 tttctataga ctcttctatt ttaaaaaata aaaaaataca gtaaagttta aaaaatacac 1501 cacgaatttg gtgtggctgt attcagatcg tattaattat ctgatcggga taacaaaatc 1561 acaagcaata attaggatct atgcaatttt taaactagta atgggccaat taaaatatat 1621 ataaatatat atttcaacca gcattttact acttgttacc tcccatgctg aattat // LOCUS HSU51224 3341 bp DNA PRI 02-MAY-1996 DEFINITION Human U2AFBPL gene, complete cds. ACCESSION U51224 NID g1293652 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3341) AUTHORS Pearsall,R.S., Shibata,H., Brozowska,A., Yoshino,K., Okuda,K., Plass,C., Chapman,V., deJong,P., Hayashizaki,Y. and Held,W.A. TITLE Absence of imprinting for U2AFBPL, a human homologue of the imprinted mouse gene U2afbp-rs JOURNAL Unpublished REFERENCE 2 (bases 1 to 3341) AUTHORS Pearsall,R.S. TITLE Direct Submission JOURNAL Submitted (12-MAR-1996) R. Scott Pearsall, Molecular and Cellular Biology, Roswell Park Cancer Institute, Elm and Carlton Streets, Buffalo, NY 14263, USA FEATURES Location/Qualifiers source 1..3341 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q22-31" repeat_region 184..452 /note="Alu element" repeat_region 878..1147 /note="Alu element" mRNA 1303..>2853 gene 1414..2853 /gene="U2AFBPL" CDS 1414..2853 /gene="U2AFBPL" /note="similar to mouse u2afbp-rs; similar to U2AF 35 kDa splicing factor" /codon_start=1 /product="U2AFBPL" /db_xref="PID:g1293653" /translation="MAALEKMTFPKKMTFPEKPSHKKYRAALKKEKRKKRRQELARLR DSGLSQEEEEDTFIEEQQLEEEKLLERERERLHEEWLLREQKAQEEFRIKKEKEEAAK KWLEEQERKLKEQWKEQQRKEREEEEQKQQEKKEKEEAVQKMLDQAENDLENSTTWQN PEPPVDFRVMEKDRANCPFYSKTGACRFGDRCSRKHNFPTSSPTLLIKSMFTTFGMEQ CRRDDYDPDASLEYSEEETYQQFLDFYEDVLPEFKNVGKVIQFKVSCNLEPHLRGNVY VQYQSEEECQAALSLFNGRWYAGRQLQCEFCPVTRWKMAICGLFEIQQCPRGKHCNFL HVFRNPNNEFWEANRDIYLSSDQTGSSFGKNSERREKMGHHDHYYSRQRGRRNPSPDH TYKRNGESERKKSSHRGKKSHKRTSKSRERHNSPSRGRNRHRSWDQGRRSQSRRSHRS RSQSSSRCRSRGRRKSGNRDRTVQSPQSK" polyA_signal 2849..2854 misc_feature complement(2902..3161) /note="Alu element" BASE COUNT 1122 a 685 c 769 g 765 t ORIGIN 1 ctgcagtgag ctgtgatacc atcactgcaa tccaggcttg gtgagacagc aagaccctgt 61 ctcaaaaaga caaaaaaaag ttgatatata aaacaaccta gaaattagta agaatacaga 121 ataaagaaca caaaatttaa gtcttttttt ttttttaaat aaaaaagggt ttcactttgt 181 cacccaggca ggagtgcagt ggcacaaata caaaacacta tagcctcaac ttcctgggct 241 caagtgattc tcccgcctca gccccccagg tagctgaaga ccacaggcat gcaggcacgc 301 accatcacgt ctggctaatt tttgtacttt ttgtagagat ggggtttcgc catgttggcc 361 gggctggtct cgaactcctg acctcaagtg atccacccat ctcagcctcc taaagtgctg 421 ggattacagg tgtgagccac tgcacctggc ctaaaatctc actctaaaga tatatacaga 481 tctctcaacc caacaaagag ggattcaaag agttaatata gtatatagag agtaacagga 541 tggactcaaa gaaactaact tccaatcctg gttctgccat ttactagcta agcaaacctt 601 atacaagtta cttaatcact taagtctggt ttttcctcta taaaataggt attaattgaa 661 aattttaaaa tttaccccta taattttgta agaaaataaa attactaagc atagcaagga 721 ctttctctga cccaagaata ctgtgttttc taacaacatc tatgaaacat tactaacagg 781 agaacatgtt agctttcagt aggggaaagc aaatcctaga accaaaaata tttaagcaaa 841 tatttatttt tattttttat ttttgagaca gagtctcact ctgtcaccca ggctggagtg 901 tagtggcaca gtcttggctc actgcaacct ctgcctcctg ggttcaagca gttctcctgc 961 ctcagcctcc caagtagctg ggattacagg cacgtgctac caagcctggc taattttttt 1021 atttttagta gagatgagtt tttgccatgt tggccaggct ggtctcaaac tcctggcctc 1081 aggtgatccg cccatttcgg cctcctaaag tgctgggatt acaggagtga gccaccgcac 1141 ctggccataa gtaaatattt tagaactctc cttttagtac atgaaatgaa attcaaattt 1201 atgatatact taattacaaa aaaagtctaa ctgcaatata aaggagaaac aaaatgaaag 1261 taatttgtaa tatatataag tatgcaaaca aaacaatact agaaaacata aaatattcat 1321 aagtagaatc actgacagtg gcagctatga acaaaatctt ccatatattc ccataggtta 1381 agaatgtctg aggggtcagg cggtgctggc aagatggctg cacttgagaa gatgacgttt 1441 cccaagaaga tgacatttcc agagaaacca agccacaaaa agtacagggc cgccctgaag 1501 aaggagaaac gaaagaaacg tcggcaggaa cttgctcgac tgagagactc aggactctca 1561 caggaggagg aagaggacac ttttattgaa gaacaacaac tagaagaaga gaagctattg 1621 gaaagagaga gggaaagatt acatgaggag tggttgctga gggagcagaa ggcacaagaa 1681 gaattcagaa taaagaagga aaaggaagag gcggctaaaa aatggctaga agaacaagag 1741 agaaagttaa aggaacaatg gaaagaacag cagaggaaag agagagaaga ggaggagcag 1801 aaacaacagg agaagaaaga aaaagaggaa gctgtgcaga agatgctgga tcaggctgaa 1861 aatgatttag aaaatagtac cacatggcaa aacccagaac cacccgtgga tttcagagta 1921 atggagaagg atcgagctaa ttgtcccttc tacagtaaaa caggagcttg cagatttgga 1981 gacagatgtt cacgtaaaca taatttccca acatctagtc ctacccttct tattaagagc 2041 atgtttacaa cgtttggaat ggagcagtgc aggagggatg actatgaccc tgacgcaagc 2101 ctggagtaca gcgaggaaga aacctaccaa cagttcctag atttctatga ggatgtgttg 2161 cccgagttca agaacgtggg gaaagtgatt cagttcaagg tcagctgcaa tttggaacct 2221 cacctgaggg gcaatgtata tgttcagtac cagtcggaag aagaatgcca agcagccctt 2281 tctctgttta acggacgatg gtatgcagga cgacagctgc aatgtgaatt ctgcccagtg 2341 acccggtgga aaatggcgat ttgtggttta tttgaaatac aacaatgtcc aagaggaaaa 2401 cactgcaact ttcttcatgt gttcagaaat cccaacaatg aattctggga agctaataga 2461 gacatctact tgtcttcaga tcagactggc tcctcctttg gcaagaactc cgagaggagg 2521 gagaagatgg gccaccacga ccactactac agcaggcagc ggggaaggag aaaccctagt 2581 ccagaccaca cctacaaaag aaatggggaa tccgagagaa aaaagagtag tcataggggg 2641 aagaaatctc acaaacgcac atcaaagagt cgggagaggc acaattcacc aagcagagga 2701 agaaataggc accgcagctg ggaccagggc cgccggagcc agagccgcag gagccaccgc 2761 agccggagcc aaagttcctc taggtgccga agtcgtggga ggaggaagtc gggtaataga 2821 gacagaactg ttcagagtcc ccaatccaaa taaactagtt ttgttcttaa aaaaaaaaaa 2881 aaaaaaaaaa agaatggacc aggccaggta tagtggctca cacctgtaat cccagcactt 2941 taggaggctg aggtgggtgg atcacttgag gtcaggagtt tgaggccagc ctggccaaca 3001 tggcaaaacc ccatttctac taaaaataca aaaattagcc aggtgtggtg gcaggcgtct 3061 gtaatccaag ctacttggga agctgaggca ggagaatcgc ttgaatctgg aaggcggagg 3121 ttgcagtgag tcgacatcat gccacttcac tccagcctgg gtgacagagc gagactgtgt 3181 ctcaaaaaat agaatttttg gactagtgga actacacagc ctgctttcca tatcagacca 3241 ttcaacccaa tgggattcac tgccataatc tccatgttat gattttaagc caggcctctc 3301 ttcacctttc ctattcctta catttaaaga ctccaaagct t // LOCUS HSU52965 2666 bp DNA PRI 19-JUL-1996 DEFINITION Human putative transcriptional regulator ENX-1 mRNA, complete cds. ACCESSION U52965 NID g1279912 KEYWORDS vertebrate polycomb-group gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2666) AUTHORS Hobert,O., Jallal,B. and Ullrich,A. TITLE Interaction of Vav with ENX-1, a putative transcriptional regulator of homeobox gene expression JOURNAL Mol. Cell. Biol. 16 (6), 3066-3073 (1996) MEDLINE 96220494 REFERENCE 2 (bases 1 to 2666) AUTHORS Hobert,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1996) Department of Molecular Biology, Massachusetts General Hospital, Wellman 8, Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..2666 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /cell_line="ZR-75" CDS 538..2379 /note="putative transcriptional regulator of chromatin activity; human homolog of the Drosophila Polycomb group gene Enhancer of zeste; contains CXC domain and SET domain; interacts with the Vav protooncogene product" /codon_start=1 /product="ENX-1" /db_xref="PID:g1279913" /translation="MGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNA LGQYNDDDDDDDGDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKG TAEELKEKYKELTEQQLPGALPPECTPNIDGPNAKSVQREQSLHSFHTLFCRRCFKYD CFLHPFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPG GRRRGRLPNNSSRPSTPTINVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSE ANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDNFCAIARLIGTKTCRQVY EFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDH PRQPCDSSCPCVIAQNFCEKFCQCSSECQNRFPGCRCKAQCNTKQCPCYLAVRECDPD LCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEY CGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKV MMVNGDHRIGIFAKRAIQTGEELFFDYRYSQADALKYVGIEREMEIP" BASE COUNT 877 a 515 c 584 g 690 t ORIGIN 1 ctagcctata gtaaatacat atatgtatgt gtaggtatat ataattattt tctaacctac 61 agaactgtga gaacctgcaa aatagttaag tgaactgtta ctaatcagag aagaactatg 121 gtgaatgaga gaggaactaa aagatgaaga caatttagtc atcgttcagt atgcatgggg 181 gattggttcc aagacccctt accaaatctg cagatgctca attatcttat ataaagtgat 241 gtagtatttc aaataaccta cacacatcct cttgtatatt ttaaatcatc tctcgattac 301 ttataatacc taacacaatg cctacacgtc atttgcatgg attcaacata gtacttggtg 361 tgtggcaaat tgaagttttg ctttttgaaa ctctatggaa ttttttctga atacttttga 421 tccatgattg gctgaatcca tggacgtgaa actcagggat aaggagggcc acctgcactc 481 tcccagtgtt ctaaaaagaa gatggatcat aaagatcata aaccactgtt ttttaaaatg 541 ggagatgaag ttttagatca ggatggtact ttcattgaag aactaataaa aaattatgat 601 gggaaagtac acggggatag agaatgtggg tttataaatg atgaaatttt tgtggagttg 661 gtgaatgccc ttggtcaata taatgatgat gacgatgatg atgatggaga cgatcctgaa 721 gaaagagaag aaaagcagaa agatctggag gatcaccgag atgataaaga aagccgccca 781 cctcggaaat ttccttctga taaaattttt gaagccattt cctcaatgtt tccagataag 841 ggcacagcag aagaactaaa ggaaaaatat aaagaactca ccgaacagca gctcccaggc 901 gcacttcctc ctgaatgtac ccccaacata gatggaccaa atgctaaatc tgttcagaga 961 gagcaaagct tacactcctt tcatacgctt ttctgtaggc gatgttttaa atatgactgc 1021 ttcctacatc cttttcatgc aacacccaac acttataagc ggaagaacac agaaacagct 1081 ctagacaaca aaccttgtgg accacagtgt taccagcatt tggagggagc aaaggagttt 1141 gctgctgctc tcaccgctga gcggataaag accccaccaa aacgtccagg aggccgcaga 1201 agaggacggc ttcccaataa cagtagcagg cccagcaccc ccaccattaa tgtgctggaa 1261 tcaaaggata cagacagtga tagggaagca gggactgaaa cggggggaga gaacaatgat 1321 aaagaagaag aagagaagaa agatgaaact tcgagctcct ctgaagcaaa ttctcggtgt 1381 caaacaccaa taaagatgaa gccaaatatt gaacctcctg agaatgtgga gtggagtggt 1441 gctgaagcct caatgtttag agtcctcatt ggcacttact atgacaattt ctgtgccatt 1501 gctaggttaa ttgggaccaa aacatgtaga caggtgtatg agtttagagt caaagaatct 1561 agcatcatag ctccagctcc cgctgaggat gtggatactc ctccaaggaa aaagaagagg 1621 aaacaccggt tgtgggctgc acactgcaga aagatacagc tgaaaaagga cggctcctct 1681 aaccatgttt acaactatca accctgtgat catccacggc agccttgtga cagttcgtgc 1741 ccttgtgtga tagcacaaaa tttttgtgaa aagttttgtc aatgtagttc agagtgtcaa 1801 aaccgctttc cgggatgccg ctgcaaagca cagtgcaaca ccaagcagtg cccgtgctac 1861 ctggctgtcc gagagtgtga ccctgacctc tgtcttactt gtggagccgc tgaccattgg 1921 gacagtaaaa atgtgtcctg caagaactgc agtattcagc ggggctccaa aaagcatcta 1981 ttgctggcac catctgacgt ggcaggctgg gggattttta tcaaagatcc tgtgcagaaa 2041 aatgaattca tctcagaata ctgtggagag attatttctc aagatgaagc tgacagaaga 2101 gggaaagtgt atgataaata catgtgcagc tttctgttca acttgaacaa tgattttgtg 2161 gtggatgcaa cccgcaaggg taacaaaatt cgttttgcaa atcattcggt aaatccaaac 2221 tgctatgcaa aagttatgat ggttaacggt gatcacagga taggtatttt tgccaagaga 2281 gccatccaga ctggcgaaga gctgtttttt gattacagat acagccaggc tgatgccctg 2341 aagtatgtcg gcatcgaaag agaaatggaa atcccttgac atctgctacc tcctccccct 2401 cctctgaaac agctgcctta gcttcaggaa cctcgagtac tgtgggcaat ttagaaaaag 2461 aacatgcagt ttgaaattct gaatttgcaa agtactgtaa gaataattta tagtaatgag 2521 tttaaaaatc aactttttat tgccttctca ccagctgcaa agtgttttgt accagtgaat 2581 ttttgcaata atgcagtatg gtacattttt caactttgaa taaagatact tgaacttgaa 2641 aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSU53143 1377 bp DNA PRI 28-JUL-1996 DEFINITION Human inward rectifying K+ channel negative regulator Kir2.2v gene, complete cds. ACCESSION U53143 NID g1465743 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1377) AUTHORS Namba,N., Inagaki,N., Gonoi,T., Seino,Y. and Seino,S. TITLE Kir2.2v: a possible negative regulator of the inwardly rectifying K+ channel Kir2.2 JOURNAL FEBS Lett. 386 (2-3), 211-214 (1996) MEDLINE 96228066 REFERENCE 2 (bases 1 to 1377) AUTHORS Namba,N. TITLE Direct Submission JOURNAL Submitted (29-MAR-1996) Noriyuki Namba, Division of Molecular Medicine, Center for Biomedical Science, Chiba University School of Medicine, 1-8-1, Inohana, Chuo-ku, Chiba, Chiba 260, Japan FEATURES Location/Qualifiers source 1..1377 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 7..1305 /codon_start=1 /product="inward rectifying K+ channel negative regulator Kir2.2v" /db_xref="PID:g1465744" /translation="MTAASRANPYSIVSSEEDGLHLVTMSGANGFGNGKVHTRRRCHN RFVKKNGQCNIEFANMDEKSQRYLADIFTTCVDIRWRYMLLIFSLAFLASWLLFGVIF WVIAVAHGDLEPAEGRGRTPCVMQVHGFMAAFLFSIKTQNTISYGLRCVTEECPVAVF MVVAQSIVGCIINSFMIGAIMAKMVRPKKRAQTLLFSHNAVVALRDGKLCFMWRVGNL RKSHIVEAHVRAQLIKPRVTKEGEYIPLDQIDIDVGFDKGLDHSFLVSPITILHEIDE ASPLFGISRQDLQMDDFEIVIILEGIVEATAMTTQARSSYLANEILWGHRFEPVLFEK NQYKIDYLHFHKTYEVPSTPRCSAKDLVENKFLLPRANSFCYKNELAFLSRDEEDEAD GDQDGRSREGLIPQARHDFDRLQAGGGVLEQRPYRRESEI" BASE COUNT 278 a 420 c 421 g 258 t ORIGIN 1 cccgggatga ccgcagccag ccgggccaac ccctacagca tcgtgtcatc ggaggaggac 61 gggctgcacc tggtcaccat gtcgggcgcc aacggcttcg gcaacggcaa ggtgcacacg 121 cggcgcaggt gccacaaccg cttcgtcaag aagaatggcc agtgcaacat tgagttcgcc 181 aacatggacg agaagtcaca gcgctacctg gctgacatat tcaccacctg tgtggacatc 241 cgctggcggt acatgctgct catattctca ctggccttcc ttgcctcctg gctgctgttt 301 ggtgtcatct tctgggtcat tgcggtggca catggtgacc tggagccggc tgagggccgc 361 ggccgcacac cctgtgtgat gcaggtgcac ggcttcatgg cggccttcct cttctccatc 421 aagacgcaga acaccatcag ctacgggctg cgctgtgtga cagaggagtg cccggtggcc 481 gtcttcatgg tggtggccca gtccatcgtg ggctgcatca tcaactcctt catgattggt 541 gccatcatgg ccaagatggt aaggcccaag aagcgggcac agacgctgct gttcagccac 601 aatgccgtgg tggccctgcg tgatggcaag ctctgcttca tgtggcgtgt gggcaacctg 661 cgtaagagcc acattgtgga ggcccatgtg cgcgcgcagc tcatcaagcc gcgggtcacc 721 aaggagggcg agtacatccc gctagaccag atcgacattg atgtgggctt cgacaagggc 781 ctggaccaca gctttctggt gtcacccatc accatcctgc acgagattga cgaggccagt 841 ccgctcttcg gcatcagccg gcaggacctg cagatggatg actttgagat cgtgatcatc 901 ctggaaggca ttgtggaggc cacagccatg accacccagg cccgcagctc ctacctggcc 961 aatgagatcc tgtggggtca ccgctttgag cccgtgctct tcgagaagaa ccagtacaag 1021 attgactact tgcacttcca caagacctat gaggtgccct ctacaccccg ctgcagcgcg 1081 aaggatctgg tggagaacaa gttcctgctg cccagggcca actccttctg ctacaagaac 1141 gagctggcct tcctgagccg tgacgaggag gatgaggcgg acggagacca ggatggccga 1201 agccgggaag gcctcatccc ccaggccagg catgactttg acagactcca ggctggtggc 1261 ggggtcctgg agcagcggcc ctacagacgg gagtcagaga tctaagccaa ccttggccga 1321 catgcagcat ccaccccagg ccggggagag gccccgtggt tgctcagggg ccccggg // LOCUS HSU55312 1903 bp DNA PRI 20-MAY-1996 DEFINITION Human G protein-coupled receptor GPR-NGA gene, complete cds. ACCESSION U55312 NID g1323695 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1903) AUTHORS Bonner,T.I and Matsuda,L.A. TITLE A G protein-coupled receptor expressed in NG108-15 and AtT-20 cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 1903) AUTHORS Bonner,T.I and Matsuda,L.A. TITLE Direct Submission JOURNAL Submitted (17-APR-1996) T.I. Bonner, Lab of Cell Biology, NIMH, Bldg 36, Rm 3A-17, MSC 4090, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..1903 /organism="Homo sapiens" /note="PCR product" /db_xref="taxon:9606" /cell_line="DMR1" /clone="hNGA" intron <1..382 /note="based on comparison with human brain cDNA, GenBank Accession Number H07970" mRNA <383..1791 CDS 405..1652 /note="G protein-coupled receptor of the rhodopsin family, ligand unknown" /codon_start=1 /product="GPR-NGA" /db_xref="PID:g1323696" /translation="MVFAHRMDNSKPHLIIPTLLVPLQNRSCTETATPLPSQYLMELS EEHSWMSNQTDLHYVLKPGEVATASIFFGILWLFSIFGNSLVCLVIHRSRRTQSTTNY FVVSMACADLLISVASTPFVLLQFTTGRWTLGSATCKVVRYFQYLTPGVQIYVLLSIC IDRFYTIVYPLSFKVSREKAKKMIAASWIFDAGFVTPVLFFYGSNWDSHCNYFLPSSW EGTAYTVIHFLVGFVIPSVLIILFYQKVIKYIWRIGTDGRTVRRTMNIVPRTKVKTIK MFLILNLLFLLSWLPFHVAQLWHPHEQDYKKSSLVFTAITWISFSSSASKPTLYSIYN ANFRRGMKETFCMSSMKCYRSNAYTITTSSRMAKKNYVGISEIPSMAKTITKDSIYDS FDREAKEKKLAWPINSNPPNTFV" polyA_site 1792 /note="based on comparison with human brain cDNA, GenBank Accession Number H07878" BASE COUNT 540 a 422 c 373 g 568 t ORIGIN 1 gaattccagc aaatcttcag ttggtggtaa cacccttacc atgagccaga tatgagatcc 61 ctaatattct gtgatccctg atgagtgaag ggaacaagga tatgtgtagg aggggagctc 121 ggtatgacta agggtcaaga gaaggtgagg ccagagagag cctgagctga gatctgctga 181 aacagctcct aaaatgaaaa caaggttggg gccagaattt tttctggata gtagttatgt 241 tttcctgcca acgctcaagt cctacacaaa gacaaatgac aatcaatgta aatgtcaaat 301 aagatcgtta gcctgagtaa tcataaccaa tctgtatgac acctttttaa caggaggcct 361 cattcttctt ttccccaacc agaattaaga gaaaaaaagt gaatatggtt tttgctcaca 421 gaatggataa cagcaagcca catttgatta ttcctacact tctggtgccc ctccaaaacc 481 gcagctgcac tgaaacagcc acacctctgc caagccaata cctgatggaa ttaagtgagg 541 agcacagttg gatgagcaac caaacagacc ttcactatgt gctgaaaccc ggggaagtgg 601 ccacagccag catcttcttt gggattctgt ggttgttttc tatcttcggc aattccctgg 661 tttgtttggt catccatagg agtaggagga ctcagtctac caccaactac tttgtggtct 721 ccatggcatg tgctgacctt ctcatcagcg ttgccagcac gcctttcgtc ctgctccagt 781 tcaccactgg aaggtggacg ctgggtagtg caacgtgcaa ggttgtgcga tattttcaat 841 atctcactcc aggtgtccag atctacgttc tcctctccat ctgcatagac cggttctaca 901 ccatcgtcta tcctctgagc ttcaaggtgt ccagagaaaa agccaagaaa atgattgcgg 961 catcgtggat ctttgatgca ggctttgtga cccctgtgct ctttttctat ggctccaact 1021 gggacagtca ttgtaactat ttcctcccct cctcttggga aggcactgcc tacactgtca 1081 tccacttctt ggtgggcttt gtgattccat ctgtcctcat aattttattt taccaaaagg 1141 tcataaaata tatttggaga ataggcacag atggccgaac ggtgaggagg acaatgaaca 1201 ttgtccctcg gacaaaagtg aaaactatca agatgttcct cattttaaat ctgttgtttt 1261 tgctctcctg gctgcctttt catgtagctc agctatggca cccccatgaa caagactata 1321 agaaaagttc ccttgttttc acagctatca catggatatc ctttagttct tcagcctcta 1381 aacctactct gtattcaatt tataatgcca attttcggag agggatgaaa gagacttttt 1441 gcatgtcctc tatgaaatgt taccgaagca atgcctatac tatcacaaca agttcaagga 1501 tggccaaaaa aaactacgtt ggcatttcag aaatcccttc catggccaaa actattacca 1561 aagactcgat ctatgactca tttgacagag aagccaagga aaaaaagctt gcttggccca 1621 ttaactcaaa tccaccaaat acttttgtct aagttctcat tctttcaatt gttatgcacc 1681 agagattaaa aagctttaac tataaaaaca gaagctattt acatatttgt tttcactcaa 1741 ctttccaagg gaaatgtttt attttgtaaa atgcattcat ttgtttactg tagtttttgt 1801 gggttttatt ttacttgctt tttatgtttt aggaaaagcg ttcactttga actttagcca 1861 acagtccttt tactattaat atattagtta catgcataaa aaa // LOCUS HSU56420 945 bp DNA PRI 30-MAY-1996 DEFINITION Human olfactory receptor (OLF1) gene, complete cds. ACCESSION U56420 NID g1336040 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 945) AUTHORS Issel-Tarver,L. and Rine,J. TITLE Evolution of Mammalian Olfactory Receptor Genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 945) AUTHORS Issel-Tarver,L. and Rine,J. TITLE Direct Submission JOURNAL Submitted (24-APR-1996) L. Issel-Tarver, MCB, UC Berkeley, 401 Barker Hall, Berkeley, CA 94720, USA FEATURES Location/Qualifiers source 1..945 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q11" gene 1..945 /gene="OLF1" CDS 1..945 /gene="OLF1" /note="olfactory receptor" /codon_start=1 /product="HsOLF1" /db_xref="PID:g1336041" /translation="MEFTDRNYTLVTEFILLGFPTRPELQIVLFLMFLTLYAIILIGN IGLMLLIRIDPHLQTPMYFFLSNLSFVDLCYFSDIVPKMLVNFLSENKSISYYGCALQ FYFFCTFADTESFILAAMAYDRYVAICNPLLYTVVMSRGICMRLIVLSYLGGNMSSLV HTSFAFILKYCDKNVINHFFCDLPPLLKLSCTDTTINEWLLSTYGSSVEIICFIIIII SYFFILLSVLKIRSFSGRKKTFSTCASHLTSVTIYQGTLLFIYSRPSYLYSPNTDKII SVFYTIFIPVLNPLIYSLRNKDVKDAAEKVLRSKVDSS" BASE COUNT 232 a 221 c 162 g 330 t ORIGIN 1 atggaattta cagatagaaa ctacacgttg gtcactgagt ttattctatt aggttttcca 61 actcgccctg aactgcagat tgtcctgttc ctcatgtttc tgacattgta tgctataatt 121 ctgataggga acattggatt gatgctgttg atcaggattg atcctcacct tcaaaccccc 181 atgtattttt tccttagcaa cctatcattt gtagaccttt gctatttctc agacattgtt 241 cccaaaatgc tggtcaattt cctctcggag aacaaatcta tttcctatta tgggtgtgcc 301 ctgcagtttt attttttctg tacttttgca gatacagaat ccttcatcct ggccgccatg 361 gcctatgatc gctatgtcgc catctgtaac cctttattgt acacagttgt gatgtctagg 421 ggcatctgta tgcggttgat tgtcttgtca taccttggag gcaacatgag ttccctggtt 481 cacacatcct ttgcctttat tctgaaatat tgtgacaaaa atgttattaa tcattttttc 541 tgtgacctcc ctcccctgct taaactatcc tgcactgaca caacaattaa tgagtggctc 601 ctctccacat acggcagctc agtggaaatc atttgtttta tcatcatcat catctcctac 661 tttttcattc ttctctcagt cttaaagatc cgctctttca gtgggaggaa gaagaccttt 721 tctacatgcg cctctcacct gacttcagtg acgatctacc aagggactct cctctttatt 781 tactcacggc ccagctacct gtattctcca aacactgata aaattatctc agtgttctac 841 accattttca ttccagtgct gaatccgttg atttatagtt tgagaaataa agatgtaaag 901 gatgcagctg agaaagttct aagatcaaag gtagattctt catga // LOCUS HSU56602 3480 bp DNA PRI 27-AUG-1996 DEFINITION Human peroxisome biogenesis disorder group 4 gene (PXAAA1) mRNA, complete cds. ACCESSION U56602 NID g1354752 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3480) AUTHORS Yahraus,T., Braverman,N., Dodt,G., Kalish,J.E., Morrell,J.C., Moser,H.W., Valle,D. and Gould,S.J. TITLE The peroxisome biogenesis disorder group 4 gene, PXAAA1, encodes a cytoplasmic ATPase required for stability of the PTS1 receptor JOURNAL EMBO J. 15 (12), 2914-2923 (1996) MEDLINE 96272151 REFERENCE 2 (bases 1 to 3480) AUTHORS Gould,S.J. TITLE Direct Submission JOURNAL Submitted (24-APR-1996) Stephen J. Gould, Biological Chemistry, The Johns Hopkins University School of Medicine, 725 North Wolfe Street, Baltimore, MD 21205, USA COMMENT PXAAA1 was intially identified on the basis of sequence similarity between the deduced aa sequence of a human EST clone and the Pichia pastoris PAS5 gene, mutations in which cause a defect in the import of both PTS1 and PTS2 containing proteins into the peroxisome. The observation that peroxisome biogenesis disorder patients belonging to the fourth complementation group carry mutations in PXAAA1, and have defects in the import of both PTS1 and PTS2 proteins suggests that PXAAA1 is indeed the human ortholog of P. pastoris PAS5. The product of PXAAA1, Pxaaa1p, contains consensus sequences for an ATPase, and substitution of an arginine codon in place of the lysine codon with the Walker A motif of PXAAA1 abolishes its biological activity, indicating that Pxaaa1p is an ATPase. FEATURES Location/Qualifiers source 1..3480 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p11-6p22" gene 213..3155 /gene="PXAAA1" CDS 213..3155 /gene="PXAAA1" /note="peroxisome biogenesis disorder group 4; required for stability of the PTS1 receptor; Description: PBD group 4 gene; ATPase, AAA family" /codon_start=1 /evidence=experimental /product="Pxaaa1p" /db_xref="PID:g1354753" /translation="MALAVLRVLEPFPTETPPLAVLLPPGGPWPAAELGLVLALRPAG ESPAGPALLVAALEGPDAGTEEQGPGPPQLLVNRALLRLLALGSGAWVRARAVRRPPA LGWALLGTSLGPGLGPRVGPLLVRRGETLPVPGPRVLETRPALQGLLGPGTRLAVTEL RGRARLCPESGDSSRPPPPPVVSSFAVSGTVRRLQGVLGGTGDSLGVSRSCLRGLGLF QGEWVWVAQARESSNTSQPHLARVQVLEPRWDLSDRLGPGSGPLGEPLADGLALVPAT LAFNLGCDPLEMGELRIQRYLEGSIAPEDKGSCSLLPGPPFARELHIEIVSSPHYSTN GNYDGVLYRHFQIPRVVQEGDVLCVPTIGQVEILEGSPEKLPRWREMFFKVKKTVGEA PDGPASAYLADTTHTSLYMVGSTLSPVPWLPSEESTLWSSLSPPGLEALVSELCAVLK PRLQPGGALLTGTSSVLLRGPPGCGKTTVVAAACSHLGLHLLKVPCSSLCAESSGAVE TKLQAIFSRARRCRPAVLLLTAVDLLGRDRDGLGEDARVMAVLRHLLLNEDPLNSCPP LMVVATTSRAQDLPADVQTAFPHELEVPALSEGQRLSILRALTAHLPLGQEVNLAQLA RRCAGFVVGDLYALLTHSSRAACTRIKNSGLAGGLTEEDEGELCAAGFPLLAEDFGQA LEQLQTAHSQAVGAPKIPSVSWHDVGGLQEVKKEILETIQLPLEHPELLSLGLRRSGL LLHGPPGTGKTLLAKAVATECSLTFLSVKGPELINMYVGQSEENVREVFARARAAAPC IIFFDELDSLAPSRGRSGDSGGVMDRVVSQLLAELDGLHSTQDVFVIGATNRPDLLDP ALLRPGRFDKLVFVGANEDRASQLRVLSAITRKFKLEPSVSLVNVLDCCPPQLTGADL YSLCSDAMTAALKRRVHDLEEGLEPGSSALMLTMEDLLQAAARLQPSVSEQELLRYKR IQRKFAAC" BASE COUNT 618 a 1056 c 1106 g 700 t ORIGIN 1 cgatgaaggt tactgcctat cgaggcacga cgcaagatca atccgaggcg cagctaaccc 61 cctcagagca agttcgcggc acccgacgcc cctccccttt tcctctggcc tcccctgacg 121 gaagcggaag cggccctcgc gcacactagt cgtctggttc tctggctccg gaagctgcgc 181 tccttcaccc tcctcgttgg tgtcctgtca ccatggcgct ggctgtcttg cgggtcctgg 241 agccctttcc gaccgagaca cccccgttgg cagtgctgct gccacccggg ggcccgtggc 301 cggcggcgga gctgggcctg gtgctggccc tgaggcctgc aggggagagc ccggcagggc 361 cggcgctgct ggtggcagcc ctggaggggc cggacgcggg caccgaagag cagggtcccg 421 ggccgccgca gctactggtt aaccgcgcgc tgctgcggct cctggcactg ggctccgggg 481 cctgggtgcg ggcgcgggcg gtgcggcggc ccccggcgct aggttgggca ctgcttggca 541 cctcgctggg gcctgggctc ggaccgcgag tcgggccgct gctggtgagg cgcggagaga 601 ccctcccagt tcccggaccg cgggtgctgg agacgcggcc ggcgttgcaa gggctgctgg 661 gcccagggac tcggctggct gtgactgagc tccgcgggcg ggccagactg tgtccagagt 721 ctggggacag cagtcggccc ccacccccgc ccgtggtgtc ctcctttgcg gtttctggca 781 cagtgcggcg actccaggga gttctgggag ggactggaga ttcactaggg gtgagccgga 841 gctgtctccg tggccttggc ctcttccagg gcgaatgggt gtgggtggcc caggccagag 901 agtcatcgaa cacttcacag ccgcacttgg ctagggtgca ggtcctagaa cctcgctggg 961 acctctctga tagactggga cccggctctg gaccgctggg agagcccctc gctgacggac 1021 tggcgcttgt ccctgccact ttggctttta atcttggctg tgaccccctg gaaatgggag 1081 agctcagaat tcagaggtac ttggaaggct ccatcgcccc tgaagacaaa ggaagctgct 1141 cattgctgcc tgggcctcca tttgccagag agttacacat cgaaattgtg tcttctcccc 1201 actacagcac taatggaaat tatgacggtg ttctttaccg gcactttcag atacccaggg 1261 tagtccagga aggggatgtt ctatgtgtgc caacaattgg gcaagtagag atcctggaag 1321 gaagtccaga gaaactgccc aggtggcggg aaatgttttt taaagtgaag aaaacagttg 1381 gggaagctcc agatggacca gccagtgcct acttggccga caccacccat acctccttgt 1441 acatggtggg ttctaccctg agccctgttc catggctccc ttcagaggaa tccactctct 1501 ggagcagttt gtctcctcca ggcctggagg ccttggtgtc tgaactctgt gctgtcctga 1561 agcctcgcct ccagccaggg ggtgccctgc tgacaggaac tagcagtgtc cttctacggg 1621 gccccccagg ctgtgggaag accacagtag ttgctgctgc ctgtagtcac cttgggctcc 1681 acttactgaa ggtgccctgc tccagcctct gtgcagaaag tagtggggct gtggagacaa 1741 aactgcaggc catcttctcc cgggcccgcc gttgccggcc tgcagtcctg ttgctcacag 1801 ctgtggacct tctgggccgg gaccgtgatg ggctgggtga ggatgcccgt gtgatggctg 1861 tgctgcgtca cctcctcctc aatgaggacc ccctcaacag ctgccctccc ctcatggttg 1921 tggccaccac aagccgggcc caggacctgc ctgctgatgt gcagacagca tttcctcatg 1981 agctcgaggt gcctgctctg tcagaggggc agcggctcag catcctgcgg gccctcactg 2041 cccaccttcc cctgggccag gaggtgaact tggcacagct agcacggcgg tgtgcaggct 2101 ttgtggtagg ggatctctat gcccttctga cccacagcag ccgggcagcc tgcaccagga 2161 tcaagaactc aggtttggca ggtggcttga ctgaggagga tgagggggag ctgtgtgctg 2221 ccggctttcc tctcctggct gaggactttg ggcaggcact ggagcaactg cagacagctc 2281 actcccaggc cgttggagcc cccaagatcc cctcagtgtc ctggcatgat gtgggtgggc 2341 tgcaggaggt gaagaaggag atcctggaga ccattcagct ccccctggag caccctgagc 2401 tactgagcct gggcctgaga cgctcaggcc ttctgctcca tgggccccct ggcaccggca 2461 agacccttct ggccaaggca gtagccactg agtgcagcct taccttcctc agcgtgaagg 2521 ggccagagct cattaacatg tatgtgggcc aaagtgagga gaatgtgcgg gaagtgtttg 2581 ccagggccag ggctgcagct ccatgcatta tcttctttga tgaactggac tctttggccc 2641 caagccgggg gcgaagtgga gattctggag gagtgatgga cagggtggtg tctcagctcc 2701 ttgccgagct agatgggctg cacagcactc aggatgtgtt tgtgattgga gccaccaaca 2761 gaccagatct cctggaccct gcccttctgc ggcctggcag atttgacaag ctggtgtttg 2821 tgggggcaaa tgaggaccgg gcctcccagc tacgcgttct aagtgccatc acacgcaaat 2881 tcaagctaga gccatctgtg agcctggtaa acgtgctaga ttgctgccct ccccagctga 2941 cgggcgcgga cctctactct ctctgctctg atgctatgac agctgccctc aaacgcaggg 3001 ttcatgacct ggaggaaggg ctggaaccag gtagctcagc actgatgctc accatggagg 3061 acttgctgca ggctgccgcc cggctgcaac cctcagtcag tgagcaggag ctgctccggt 3121 acaagcgcat ccagcgcaag tttgctgcct gctaggagcc ccccagggtc tgggaccccg 3181 ctcagcatgg ctgcaggtac cttgatagcc cacagagaga tctgggaagg aagggctcct 3241 cctcaggctg ctgccaaccc acctggaggc cacctccctc caggagatcc cagggtgcaa 3301 agtggcattg agacagcagc aacagctcaa gagatatctc ctgcctactt gcccctcctt 3361 ccaggccggc tctaagagaa aggcccatct actcaggaag agggccaggc cttgggttct 3421 ggggattggg ccctgagagg gctagttctg tggctgaaaa taaagcatgt cccgcccccg // LOCUS HSU57316 2093 bp DNA PRI 16-AUG-1996 DEFINITION Human GCN5 (hGCN5) gene, complete cds. ACCESSION U57316 NID g1491934 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2093) AUTHORS Berry,R., Stevens,T.J., Walter,N.A., Wilcox,A.S., Rubano,T., Hopkins,J.A., Weber,J., Goold,R., Soares,M.B. and Sikela,J.M. TITLE Gene-based sequence-tagged-sites (STSs) as the basis for a human gene map JOURNAL Nature Genet. 10 (4), 415-423 (1995) MEDLINE 95400322 REFERENCE 2 (bases 1 to 2093) AUTHORS Yang,X.J., Ogryzko,V.V., Nishikawa,J., Howard,B.H. and Nakatani,Y. TITLE A p300/CBP-associated factor that competes with the adenoviral oncoprotein E1A JOURNAL Nature 382 (6589), 319-324 (1996) MEDLINE 96300317 REFERENCE 3 (bases 1 to 2093) AUTHORS Walter,N. and Sikela,J.M. TITLE Randomly sequenced clone NIB2000-R JOURNAL Unpublished (1996) REFERENCE 4 (bases 1 to 2093) AUTHORS Nakatani,Y. TITLE Direct Submission JOURNAL Submitted (30-APR-1996) NICHD, NIH, Bldg. 6, Rm. 416, 6 Center Dr., MSC 2753, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2093 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" /note="hGCN5 was isolated by use of NIB2000-R as the probe" gene 352..1782 /gene="hGCN5" CDS 352..1782 /gene="hGCN5" /note="similar to yeast GCN5 and human p300/CBP-associated factor (P/CAF)" /codon_start=1 /product="GCN5" /db_xref="PID:g1491935" /translation="MLEEEIYGANSPIWESGFTMPPSEGTQLVPRPASVSAAVVPSTP IFSPSMGGGSNSSLSLDSAGAEPMPGEKRTLPENLTLEDAKRLRVMGDIPMELVNEVM LTITDPAAMLGPETSLLSANAARDETARLEERRGIIEFHVIGNSLTPKANRRVLLWLV GLQNVFSHQLPRMPKEYIARLVFDPKHKTLALIKDGRVIGGICFRMFPTQGFTEIVFC AVTSNEQVKGYGTHLMNHLKEYHIKHNILYFLTYADEYAIGYFKKQGFSKDIKVPKSR YLGYIKDYEGATLMECELNPRIPYTELSHIIKKQKEIIKKLIERKQAQIRKVYPGLSC FKEGVRQIPVESVPGIRETGWKPLGKEKGKELKDPDQLYTTLKNLLAQIKSHPSAWPF MEPVKKSEAPDYYEVIRFPIDLKTMTERLRSRYYVTRKLFVADLQRVIANCREYNPPD SEYCRCASALEKFFYFKLKEGGLIDK" BASE COUNT 435 a 653 c 586 g 419 t ORIGIN 1 gaattccggc gaaaccactc atgtctttgg gcgaagcctt ctccggtcca ttttcaccgt 61 tacccgccgg cagctgctgg aaaagttccg agtggagaag gacaaattgg tgcccgagaa 121 gaggaccctc atcctcactc acttccccaa gtaaggctcc ttctggccta ccaggatttg 181 gccccaagtt cacatcctcc ctgttgtccc cttttttcca ggaaggcttc ctggattggt 241 ccctcctctc cctccatggg ccttttggga tctgggcgtc tacctggcag acttgcccat 301 ggcccagaag caacttgcta gtactagtct ggggatggca gattcctgtc catgctggag 361 gaggagatct atggggcaaa ctctccaatc tgggagtcag gcttcaccat gccaccctca 421 gaggggacac agctggttcc ccggccagct tcagtcagtg cagcggttgt tcccagcacc 481 cccatcttca gccccagcat gggtgggggc agcaacagct ccctgagtct ggattctgca 541 ggggccgagc ctatgccagg cgagaagagg acgctcccag agaacctgac cctggaggat 601 gccaagcggc tccgtgtgat gggtgacatc cccatggagc tggtcaatga ggtcatgctg 661 accatcactg accctgctgc catgctgggg cctgagacga gcctgctttc ggccaatgcg 721 gcccgggatg agacagcccg cctggaggag cgccgcggca tcatcgagtt ccatgtcatc 781 ggcaactcac tgacgcccaa ggccaaccgg cgggtgttgc tgtggctcgt ggggctgcag 841 aatgtctttt cccaccagct gccgcgcatg cctaaggagt atatcgcccg cctcgtcttt 901 gacccgaagc acaagactct ggccttgatc aaggatgggc gggtcatcgg tggcatctgc 961 ttccgcatgt ttcccaccca gggcttcacg gagattgtct tctgtgctgt cacctcgaat 1021 gagcaggtca agggttatgg gacccacctg atgaaccacc tgaaggagta tcacatcaag 1081 cacaacattc tctacttcct cacctacgcc gacgagtacg ccatcggcta cttcaaaaag 1141 cagggtttct ccaaggacat caaggtgccc aagagccgct acctgggcta catcaaggac 1201 tacgagggag cgacgctgat ggagtgtgag ctgaatcccc gcatccccta cacggagctg 1261 tcccacatca tcaagaagca gaaagagatc atcaagaagc tgattgagcg caaacaggcc 1321 cagatccgca aggtctaccc ggggctcagc tgcttcaagg agggcgtgag gcagatccct 1381 gtggagagcg ttcctggcat tcgagagaca ggctggaagc cattggggaa ggagaagggg 1441 aaggagctga aggaccccga ccagctctac acaaccctca aaaacctgct ggcccaaatc 1501 aagtctcacc ccagtgcctg gcccttcatg gagcctgtga agaagtcgga ggcccctgac 1561 tactacgagg tcatccgctt ccccattgac ctgaagacca tgactgagcg gctgcgaagc 1621 cgctactacg tgacccggaa gctctttgtg gccgacctgc agcgggtcat cgccaactgt 1681 cgcgagtaca accccccgga cagcgagtac tgccgctgtg ccagcgccct ggagaagttc 1741 ttctacttca agctcaagga gggaggcctc attgacaagt aggcccatct ttgggccgca 1801 gccctgacct ggaatgtctc cacctcggat tctgatctga tccttagggg gtgccctggc 1861 cccacggacc cgactcagct tgagacactc cagccaaggg tcctccggac ccgatcctgc 1921 agctctttct ggaccttcag gcacccccaa gcgtgcagct ctgtcccagc cttcactgtg 1981 tgtgagaggt ctcctgggtt ggggcccagc ccctctagag tagctggtgg ccagggatga 2041 accttgccca gccgtggtgg cccccaggcc tggtccccaa gagcccggaa ttc // LOCUS HSU58681 1535 bp DNA PRI 04-DEC-1996 DEFINITION Human neurogenic basic-helix-loop-helix protein (NeuroD2) gene, complete cds. ACCESSION U58681 NID g1477748 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1535) AUTHORS McCormick,M.B., Tamimi,R.M., Snider,L., Asakura,A., Bergstrom,D. and Tapscott,S.J. TITLE NeuroD2 and neuroD3: distinct expression patterns and transcriptional activation potentials within the neuroD gene family JOURNAL Mol. Cell. Biol. 16 (10), 5792-5800 (1996) MEDLINE 96413331 REFERENCE 2 (bases 1 to 1535) AUTHORS Tapscott,S.J., Tamimi,R.T. and McCormick,B.M. TITLE Direct Submission JOURNAL Submitted (17-MAY-1996) Clinical Research, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 19104, USA FEATURES Location/Qualifiers source 1..1535 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q12" gene 55..1200 /gene="NeuroD2" CDS 55..1200 /gene="NeuroD2" /note="neurogenic basic-helix-loop-helix (bHLH) protein" /codon_start=1 /db_xref="PID:g1477749" /translation="MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAP GPGAPGPARAAKPVPLRGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPK KRGPKKRKMTKARLERSKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKI ETLRLAKNYIWALSEILRSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTE QGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAYET LYAAAGGGGASPDYNSSEYEGPLSPPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPG SRHGHGLVFGSSAVRGGVHSENLLSYDMHLHHDRGPMYEELNAFFHN" BASE COUNT 250 a 559 c 476 g 244 t 6 others ORIGIN 1 cccctcactt tgtgctgtct gtctcccctt cccgcccggg gnccctcagg caccatgctg 61 acccgcctgt tcagcgagcc cggccttctc tcggacgtgc ccaagttcgc cagctggggc 121 gacggcgaag acgacgagcc gaggagcgac aagggcgacg cgccgccacc gccaccgcct 181 gcgcccgggc caggggctcc ggggccagcc cgggcggcca agccagtccc tctccgtgga 241 gaagagggga cggaggccac gttggccgag gtcaaggagg aaggcgagct ggggggagag 301 gaggaggagg aagaggagga ggaagaagga ctggacgagg cggagggcga gcggcccaag 361 aagcgcgggc ccaagaagcg caagatgacc aaggcgcgct tggagcgctc caagcttcgg 421 cggcagaagg cgaacgcgcg ggagcgcaac cgcatgcacg acctgaacgc agccctggac 481 aacctgcgca aggtggtgcc ctgctactcc aagacgcaga agctgtccaa gatcgagacg 541 ctgcgcctag ccaagaacta tatctgggcg ctctcggaga tcctgcgctc cggcaagcgg 601 ccagacctag tgtcctacgt gcagactctg tgcaagggtc tgtcgcagcc caccaccaat 661 ctggtggccg gctgtctgca gctcaactct cgcaacttcc tcacggagca aggcgccgac 721 ggtgccggcc gcttccacgg ctcgggcggc ccgttcgcca tgcaccccta cccgtacccg 781 tgctcgcgcc tggcgggcgc acagtgccag gcggccggcg gcctgggcgg cggcgcggcg 841 cacgccctgc ggacccacgg ctactgcgcc gcctacgaga cgctgtatgc ggcggcaggc 901 ggtggcggcg cgagcccgga ctacaacagc tccgagtacg agggcccgct cagccccccg 961 ctctgtctca atggcaactt ctcactcaag caggactcct cgcccgacca cgagaaaagc 1021 taccactact ctatgcacta ctcggcgctg cccggttcgc gccacggcca cgggctagtc 1081 ttcggctcgt cggctgtgcg cgggggcgtc cactcggaga atctcttgtc ttacgatatg 1141 caccttcacc acgaccgggg ccccatgtac gaggagctca atgcgttttt tcataactga 1201 gacttcgcgc cgnctccctn ctttttcttt tgcctttgcc cgcccccctg tccccagccc 1261 ccagagcgca gggacacccc catnctaccc cggcnccggc ggagcgggcc accggtctgc 1321 cgctctcctg gggcagcgca gtctgttacn tgtgggtggc tgtcccaggg gcctcgcttc 1381 ccccagggac tcgccttctc tctccaaggg gttccctcct cctctctccc aaggagtgct 1441 tctccaggga cctctctccg ggggctccct ggaggcaccc ctcccccatt cccaatatct 1501 tcgctgaggt ttcctcctcc ccctcctccc tgcag // LOCUS HSU60289 4742 bp DNA PRI 02-SEP-1996 DEFINITION Human receptor protein tyrosine phosphatase psi R-PTP-Psi gene, complete cds. ACCESSION U60289 NID g1518671 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4742) AUTHORS Banville,D., Masson,S., L'Abbe,D., Stocco,R. and Shen,S.-H. TITLE Cloning and expression of R-PTP-Psi, a novel receptor protein tyrosine phosphatase related to the homophilic binding R-PTP-Kappa and -mu JOURNAL Unpublished REFERENCE 2 (bases 1 to 4742) AUTHORS Banville,D., Masson,S., L'Abbe,D., Stocco,R. and Shen,S.-H. TITLE Direct Submission JOURNAL Submitted (10-JUN-1996) Pharmaceutical Biotechnology, Biotechnology Research Institute, 6100 Royalmount, Montreal, Quebec H4P 2R2, Canada FEATURES Location/Qualifiers source 1..4742 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 39..4349 /note="R-PTP-Psi" /codon_start=1 /product="receptor protein tyrosine phosphatase psi" /db_xref="PID:g1518672" /translation="MARAQALVLALTFQLCAPETETPAAGCTFEEASDPAVPCEYSQA QYDDFQWEQVRIHPGTRAPADLPHGSYLMVNTSQHAPGQRAHVIFQSLSENDTHCVQF SYFLYSRDGHSPGTLGVYVRVNGGPLGSAVWNMTGSHGRQWHQAELAVSTFWPNEYQV LFEALISPDRRGYMGLDDILLLSYPCAKAPHFSRLGDVEVNAGQNASFQCMAAGRAAE AERFLLQRQSGALVPAAGVRHISHRRFLATFPLAAVSRAEQDLYRCVSQAPRGAGVSN FAELIVKEPPTPIAPPQLLRAGPTYLIIQLNTNSIIGDGPIVRKEIEYRMARGPWAEV HAVSLQTYKLWHLDPDTEYEISVLLTRPGDGGTGRPGPPLISRTKCAEPMRAPKGLAF AEIQARQLTLQWEPLGYNVTRCHTYTVSLCYHYTLGSSHNQTIRECVKTEQGVSRYTI KNLLPYRNVHVRLVLTNPEGRKEGKEVTFQTDEDVPSGIAAESLTFTPLEDMIFLKWE EPQEPNGLITQYEISYQSIESSDPAVNVPGPRRTISKLRNETYHVFSNLHPGTTYLFS VRARTGKGFGQAALTEITTNISAPSFDYADMPSPLGESENTITVLLRPAQGRGAPISV YQVIVEEERARRLRREPGGQDCFPVPLTFEAALARGLVHYFGAELAASSLPEAMPFTV GDNQTYRGFWNPPLEPRKAYLIYFQAASHLKGETRLNCIRIARKAACKESKRPLEVSQ RSEEMGLILGICAGGLAVLILLLGAIIVIIRKGKPVNMTKATVNYRQEKTHMMSAVDR SFTDQSTLQEDERLGLSFMDTHGYSTRGDQRSGGVTEASSLLGGSPRRPCGRKGSPYH TGQLHPAVRVADLLQHINQMKTAEGYGFKQEYESFFEGWDATKKKDKVKGSRQEPMPA YDRHRVKLHPMLGDPNADYINANYIDGYHRSNHFIATQGPKPEMVYDFWRMVWQEHCS SIVMITKLVEVGRVKCSRYWPEDSDTYGDIKIMLVKTETLAEYVVRTFALERRGYSAR HEVRQFHFTAWPEHGVPYHATGLLAFIRRVKASTPPDAGPIVIHCSAGTGRTGCYIVL DVMLDMAECEGVVDIYNCVKTLCSRRVNMIQTEEQYIFIHDAILEACLCGETTIPVSE FKATYKEMIRIDPQSNSSQLREEFQTLNSVTPPLDVEECSIALLPRNRDKNRSMDVLP ADRCLPFLISTDGDSNNYINAALTDSYTRSAAFIVTLHPLQSTTPDFWRLVYDYGCTS IVMLNQLNQSNSAWPCLQYWPEPGRQQYGLMEVEFMSGTADEDLVARVFRVQNISRLQ EGHLLVRHFQFLRWSAYRDTPDSKKAFLHLLAEVDKWQAESGDGRTIVHCLNGGGRSG TFCACATVLEMIRCHNLVDVFFAAKTLRNYKPNMVETMDQYHFCYDVVLEYLEGLESR " BASE COUNT 953 a 1520 c 1422 g 847 t ORIGIN 1 tcccgcgccg ggccccggga cgcggcgatc gtccaaccat ggcccgtgcc caggcgctcg 61 tgctggcact caccttccag ctctgcgcgc cggagaccga gactccggca gctggctgca 121 ccttcgagga ggcaagtgac ccagcagtgc cctgcgagta cagccaggcc cagtacgatg 181 acttccagtg ggagcaagtg cgaatccacc ctggcacccg ggcacctgcg gacctgcccc 241 acggctccta cttgatggtc aacacttccc agcatgcccc aggccagcga gcccatgtca 301 tcttccagag cctgagcgag aatgataccc actgtgtgca gttcagctac ttcctgtaca 361 gccgggacgg gcacagcccg ggcaccctgg gcgtctacgt gcgcgttaat gggggccccc 421 tgggcagtgc tgtgtggaat atgactggat cccacggccg tcagtggcac caggctgagc 481 tggctgtcag cactttctgg cccaatgaat atcaggtgct gtttgaggcc ctcatctccc 541 cagaccgcag gggctacatg ggcctagatg acatcctgct tctcagctac ccctgcgcaa 601 aggccccaca cttctcccgc ctgggcgacg tggaggtcaa cgcgggccag aacgcgtcgt 661 tccagtgcat ggccgcgggc agagcggccg aggccgaacg cttcctcttg caacggcaga 721 gcggggcgct ggtgccggcg gcgggcgtgc ggcacatcag ccaccggcgc ttcctggcca 781 ctttcccgct ggctgccgtg agccgcgccg agcaggacct gtaccgctgt gtgtcccagg 841 ccccgcgcgg cgcgggcgtc tctaacttcg cggagctcat cgtcaaggag cccccaactc 901 ccatcgcgcc cccacagctg ctgcgtgctg gccccaccta cctcatcatc cagctcaaca 961 ccaactccat cattggcgac gggccgatcg tgcgcaagga gattgagtac cgcatggcgc 1021 gcgggccctg ggctgaggtg cacgccgtca gcctgcagac ctacaagctg tggcacctcg 1081 accccgacac agagtatgag atcagcgtgc tgctcacgcg tcccggagac ggcggcactg 1141 gccgccctgg gccacccctc atcagccgca ccaaatgcgc agagcccatg agggccccca 1201 aaggcctggc ttttgctgag atccaggccc gtcagctgac cctgcagtgg gaaccactgg 1261 gctacaacgt gacgcgttgc cacacctata ctgtgtcgct gtgctatcac tacaccctgg 1321 gcagcagcca caaccagacc atccgagagt gtgtgaagac agagcaaggt gtcagccgct 1381 acaccatcaa gaacctgctg ccctatcgga acgttcacgt gaggcttgtc ctcactaacc 1441 ctgaggggcg caaagagggc aaggaggtca ctttccagac ggatgaggat gtgcccagtg 1501 ggattgcagc cgagtccctg accttcactc cactggagga catgatcttc ctcaagtggg 1561 aggagcccca ggagcccaat ggtctcatca cccagtatga gatcagctac cagagcatcg 1621 agtcatcaga cccggcagtg aacgtgccag gcccacgacg taccatctcc aagctccgca 1681 atgagaccta ccatgtcttc tccaacctgc acccaggcac cacctacctg ttctccgtgc 1741 gggcccgcac aggcaaaggc ttcggccagg cggcactcac tgagataacc actaacatct 1801 ctgctcccag ctttgattat gccgacatgc cgtcacccct gggcgagtct gagaacacca 1861 tcaccgtgct gctgaggccg gcacagggcc gcggtgcgcc catcagtgtg taccaggtga 1921 ttgtggagga ggagcgggcg cggaggctgc ggcgggagcc aggtggacag gactgcttcc 1981 cagtgccatt gaccttcgag gcggcgctgg cccgaggcct ggtgcactac ttcggggccg 2041 aactggcggc cagcagtcta cctgaggcca tgccctttac cgtgggtgac aaccagacct 2101 accgaggctt ctggaaccca ccacttgagc ctaggaaggc ctatctcatc tacttccagg 2161 cagcaagcca cctgaagggg gagacccggc tgaattgcat ccgcattgcc aggaaagctg 2221 cctgcaagga aagcaagcgg cccctggagg tgtcccagag atcggaggag atggggctta 2281 tcctgggcat ctgtgcaggg gggcttgctg tcctcatcct tctcctgggt gccatcattg 2341 tcatcatccg caaagggaag ccggtgaaca tgaccaaggc caccgtcaac taccgccagg 2401 agaagacaca catgatgagc gccgtggacc gcagcttcac agaccagagc accctgcagg 2461 aggacgagcg gctgggcctg tccttcatgg acacccatgg ctacagcacc cggggagacc 2521 agcgcagcgg tggggtcact gaggccagca gcctcctggg gggctccccg aggcgtccct 2581 gtggccggaa gggctcccca taccacacgg ggcagctgca ccctgcggtg cgtgtcgcag 2641 accttctgca gcacatcaac cagatgaaga cggccgaggg ttacggcttc aagcaggagt 2701 atgagagctt ctttgaaggc tgggacgcca caaagaagaa agacaaggtc aagggcagcc 2761 ggcaggagcc aatgcctgcc tatgatcggc accgagtgaa actgcacccg atgctgggag 2821 accccaatgc cgactacatt aatgccaact acatagatgg ttaccacagg tcaaaccact 2881 tcatagccac tcaagggccg aagcctgaga tggtctatga cttctggcgt atggtgtggc 2941 aggagcactg ttccagcatc gtcatgatca ccaagctggt cgaggtgggc agggtgaaat 3001 gctcacggta ctggccggag gactcagaca cctacgggga catcaagatt atgctggtga 3061 agacagagac cctggctgag tatgtcgtgc gcacttttgc cctggagcgg agaggctact 3121 ctgcccggca cgaggtccgc cagttccact tcacagcgtg gccagagcat ggcgtcccct 3181 accatgccac ggggctgctg gctttcatcc ggcgcgtgaa ggcctccacc ccacctgatg 3241 ccgggcccat tgtcatccac tgcagcgcgg gcaccggccg cacaggttgc tatatcgtcc 3301 tggatgtgat gctggacatg gcagagtgtg agggcgtcgt ggacatttac aactgtgtga 3361 agactctctg ctcccggcgt gtcaacatga tccagactga ggagcagtac atcttcattc 3421 atgatgcaat cctggaggcc tgcctgtgtg gggagaccac catccctgtc agtgagttca 3481 aggccaccta caaggagatg atccgcattg atcctcagag taattcctcc cagctgcggg 3541 aagagttcca gacgctgaac tcggtcaccc cgccgctgga cgtggaggag tgcagcatcg 3601 ccctgttgcc ccggaaccgt gacaagaacc gcagcatgga cgtcctgccg gccgaccgct 3661 gcctgccctt cctcatctcc actgatgggg actccaacaa ctacattaat gcagccctga 3721 ctgacagcta cacacggagt gcggccttca tcgtgaccct gcacccgctg cagagcacca 3781 cgcccgactt ctggcggctg gtctacgatt acgggtgcac ctccatcgtc atgctcaacc 3841 agctgaacca gtccaactcc gcctggccct gcctgcagta ctggccagag ccaggccggc 3901 agcaatatgg cctcatggag gtggagttta tgtcgggcac agctgatgaa gacttagtgg 3961 ctcgagtctt ccgggtgcag aacatctctc ggttgcagga ggggcacctg ctggtgcggc 4021 acttccagtt cctgcgctgg tctgcatacc gggacacacc tgactccaag aaggccttct 4081 tgcacctgct ggctgaggtg gacaagtggc aggccgagag tggggatggg cgcaccatcg 4141 tgcactgcct aaacggggga ggacgcagcg gcaccttctg cgcctgcgcc acggtcctgg 4201 agatgatccg ctgccacaac ttggtggacg ttttctttgc tgccaaaacc ctccggaact 4261 acaaacccaa catggtggag accatggatc agtaccactt ttgctacgat gtggtcctgg 4321 agtacttgga ggggctggag tcaagatagc ggggccctgg cctggggcac ccactgcaca 4381 ctcagggcca gacccaccat cctggactgg cgaggaagat cagtgcctcc tgctctgccc 4441 aaacacactc ccatggggca agcactggag tggatgctgg gctatcttgc tcccccttcc 4501 accgtgggca gggcctttcg cttgtcccat gggcgggtgg tgggccaagg aggagcttag 4561 caagtctgca gcccagcccc acctccatag ggtcctgcag gcctgtgctg agaggcctgg 4621 tgctgcctgg cagagtgaca aaggctcagg acggctggct ctgggggact caggccaagc 4681 cccttggcac catcctggct tttggcaggg atgagtgagg ccctgcagag agcccggaat 4741 tc // LOCUS HSU61148 1572 bp DNA PRI 25-JAN-1997 DEFINITION Human atonal homolog 1 (Hath1) gene, complete cds. ACCESSION U61148 NID g1575354 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1572) AUTHORS Ben-Arie,N., McCall,A.E., Berkman,S., Eichele,G., Bellen,H.J. and Zoghbi,H.Y. TITLE Evolutionary conservation of sequence and expression of the bHLH protein Atonal suggests a conserved role in neurogenesis JOURNAL Hum. Mol. Genet. 5 (9), 1207-1216 (1996) MEDLINE 97026280 REFERENCE 2 (bases 1 to 1572) AUTHORS Ben-Arie,N. TITLE Direct Submission JOURNAL Submitted (18-JUN-1996) Pediatrics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1572 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q22" gene 363..1427 /gene="Hath1" CDS 363..1427 /gene="Hath1" /note="atonal homolog 1" /codon_start=1 /product="HATH1" /db_xref="PID:g1575355" /translation="MSRLLHAEEWAEVKELGDHHRQPQPHHLPQPPPPPQPPATLQAR EHPVYPPELSLLDSTDPRAWLAPTLQGICTARAAQYLLHSPELGASEAAAPRDEVDGR GELVRRSSGGASSSKSPGPVKVREQLCKLKGGVVVDELGCSRQRAPSSKQVNGVQKQR RLAANARERRRMHGLNHAFDQLRNVIPSFNNDKKLSKYETLQMAQIYINALSELLQTP SGGEQPPPPPASCKSDHHHLRTAASYEGGAGNATAAGAQQASGGSQRPTPPGSCRTRF SAPASAGGYSVQLDALHFSTFEDSALTAMMAQKNLSPSLPGSILQPVQEENSKTSPRS HRSDGEFSPHSHYSDSDEAS" misc_feature 834..1004 /gene="Hath1" /note="encodes basic helix-loop-helix" BASE COUNT 334 a 475 c 497 g 262 t 4 others ORIGIN 1 gtcctctgca cacaagaact tttctcgggg tgtaaaaact ctttgattgg ctgctcgcac 61 gcgcctgccc gcgccctcca ttggctgaga agacacgcga ccggcgcgag gagggggttg 121 ggagaggagc ggggggagac tgagtggcgc gtgccgcttt ttaaaggggc gcagcgcctt 181 cagcaaccgg agaagcatag ttgcacgcga cctggtgtgt gatctccgag tgggtggggg 241 agggtcgagg agggaaaaaa aaataagacg ttgcagaaga gacccggaaa gggccttttt 301 tttggttgag ctggtgtccc agtgctgcct ccgatcctga gcgtccgagc ctttgcagtg 361 caatgtcccg cctgctgcat gcagaagagt gggctgaagt gaaggagttg ggagaccacc 421 atcgccagcc ccagccgcat catctcccgc aaccgccgcc gccgccgcag ccacctgcaa 481 ctttgcaggc gagagagcat cccgtctacc cgcctgagct gtccctcctg gacagcaccg 541 acccacgcgc ctggctggct cccactttgc agggcatctg cacggcacgc gccgcccagt 601 atttgctaca ttccccggag ctgggtgcct cagaggccgc tgcgccccgg gacgaggtgg 661 acggccgggg ggagctggta aggaggagca gcggcggtgc cagcagcagc aagagccccg 721 ggccggtgaa agtgcgggaa cagctgtgca agctgaaagg cggggtggtg gtagacgagc 781 tgggctgcag ccgccaacgg gccccttcca gcaaacaggt gaatggggtg cagaagcaga 841 gacggctagc agccaacgcc agggagcggc gcaggatgca tgggctgaac cacgccttcg 901 accagctgcg caatgttatc ccgtcgttca acaacgacaa gaagctgtcc aaatatgaga 961 ccctgcagat ggcccaaatc tacatcaacg ccttgtccga gctgctacaa acgcccagcg 1021 gaggggaaca gccaccgccg cctccagcct cctgcaaaag cgaccaccac caccttcgca 1081 ccgcggcctc ctatgaaggg ggcgcgggca acgcgaccgc agctggggct cagcaggctt 1141 ccggagggag ccagcggccg accccgcccg ggagttgccg gactcgcttc tcagccccag 1201 cttctgcggg agggtactcg gtgcagctgg acgctctgca cttctcgact ttcgaggaca 1261 gcgccctgac agcgatgatg gcgcaaaaga atttgtctcc ttctctcccc gggagcatct 1321 tgcagccagt gcaggaggaa aacagcaaaa cttcgcctcg gtcccacaga agcgacgggg 1381 aattttcccc ccattcccat tacagtgact cggatgaggc aagttaggaa ggtgacagaa 1441 gcctgaaaac tgagacagaa acaaaactgc cctttcccag tgcgcgggaa gccccgnggt 1501 taangatccc cgcacccttt aatttnggct ctgcgatggt cgttgtttag caacgacttg 1561 gctncagatg gt // LOCUS HSU61734 1359 bp DNA PRI 10-JUL-1996 DEFINITION Human protein trafficking protein (S31iii125) mRNA, complete cds. ACCESSION U61734 NID g1407825 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1359) AUTHORS Sanseau,P., Sherington,R., Trower,M.K., St-George-Hyslop,P. and Dykes,C.W. TITLE New human gene, member of a large family implicated in protein trafficking JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 1359) AUTHORS Sanseau,P., Sherington,R., Trower,M.K., St-George-Hyslop,P. and Dykes,C.W. TITLE Direct Submission JOURNAL Submitted (21-JUN-1996) Genomics, Glaxo-Wellcome, Gunnels Wood Road, Stevenage, HH SG1 2NY, United Kingdom FEATURES Location/Qualifiers source 1..1359 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q24.3" gene 28..687 /gene="S31iii125" CDS 28..687 /gene="S31iii125" /function="putative role in protein trafficking" /codon_start=1 /product="protein trafficking protein" /db_xref="PID:g1407826" /translation="MSGLSGPPARRGPFPLALLLLFLLGPRLVLAISFHLPINSRKCL REEIHKDLLVTGAYEISDQSGGAGGLRSHLKITDSAGHILYSKEDATKGKFAFTTEDY DMFEVCFESKGTGRIPDQLVILDMKHGVEAKNYEEIAKVEKLKPLEVELRRLEDLSES IVNDFAYMKKREEEMRDTNESTNTRVLYFSIFSMFCLIGLATWQVFYLRRFFKAKKLI E" BASE COUNT 349 a 315 c 303 g 392 t ORIGIN 1 aagcttggca cgagggtctc cagcaccatg tctggtttgt ctggcccacc agcccggcgc 61 ggcccttttc cgttagcgtt gctgcttttg ttcctgctcg gccccagatt ggtccttgcc 121 atctccttcc atctgcccat taactctcgc aagtgcctcc gtgaggagat tcacaaggac 181 ctgctagtga ctggcgcgta cgagatctcc gaccagtctg ggggcgctgg cggcctgcgc 241 agccacctca agatcacaga ttctgctggc catattctct actccaaaga ggatgcaacc 301 aaggggaaat ttgcctttac cactgaagat tatgacatgt ttgaagtgtg ttttgagagc 361 aagggaacag ggcggatacc tgaccaactc gtgatcctag acatgaagca tggagtggag 421 gcgaaaaatt acgaagagat tgcaaaagtt gagaagctca aaccattaga ggtagagctg 481 cgacgcctag aagacctttc agaatctatt gttaatgatt ttgcctacat gaagaagaga 541 gaagaggaga tgcgtgatac caacgagtca acaaacactc gggtcctata cttcagcatc 601 ttttcaatgt tctgtctcat tggactagct acctggcagg tcttctacct gcgacgcttc 661 ttcaaggcca agaaattgat tgagtaatga atgaggcata ttctcctccc accttgtacc 721 tcagccagca gaacatcgct gggacgtgcc tggcctaagg catcctacca acagcaccat 781 caaggcacgt tggtgctttc ttgccagaac tgatctcttt tggtgtggga ggacatgggg 841 taccacctac acccaacaag tcaatgaggg acttcttttt taatttggta ggattttgac 901 tggttttgca acaataggtc tattattaga gtcacctatg acaaaaaata ggggttacct 961 agataatgcc aaagtcagca tttgtcctgg gttcccttgt gtgatctgtt tggactatgt 1021 tttcttttct tctcccactt gctcagcagc ttgggcttcc attctagttc ttttaccaag 1081 atttttgtgt gaccatgttg acttcatttg gattgccctc tttcaatttc cttgtgaaaa 1141 cacccttaac tttctcttta cccttagctg aaatgtgtac atagcttctg gtgatatctt 1201 ctcatgattt tatatctctt aaaatggtga tggatgtgac acctcataaa agtgagcttt 1261 gaactgtaga taactcttaa agaaaatgtc attttagaca attaaaatat ttgtgctcca 1321 aaaaaaaaaa aaaaaattcc tggggccgca agggaattc // LOCUS HSU63329 1869 bp DNA PRI 28-JUL-1996 DEFINITION Human mutY homolog (hMYH) gene, complete cds. ACCESSION U63329 NID g1458227 KEYWORDS mutY; micA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1869) AUTHORS Slupska,M.M., Baikalov,C., Luther,W.M., Chiang,J.H., Wei,Y.F. and Miller,J.H. TITLE Cloning and sequencing a human homolog (hMYH) of the Escherichia coli mutY gene whose function is required for the repair of oxidative DNA damage JOURNAL J. Bacteriol. 178 (13), 3885-3892 (1996) MEDLINE 96272264 REFERENCE 2 (bases 1 to 1869) AUTHORS Slupska,M.M., Baikalov,C., Luther,W.M., Chiang,J-H., Wei,Y-F. and Miller,J.H. TITLE Direct Submission JOURNAL Submitted (09-JUL-1996) Microbiology and Molecular Genetics, UCLA, 405 Hilgard Ave, Los Angeles, CA 90025, USA FEATURES Location/Qualifiers source 1..1869 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="between 1p32.1 and 1p34.3" 5'UTR 1..182 gene 183..1790 /gene="hMYH" CDS 183..1790 /gene="hMYH" /codon_start=1 /product="mutY homolog" /db_xref="PID:g1458228" /translation="MTPLVSRLSRLWAIMRKPRAAVGSGHRKQAASQEGRQKHAKNNS QAKPSACDGLARQPEEVVLQASVSSYHLFRDVAEVTAFRGSLLSWYDQEKRDLPWRRR AEDEMDLDRRAYAVWVSEVMLQQTQVATVINYYTGWMQKWPTLQDLASASLEEVNQLW AGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTAGAIASIAFGQAT GVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLVDPARPGDFNQAAMELGATVC TPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVEECAPNTGQCHLCLPPSEP WDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLVQRPNSGLLAGLWEFPS VTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIKLTYQVYGLALEGQT PVTTVPPGARWLTQEEFHTAAVSTAMKKVFRVYQGQQPGTCMGSKRSQVSSPCSRKKP RMGQQVLDNFFRSHISTDAHSLNSAAQ" allele replace(377,"t") /gene="hMYH" allele replace(740,"t") /gene="hMYH" allele replace(1154,"c") /gene="hMYH" polyA_site 1853 BASE COUNT 401 a 550 c 570 g 348 t ORIGIN 1 tctcctcgtg gctagttcag gcggaaggag cagtcctctg aagcttgagg agcctctaga 61 actatgagcc cgaggccttc ccctctccca gagcgcagag gctttgaagg ctacctctgg 121 gaagccgctc accgtcggaa gctgcgggag ctgaaactgc gccatcgtca ctgtcggcgg 181 ccatgacacc gctcgtctcc cgcctgagtc gtctgtgggc catcatgagg aagccacgag 241 cagccgtggg aagtggtcac aggaagcagg cagccagcca ggaagggagg cagaagcatg 301 ctaagaacaa cagtcaggcc aagccttctg cctgtgatgg cctggccagg cagccggaag 361 aggtggtatt gcaggcctct gtctcctcat accatctatt cagagacgta gctgaagtca 421 cagccttccg agggagcctg ctaagctggt acgaccaaga gaaacgggac ctaccatgga 481 gaagacgggc agaagatgag atggacctgg acaggcgggc atatgctgtg tgggtctcag 541 aggtcatgct gcagcagacc caggttgcca ctgtgatcaa ctactatacc ggatggatgc 601 agaagtggcc tacactgcag gacctggcca gtgcttccct ggaggaggtg aatcaactct 661 gggctggcct gggctactat tctcgtggcc ggcggctgca ggagggagct cggaaggtgg 721 tagaggagct agggggccac atgccacgta cagcagagac cctgcagcag ctcctgcctg 781 gcgtggggcg ctacacagct ggggccattg cctctatcgc ctttggccag gcaaccggtg 841 tggtggatgg caacgtagca cgggtgctgt gccgtgtccg agccattggt gctgatccca 901 gcagcaccct tgtttcccag cagctctggg gtctagccca gcagctggtg gacccagccc 961 ggccaggaga tttcaaccaa gcagccatgg agctaggggc cacagtgtgt accccacagc 1021 gcccactgtg cagccagtgc cctgtggaga gcctgtgccg ggcacgccag agagtggagc 1081 aggaacagct cttagcctca gggagcctgt cgggcagtcc tgacgtggag gagtgtgctc 1141 ccaacactgg acagtgccac ctgtgcctgc ctccctcgga gccctgggac cagaccctgg 1201 gagtggtcaa cttccccaga aaggccagcc gcaagccccc cagggaggag agctctgcca 1261 cctgtgttct ggaacagcct ggggcccttg gggcccaaat tctgctggtg cagaggccca 1321 actcaggtct gctggcagga ctgtgggagt tcccgtccgt gacctgggag ccctcagagc 1381 agcttcagcg caaggccctg ctgcaggaac tacagcgttg ggctgggccc ctcccagcca 1441 cgcacctccg gcaccttggg gaggttgtcc acaccttctc tcacatcaag ctgacatatc 1501 aagtatatgg gctggccttg gaagggcaga ccccagtgac caccgtacca ccaggtgctc 1561 gctggctgac gcaggaggaa tttcacaccg cagctgtttc caccgccatg aaaaaggttt 1621 tccgtgtgta tcagggccaa cagccaggga cctgtatggg ttccaaaagg tcccaggtgt 1681 cctctccgtg cagtcggaaa aagccccgca tgggccagca agtcctggat aatttctttc 1741 ggtctcacat ctccactgat gcacacagcc tcaacagtgc agcccagtga cacctctgaa 1801 agcccccatt ccctgagaat cctgttgtta gtaaagtgct tatttttgta gttaaaaaaa 1861 aaaaaaaaa // LOCUS HSU63842 1268 bp DNA PRI 04-DEC-1996 DEFINITION Human neurogenic basic-helix-loop-helix protein (neuroD3) gene, complete cds. ACCESSION U63842 NID g1654337 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1268) AUTHORS McCormick,M.B., Tamimi,R.M., Snider,L., Asakura,A., Bergstrom,D. and Tapscott,S.J. TITLE NeuroD2 and neuroD3: distinct expression patterns and transcriptional activation potentials within the neuroD gene family JOURNAL Mol. Cell. Biol. 16 (10), 5792-5800 (1996) MEDLINE 96413331 REFERENCE 2 (bases 1 to 1268) AUTHORS Tapscott,S.J., Tamimi,R., Bergstrom,D. and McCormick,M.B. TITLE Direct Submission JOURNAL Submitted (15-JUL-1996) Clinical Research, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 98104, USA FEATURES Location/Qualifiers source 1..1268 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q22-35" gene 55..768 /gene="neuroD3" CDS 55..768 /gene="neuroD3" /note="bHLH protein related to neuroD; neurogenic basic-helix-loop-helix protein" /codon_start=1 /db_xref="PID:g1654338" /translation="MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGP PAPARRSAPNISRASEVPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRER NRMHNLNAALDALRSVLPSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGA RERLLPPQCVPCLPGPPSPASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFS FPSLPKDLLHTTPCFIPYH" BASE COUNT 246 a 455 c 343 g 224 t ORIGIN 1 ctgcagcgct ctgagccgct ttctatctgt ccgtcggtcc tgcacagcgc aacgatgcca 61 gcccgccttg agacctgcat ctccgacctc gactgcgcca gcagcagcgg cagtgaccta 121 tccggcttcc tcaccgacga ggaagactgt gccagactcc aacaggcagc ctccgcttcg 181 gggccgcccg cgccggcccg caggagcgcg cccaatatct cccgggcgtc tgaggttcca 241 ggggcacagg acgacgagca ggagaggcgg cggcgccgcg gccggacgcg ggtccgctcc 301 gaggcgctgc tgcactcgct gcgcaggagc cggcgcgtca aggccaacga tcgcgagcgc 361 aaccgcatgc acaacttgaa cgcggccctg gacgcactgc gcagcgtgct gccctcgttc 421 cccgacgaca ccaagctcac caaaatcgag acgctgcgct tcgcctacaa ctacatctgg 481 gctctggccg agacactgcg cctggcggat caagggctgc ccggaggcgg tgcccgggag 541 cgcctcctgc cgccgcagtg cgtcccctgc ctgcccggtc ccccaagccc cgccagcgac 601 gcggagtcct ggggctcagg tgccgccgcc gcctccccgc tctctgaccc cagtagccca 661 gccgcctccg aagacttcac ctaccgcccc ggcgaccctg ttttctcctt cccaagcctg 721 cccaaagact tgctccacac aacgccctgt ttcattcctt accactaggc cctttgtaga 781 cactgttact ttccccctcc cctagtcagc aggcaataga ttgggcccag ctgccgcctc 841 gggacccctc tccaggcgga gggaggaagc gggagcttta aagcagtcgg ggatacctga 901 gccgcttgtt aggtcgccgc accctcgcgg cggatgtctc ttggtctgtt tctccggccc 961 tcagcccagc gcccctcctg cccgccccta gacggccttt ccttttgcac tttctgaact 1021 ccacaaaacc tcctttgtga ctggctcaga actgacccca gccaccactt cagtgtgatt 1081 tagaaaaggg acagatcagc ccctgaagac gaggtgaaaa gtcaatttta caatttgtag 1141 aactctaatg aagaaaaacg agcatgaaaa ttcggtttga gccggctgac aatacaatga 1201 aaaggcttaa aaagcagaga caaggagtgg gcttcatgca ttatggatcc cgacccccac 1261 cactgcag // LOCUS HSU64998 453 bp DNA PRI 05-NOV-1997 DEFINITION Human ribonuclease k6 precursor gene, complete cds. ACCESSION U64998 NID g2585987 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 453) AUTHORS Rosenberg,H.F. and Dyer,K.D. TITLE Molecular cloning and characterization of a novel human ribonuclease (RNase k6): increasing diversity in the enlarging ribonuclease gene family JOURNAL Nucleic Acids Res. 24 (18), 3507-3513 (1996) MEDLINE 96433147 REFERENCE 2 (bases 1 to 453) AUTHORS Rosenberg,H.F. TITLE Direct Submission JOURNAL Submitted (24-JUL-1996) Laboratory of Host Defenses/NIAID/NIH, 10 Center Dr., MSC 1886, Bethesda, MD 20892-1886, USA FEATURES Location/Qualifiers source 1..453 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" CDS 1..453 /note="RNase k6" /codon_start=1 /product="ribonuclease k6 precursor" /db_xref="PID:g2585988" /translation="MVLCFPLLLLLLVLWGPVCPLHAWPKRLTKAHWFEIQHIQPSPL QCNRAMSGINNYTQHCKHQNTFLHDSFQNVAAVCDLLSIVCKNRRHNCHQSSKPVNMT DCRLTSGKYPQCRYSAAAQYKFFIVACDPPQKSDPPYKLVPVHLDSIL" mat_peptide 70..450 /note="RNase k6" /product="ribonuclease k6" BASE COUNT 110 a 127 c 90 g 126 t ORIGIN 1 atggtgctat gctttcctct tcttttactg ctgctggttc tatggggacc agtgtgtcca 61 cttcatgctt ggcctaagcg tctcaccaag gctcactggt ttgaaattca gcatatacag 121 ccaagtcctc tccaatgcaa cagggcaatg agtggcatca acaattatac ccagcactgt 181 aagcatcaaa atacctttct gcatgactct ttccagaacg tggctgctgt ctgtgatttg 241 ctcagcattg tctgcaaaaa tcgtcggcac aactgccacc agagctcaaa gcctgtcaac 301 atgactgact gcagactcac ttcaggaaag tatccccagt gccgctatag tgctgctgcc 361 cagtacaaat tcttcattgt tgcctgtgac ccccctcaga agagcgatcc cccctacaag 421 ttggttcctg tacacttaga tagtattctc taa // LOCUS HSU65402 2061 bp DNA PRI 03-JUL-1997 DEFINITION Human seven transmembrane G-coupled receptor (GPR31) gene, complete cds. ACCESSION U65402 NID g2065522 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2061) AUTHORS Zingoni,A., Rocchi,M., Storlazzi,C.T., Bernardini,G., Santoni,A. and Napolitano,M. TITLE Isolation and chromosomal localization of GPR31, a human gene encoding a putative G protein-coupled receptor JOURNAL Genomics 42 (3), 519-523 (1997) MEDLINE 97349123 REFERENCE 2 (bases 1 to 2061) AUTHORS Zingoni,A., Rocchi,M. and Napolitano,M. TITLE Direct Submission JOURNAL Submitted (26-JUL-1996) Laboratory of Physiopatology, Regina Elena Cancer Institute, Via delle Messi d'oro 156, Rome 00158, Italy FEATURES Location/Qualifiers source 1..2061 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q27" gene 1..2061 /gene="GPR31" CDS 499..1458 /gene="GPR31" /codon_start=1 /product="seven transmembrane G-coupled receptor" /db_xref="PID:g2065523" /translation="MPFPNCSAPSTVVATAVGVLLGLECGLGLLGNAVALWTFLFRVR VWKPYAVYLLNLALADLLLAACLPFLAAFYLSLQAWHLGRVGCWALRFLLDLSRSVGM AFLAAVALDRYLRVVHPRLKVNLLSPQAALGVSGLVWLLMVALTCPGLLISEAAQNST RCHSFYSRADGSFSIIWQEALSCLQFVLPFGLIVFCNAGIIRALQKRLREPEKQPKLQ RAQALVTLVVVLFALCFLPCFLARVLMHIFQNLGSCRALCAVAHTSDVTGSLTYLHSV VNPVVYCFSSPTFRSSYRRVFHTLRGKGQAAEPPDFNPRDSYS" BASE COUNT 442 a 565 c 501 g 553 t ORIGIN 1 ggatcccagc aagcgtcctt tatgtatgaa aaggaagaag aaaatttccc catgaaacat 61 attcaaagta gagaacaatc tttttgattc cattgttatt ttaattgtat acagacatag 121 gagtctttgc ataattagac ttttccttct ttcaggactg tggatgcaaa gccctggacc 181 cccagacgtt ataggacatt actcctcagc tttgcagccc ggtgatgtga agcgaaacac 241 catttcccct tttttatggc ggaagaaaac agaacacaac tgcaaagggg cttttccctc 301 ccctgctcat cctctttccc caaatgaatt ttggtttgct gtggactcta ttctgctgag 361 gaactgttct tgttgggcaa atgtagatct tgtctactct gtggcaggaa aaggcctttt 421 ctttcatttt gtaagaaaga gcacagagtt cctcctgtac ctgctccagc tgtgcctgca 481 gcccctcacg gccgggtgat gccattccca aactgctcag cccccagcac tgtggtggcc 541 acagctgtgg gtgtcttgct ggggctggag tgtgggctgg gtctgctggg caacgcggtg 601 gcgctgtgga ccttcctgtt ccgggtcagg gtgtggaagc cgtacgctgt ctacctgctc 661 aacctggccc tggctgacct gctgttggct gcgtgcctgc ctttcctggc cgccttctac 721 ctgagcctcc aggcttggca tctgggccgt gtgggctgct gggccctgcg cttcctgctg 781 gacctcagcc gcagcgtggg gatggccttc ctggccgccg tggctttgga ccggtacctc 841 cgtgtggtcc accctcggct taaggtcaac ctgctgtctc ctcaggcggc cctgggggtc 901 tcgggcctcg tctggctcct gatggtcgcc ctcacctgcc cgggcttgct catctctgag 961 gccgcccaga actccaccag gtgccacagt ttctactcca gggcagacgg ctccttcagc 1021 atcatctggc aggaagcact ctcctgcctt cagtttgtcc tcccctttgg cctcatcgtg 1081 ttctgcaatg caggcatcat cagggctctc cagaaaagac tccgggagcc tgagaaacag 1141 cccaagcttc agcgggccca ggcactggtc accttggtgg tggtgctgtt tgctctgtgc 1201 tttctgccct gcttcctggc cagagtcctg atgcacatct tccagaatct ggggagctgc 1261 agggcccttt gtgcagtggc tcatacctcg gatgtcacgg gcagcctcac ctacctgcac 1321 agtgtcgtca accccgtggt atactgcttc tccagcccca ccttcaggag ctcctatcgg 1381 agggtcttcc acaccctccg aggcaaaggg caggcagcag agcccccaga tttcaacccc 1441 agagactcct attcctgaca acagccagcg tcctcaacgc ccgtgtttat ggaactacct 1501 gcgacctaaa taataattac tcctactttg ggattctgga agaagaagaa gtcttaagac 1561 tgcaatacaa ggatcagagc ataaacatgg gcacagttgc tgcaggtgtg gtcttatact 1621 ttgttgacca gggtggtcct ctgtgatttt accttgtaga gtggcaaatc aaaaatgaac 1681 aagctagaac ctcctcctac ccaactatga tgcagattca gttgctgaac tgaaaagtcg 1741 ggcagctact ccatctccac acttgaagaa aatgtaattt gctaaatcag tgaaggaaga 1801 gaagaaagcc gggtgatggc atctttccaa ctcttacttg gtctcagcaa gtcattttca 1861 tttattatgc ttcagtttta aatacaaaaa aaaaactatg ttttcttccc acctgctgtg 1921 cagactgggg atgaccgaca tcagaaagtg ccctggttct aaaaagagac tctgctgtat 1981 ataaggtact gtcgtacatg ctagccttta tttggaacat aacatttttg ttttcataaa 2041 attttgcttc atttttctag a // LOCUS HSU66579 1228 bp DNA PRI 15-MAY-1997 DEFINITION Human putative G protein-coupled receptor (GPR20) gene, complete cds. ACCESSION U66579 NID g1753102 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1228) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B.P., Marchese,A., Cheng,R., Heng,H.H., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Cloning and chromosomal mapping of four putative novel human G-protein-coupled receptor genes JOURNAL Gene 187 (1), 75-81 (1997) MEDLINE 97225799 REFERENCE 2 (bases 1 to 1228) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B., Marchese,A., Cheng,R., Heng,H.H.Q., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1228 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q24.3-24.2" gene 64..1140 /gene="GPR20" CDS 64..1140 /gene="GPR20" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g1753103" /translation="MPSVSPAGPSAGAVPNATAVTTVRTNASGLEVPLFHLFARLDEE LHGTFPGLCVALMAVHGAIFLAGLVLNGLALYVFCCRTRAKTPSVIYTINLVVTDLLV GLSLPTRFAVYYGARGCLRCAFPHVLGYFLNMHCSILFLTCICVDRYLAIVRPEAPAA CRQPACARAVCAFVWLAAGAVTLSVLGVTGSRPCCRVFALTVLEFLLPLLVISVFTGR IMCALSRPGLLHQGRQRRVRAMQLLLTVLIIFLVCFTPFHARQVAVALWPDMPHHTSL VVYHVAVTLSSLNSCMDPIVYCFVTSGFQATVRGLFGQHGEREPSSGDVVSMHRSSKG SGRHHILSAGPHALTQALANGPEA" BASE COUNT 162 a 445 c 375 g 246 t ORIGIN 1 aggctggcgc ctggggtgac ctggtctcct tgcttccagg cctggggtgt gctggctgcc 61 gtcatgccct ctgtgtctcc agcggggccc tcggccgggg cagtccccaa tgccaccgca 121 gtgacaacag tgcggaccaa tgccagcggg ctggaggtgc ccctgttcca cctgtttgcc 181 cggctggacg aggagctgca tggcaccttc ccaggcctgt gcgtggcgct gatggcggtg 241 cacggagcca tcttcctggc agggctggtg ctcaacgggc tggcgctgta cgtcttctgc 301 tgccgcaccc gggccaagac accctcagtc atctacacca tcaacctggt ggtgaccgat 361 ctactggtag ggctgtccct gcccacgcgc ttcgctgtgt actacggcgc caggggctgc 421 ctgcgctgtg ccttcccgca cgtcctcggt tacttcctca acatgcactg ctccatcctc 481 ttcctcacct gcatctgcgt ggaccgctac ctggccatcg tgcggcccga agctcccgcc 541 gcctgccgcc agcctgcctg tgccagggcc gtgtgcgcct tcgtgtggct ggccgccggt 601 gccgtcaccc tgtcggtgct gggcgtgaca ggcagccggc cctgctgccg tgtctttgcg 661 ctgactgtcc tggagttcct gctgcccctg ctggtcatca gcgtgtttac cggccgcatc 721 atgtgtgcac tgtcgcggcc gggtctgctc caccagggtc gccagcgccg cgtgcgggcc 781 atgcagctcc tgctcacggt gctcatcatc tttctcgtct gcttcacgcc cttccacgcc 841 cgccaagtgg ccgtggcgct gtggcccgac atgccacacc acacgagcct cgtggtctac 901 cacgtggccg tgaccctcag cagcctcaac agctgcatgg accccatcgt ctactgcttc 961 gtcaccagtg gcttccaggc caccgtccga ggcctcttcg gccagcacgg agagcgtgag 1021 cccagcagcg gtgacgtggt cagcatgcac aggagctcca agggctcagg ccgtcatcac 1081 atcctcagtg ccggccctca cgccctcacc caggccctgg ctaatgggcc cgaggcttag 1141 tcagcagggc tctgccaggg gccgaaggtc aggactcatc tgggcatgcc agcgtggaca 1201 cccaccatgc caggggtggc aatcggtt // LOCUS HSU66580 1160 bp DNA PRI 15-MAY-1997 DEFINITION Human putative G protein-coupled receptor (GPR21) gene, complete cds. ACCESSION U66580 NID g1753104 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1160) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B.P., Marchese,A., Cheng,R., Heng,H.H., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Cloning and chromosomal mapping of four putative novel human G-protein-coupled receptor genes JOURNAL Gene 187 (1), 75-81 (1997) MEDLINE 97225799 REFERENCE 2 (bases 1 to 1160) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B., Marchese,A., Cheng,R., Heng,H.H.Q., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1160 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q33" gene 41..1090 /gene="GPR21" CDS 41..1090 /gene="GPR21" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g1753105" /translation="MNSTLDGNQSSHPFCLLAFGYLETVNFCLLEVLIIVFLTVLIIS GNIIVIFVFHCAPLLNHHTTSYFIQTMAYADLFVGVSCVVPSLSLLHHPLPVEESLTC QIFGFVVSVLKSVSMASLACISIDRYIAITKPLTYNTLVTPWRLRLCIFLIWLYSTLV FLPSFFHWGKPGYHGDVFQWCAESWHTDSYFTLFIVMMLYAPAALIVCFTYFNIFRIC QQHTKDISERQARFSSQSGETGEVQACPDKRYAMVLFRITSVFYILWLPYIIYFLLES STGHSNRFASFLTTWLAISNSFCNCVIYSLSNSVFQRGLKRLSGAMCTSCASQTTAND PYTVRSKGPLNGCHI" BASE COUNT 254 a 283 c 248 g 375 t ORIGIN 1 aagcggcagc atgaagtgac agatcactcc tgagctcaag atgaactcca ccttggatgg 61 taatcagagc agccaccctt tttgcctctt ggcatttggc tatttggaaa ctgtcaattt 121 ttgccttttg gaagtattga ttattgtctt tctaactgta ttgattattt ctggcaacat 181 cattgtgatt tttgtatttc actgtgcacc tttgttgaac catcacacta caagttattt 241 tatccagact atggcatatg ctgacctttt tgttggggtg agctgcgtgg tcccttcttt 301 atcactcctc catcaccccc ttccagtaga ggagtccttg acttgccaga tatttggttt 361 tgtagtatca gttctgaaga gcgtctccat ggcttctctg gcctgtatca gcattgatag 421 atacattgcc attactaaac ctttaaccta taatactctg gttacaccct ggagactacg 481 cctgtgtatt ttcctgattt ggctatactc gaccctggtc ttcctgcctt cctttttcca 541 ctggggcaaa cctggatatc atggagatgt gtttcagtgg tgtgcggagt cctggcacac 601 cgactcctac ttcaccctgt tcatcgtgat gatgttatat gccccagcag cccttattgt 661 ctgcttcacc tatttcaaca tcttccgcat ctgccaacag cacacaaagg atatcagcga 721 aaggcaagcc cgcttcagca gccagagtgg ggagactggg gaagtgcagg cctgtcctga 781 taagcgctat gccatggtcc tgtttcgaat cactagtgta ttttacatcc tctggttgcc 841 atatatcatc tacttcttgt tggaaagctc cactggccac agcaaccgct tcgcatcctt 901 cttgaccacc tggcttgcta ttagtaacag tttctgcaac tgtgtaattt atagtctctc 961 caacagtgta ttccaaagag gactaaagcg cctctcaggg gctatgtgta cttcttgtgc 1021 aagtcagact acagccaacg acccttacac agttagaagc aaaggccctc ttaatggatg 1081 tcatatctga agtggctcag ttacggggtt cccgtgtgtg tgtgtgtgtg tgtgtgtgtg 1141 tgtgtgtatt ttatctctaa // LOCUS HSU66581 1881 bp DNA PRI 15-MAY-1997 DEFINITION Human putative G protein-coupled receptor (GPR22) gene, complete cds. ACCESSION U66581 NID g1753106 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1881) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B.P., Marchese,A., Cheng,R., Heng,H.H., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Cloning and chromosomal mapping of four putative novel human G-protein-coupled receptor genes JOURNAL Gene 187 (1), 75-81 (1997) MEDLINE 97225799 REFERENCE 2 (bases 1 to 1881) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B., Marchese,A., Cheng,R., Heng,H.H.Q., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1881 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q22-q31.1" gene 237..1538 /gene="GPR22" CDS 237..1538 /gene="GPR22" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g1753107" /translation="MCFSPILEINMQSESNITVRDDIDDINTNMYQPLSYPLSFQVSL TGFLMLEIVLGLGSNLTVLVLYCMKSNLINSVSNIITMNLHVLDVIICVGCIPLTIVI LLLSLESNTALICCFHEACVSFASVSTAINVFAITLDRYDISVKPANRILTMGRAVML MISIWIFSFFSFLIPFIEVNFFSLQSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIP IFFFTVVVMLITYTKILQALNIRIGTRFSTGQKKKARKKKTISLTTQHEATDMSQSSG GRNVVFGVRTSVSVIIALRRAVKRHRERRERQKRVFRMSLLIISTFLLCWTPISVLNT TILCLGPSDLLVKLRLCFLVMAYGTTIFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEA DPLPNNAVIHNSWIDPKRNKKITFEDSEIREKRLVPQVVTD" BASE COUNT 649 a 317 c 288 g 627 t ORIGIN 1 gttatttctt caaaaggaaa acacaatttt cttttatatc aaaacaatgc aaacttgatg 61 gttcttaatt ctacattttc tattaatagt ttacaaactt aaaaattaaa ctaagtacac 121 aattgaaaga tttttttttc ttacaaagaa cacgttatac gtcatttaaa ttgccaaata 181 tcaaatagtt tattctattt cactttctag ggaaaaaaac caactgctcc aaaagaatgt 241 gtttttctcc cattctggaa atcaacatgc agtctgaatc taacattaca gtgcgagatg 301 acattgatga catcaacacc aatatgtacc aaccactatc atatccgtta agctttcaag 361 tgtctctcac cggatttctt atgttagaaa ttgtgttggg acttggcagc aacctcactg 421 tattggtact ttactgcatg aaatccaact taatcaactc tgtcagtaac attattacaa 481 tgaatcttca tgtacttgat gtaataattt gtgtgggatg tattcctcta actatagtta 541 tccttctgct ttcactggag agtaacactg ctctcatttg ctgtttccat gaggcttgtg 601 tatcttttgc aagtgtctca acagcaatca acgtttttgc tatcactttg gacagatatg 661 acatctctgt aaaacctgca aaccgaattc tgacaatggg cagagctgta atgttaatga 721 tatccatttg gattttttct tttttctctt tcctgattcc ttttattgag gtaaattttt 781 tcagtcttca aagtggaaat acctgggaaa acaagacact tttatgtgtc agtacaaatg 841 aatactacac tgaactggga atgtattatc acctgttagt acagatccca atattctttt 901 tcactgttgt agtaatgtta atcacataca ccaaaatact tcaggctctt aatattcgaa 961 taggcacaag attttcaaca gggcagaaga agaaagcaag aaagaaaaag acaatttctc 1021 taaccacaca acatgaggct acagacatgt cacaaagcag tggtgggaga aatgtagtct 1081 ttggtgtaag aacttcagtt tctgtaataa ttgccctccg gcgagctgtg aaacgacacc 1141 gtgaacgacg agaaagacaa aagagagtct tcaggatgtc tttattgatt atttctacat 1201 ttcttctctg ctggacacca atttctgttt taaataccac cattttatgt ttaggcccaa 1261 gtgacctttt agtaaaatta agattgtgtt ttttagtcat ggcttatgga acaactatat 1321 ttcaccctct attatatgca ttcactagac aaaaatttca aaaggtcttg aaaagtaaaa 1381 tgaaaaagcg agttgtttct atagtagaag ctgatcccct gcctaataat gctgtaatac 1441 acaactcttg gatagatccc aaaagaaaca aaaaaattac ctttgaagat agtgaaataa 1501 gagaaaaacg tttagtgcct caggttgtca cagactagag aaaagtctca gtttcaccaa 1561 atccacattc aaatgagttt taaatttaaa ttgtaaaaac tgatattact gccaaatata 1621 agaaaaatat tttaagtatt ggttatgttg taaattttca atgtgaaatg ctaattagat 1681 aggtcatata tattcaattt cttcattact taatgtattt gttgcatggc agtttgttaa 1741 agtactatca tgtgtatatt ttgtcaatat tatgtccaac agaaaatatt catgtaagtc 1801 atatttttta aggaataaat acatagcctt aaaacagtgt ataactttaa aatgtaaaaa 1861 aaaaaaaaaa aaaaaaaaaa a // LOCUS HSU66840 2166 bp DNA PRI 01-JAN-1997 DEFINITION Human MAP kinase 3c gene, complete cds. ACCESSION U66840 NID g1778154 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2166) AUTHORS Han,Jiahuai. TITLE Direct Submission JOURNAL Submitted (13-AUG-1996) Immunology, The Scripps Research Institute, 10666 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2166 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 372..1430 /note="protein kinase" /codon_start=1 /product="MAP kinase 3c" /db_xref="PID:g1778155" /translation="MGVQGTLMSRDSQTPHLLSILGKSKRKKDLRISCMSKPPAPNPT PPRNLDSRTFITIGDRNFEVEADDLVTISELGRGAYGVVEKVRHAQSGTIMAVKRIRA TVNSQEQKRLLMDLDINMRTVDCFYTVTFYGALFREGDVWICMELMDTSLDKFYRKVL DKNMTIPEDILGEIAVSIVRALEHLHSKLSVIHRDVKPSNVLINKEGHVKMCDFGISG YLVDSVAKTMDAGCKPYMAPERINPELNQKGYNVKSDVWSLGITMIEMAILRFPYESW GTPFQQLKQVVEEPSPQLPADRFSPEFVDFTAQCLRKNPAERMSYLELMEHPFFTLHK TKKTDIAAFVKKILGEDS" BASE COUNT 434 a 644 c 630 g 458 t ORIGIN 1 actcataggg ctcgagcggg cgccgggggg tcagctatga agcccctcta cagtggctgc 61 tgcaggtgct ggctggaccc cgctgggccc cttctcaccc cacatcccat tgattgatta 121 gtcaggcagg gcagtgagga cacctcacag atggggaaac tgaagcccag ggaggtactc 181 aagatgtgcc cagtgcgggc tgagataagg ccctggagtc cacctccagg ctgggtctgt 241 cctgctctgc tccctgcagg gctggtgggc ctccttcccc cttttgacta acggcctggc 301 ttggagagag ggggctcccg gcatgggtga tggcagaggc tggcaccctt gtggccaggg 361 cctgatggct catgggagtg caggggacat tgatgtcaag ggatagccag acgcctcacc 421 ttctctccat tctaggaaaa tccaagagga agaaggatct acggatatcc tgcatgtcca 481 agccacccgc acccaacccc acaccccccc ggaacctgga ctcccggacc ttcatcacca 541 ttggagacag aaactttgag gtggaggctg atgacttggt gaccatctca gaactgggcc 601 gtggagccta tggggtggta gagaaggtgc ggcacgccca gagcggcacc atcatggccg 661 tgaagcggat ccgggccacc gtgaactcac aggagcagaa gcggctgctc atggacctgg 721 acatcaacat gcgcacggtc gactgtttct acactgtcac cttctacggg gcactattca 781 gagagggaga cgtgtggatc tgcatggagc tcatggacac atccttggac aagttctacc 841 ggaaggtgct ggataaaaac atgacaattc cagaggacat ccttggggag attgctgtgt 901 ctatcgtgcg ggccctggag catctgcaca gcaagctgtc ggtgatccac agagatgtga 961 agccctccaa tgtccttatc aacaaggagg gccatgtgaa gatgtgtgac tttggcatca 1021 gtggctactt ggtggactct gtggccaaga cgatggatgc cggctgcaag ccctacatgg 1081 cccctgagag gatcaaccca gagctgaacc agaagggcta caatgtcaag tccgacgtct 1141 ggagcctggg catcaccatg attgagatgg ccatcctgcg gttcccttac gagtcctggg 1201 ggaccccgtt ccagcagctg aagcaggtgg tggaggagcc gtccccccag ctcccagccg 1261 accgtttctc ccccgagttt gtggacttca ctgctcagtg cctgaggaag aaccccgcag 1321 agcgtatgag ctacctggag ctgatggagc accccttctt caccttgcac aaaaccaaga 1381 agacggacat tgctgccttc gtgaagaaga tcctgggaga agactcatag gggctgggcc 1441 tcggacccca ctccggccct ccagagcccc acagccccat ctgcgggggc agtgctcacc 1501 cacaccataa gctactgcca tcctggccca gggcatctgg gaggaaccga gggggctgct 1561 cccacctggc tctgtggcga gccatttgtc ccaagtgcca aagaagcaga ccattggggc 1621 tcccagccag gcccttgtcg gccccaccag tgcctctccc tgctgctcct aggacccgtc 1681 tccagctgct gagatcctgg actgaggggg cctggatgcc ccctgtggat gctgctgccc 1741 ctgcacagca ggctgccagt gcctgggtgg atgggccacc gccttgccca gcctggatgc 1801 catccaagtt gtatattttt ttaatctctc gactgaatgg actttgcaca ctttggccca 1861 gggtggccac acctctatcc cggctttggt gcggggtaca caagagggga tgagttgtgt 1921 gaatacccca agactcccat gagggagatg ccatgagccg cccaaggcct tcccctggca 1981 ctggcaaaca gggcctctgc ggagcacact ggctcaccca gtcctgcccg ccaccgttat 2041 cggtgtcatt cacctttcgt gtttttttta atttatcctc tgttgatttt ttcttttgct 2101 ttatgggttt ggcttgtttt tcttgcatgg tttggagctg atcgcttctc ccccaccccc 2161 tagggg // LOCUS HSU71092 1877 bp DNA PRI 19-DEC-1996 DEFINITION Human somatostatin receptor-like protein (SLC1) gene, complete cds. ACCESSION U71092 NID g1737178 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1877) AUTHORS Kolakowski,L.F. Jr., Jung,B.P., Nguyen,T., Johnson,M.P., Lynch,K.R., Cheng,R., Heng,H.H., George,S.R. and O'Dowd,B.F. TITLE Characterization of a human gene related to genes encoding somatostatin receptors JOURNAL FEBS Lett. 398 (2-3), 253-258 (1996) MEDLINE 97131607 REFERENCE 2 (bases 1 to 1877) AUTHORS Kolakowski,L.F.J.r., Jung,B.P., Nguyen,T., Johnson,M.P., Lynch,K.R., Cheng,R., Heng,H.H.Q., George,S.R. and O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (17-SEP-1996) Department of Pharmacology, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Drive, San Antonio, TX 78284-7764, USA FEATURES Location/Qualifiers source 1..1877 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q13.3" repeat_region 237..260 /rpt_family="dinucleotide repeat" /rpt_type=tandem /rpt_unit=CA gene 382..1590 /gene="SLC1" CDS 382..1590 /gene="SLC1" /note="G-protein-coupled receptor" /codon_start=1 /product="somatostatin receptor-like protein" /db_xref="PID:g1737179" /translation="MLCPSKTDGSGHSGRIHQETHGEGKRDKISNSEGRENGGRGFQM NGGSLEAEHASRMSVLRAKPMSNSQRLLLLSPGSPPRTGSISYINIIMPSVFGTICLL GIIGNSTVIFAVVKKSKLHWCNNVPDIFIINLSVVDLLFLLGMPFMIHQLMGNGVWHF GETMCTLITAMDANSQFTSTYILTAMAIDRYLATVHPISSTKFRKPSVATLVICLLWA LSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFALPFVVITAA YVRILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRP TLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLSVKPAAQGQLRAVSNAQTA DEERTESKGT" BASE COUNT 435 a 563 c 482 g 397 t ORIGIN 1 tccaacagac agtttctgtc tctgcttcac tcaagaagcc caggctcaga agataccaat 61 caaggaaatc cccgctagga agcctggggt agggagagct gctggcttga ccagggcaca 121 gccggcaaaa gcctctacaa gacagtcacc cacagatatg cccaagaatc agtacacagt 181 ttccaaccag agatctccaa aatgaaacac tcagggctac acataggaaa agcacgcaca 241 cacacacaca cacacacaca gacacttact tttgtgtcct tctggctatg ctgacgagtt 301 ttcctggtga agcccggggc tcacagagta atctctgcag acaactgtgg ttcttgcctc 361 tggtgcctgc aggaggcagg catgttgtgt ccttccaaga cagatggctc agggcactct 421 ggtaggattc accaggaaac tcatggagaa gggaaaaggg acaagattag caacagtgaa 481 gggagggaga atggtgggag aggattccag atgaacggtg ggtcgctgga ggctgagcat 541 gccagcagga tgtcagttct cagagcaaag cccatgtcaa acagccaacg cttgctcctt 601 ctgtccccag gatcacctcc tcgcacgggg agcatctcct acatcaacat catcatgcct 661 tcggtgttcg gcaccatctg cctcctgggc atcatcggga actccacggt catcttcgcg 721 gtcgtgaaga agtccaagct gcactggtgc aacaacgtcc ccgacatctt catcatcaac 781 ctctcggtag tagatctcct ctttctcctg ggcatgccct tcatgatcca ccagctcatg 841 ggcaatgggg tgtggcactt tggggagacc atgtgcaccc tcatcacggc catggatgcc 901 aatagtcagt tcaccagcac ctacatcctg accgccatgg ccattgaccg ctacctggcc 961 actgtccacc ccatctcttc cacgaagttc cggaagccct ctgtggccac cctggtgatc 1021 tgcctcctgt gggccctctc cttcatcagc atcacccctg tgtggctgta tgccagactc 1081 atccccttcc caggaggtgc agtgggctgc ggcatacgcc tgcccaaccc agacactgac 1141 ctctactggt tcaccctgta ccagtttttc ctggcctttg ccctgccttt tgtggtcatc 1201 acagccgcat acgtgaggat cctgcagcgc atgacgtcct cagtggcccc cgcctcccag 1261 cgcagcatcc ggctgcggac aaagagggtg acccgcacag ccatcgccat ctgtctggtc 1321 ttctttgtgt gctgggcacc ctactatgtg ctacagctga cccagttgtc catcagccgc 1381 ccgaccctca cctttgtcta cttatacaat gcggccatca gcttgggcta tgccaacagc 1441 tgcctcaacc cctttgtgta catcgtgctc tgtgagacgt tccgcaaacg cttggtcctg 1501 tcggtgaagc ctgcagccca ggggcagctt cgcgctgtca gcaacgctca gacggctgac 1561 gaggagagga cagaaagcaa aggcacctga tacttcccct gccaccctgc acacctccaa 1621 gtcagggcac cacaacacgc caccgggaga gatgctgaga aaaacccaag accgctcggg 1681 aaatgcagga aggccgggtt gtgaggggtt gttgcaatga aataaataca ttccatgggc 1741 tcacacgttg ctggggaggc ctggagtcag gtttggggtt ttcagatatc agaaatccct 1801 tgggggagca ggatgagacc tttggataga acagaagctg agcaagagaa catgttggtt 1861 tggataaccg gttgcac // LOCUS HSU72398 723 bp DNA PRI 23-OCT-1996 DEFINITION Human Bcl-x beta (bcl-x) gene, complete cds. ACCESSION U72398 NID g1622940 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 723) AUTHORS Inohara,N. and Ohta,S. TITLE The nucleotide sequence of the human Bcl-x beta JOURNAL Unpublished REFERENCE 2 (bases 1 to 723) AUTHORS Inohara,N. and Ohta,S. TITLE Direct Submission JOURNAL Submitted (24-SEP-1996) Division of Biochemistry, Institute of Gerontology, 1-396 Kosugi-cho, Nakahara-ku, Kawasaki City 211, Japan FEATURES Location/Qualifiers source 1..723 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q11.2" gene 1..684 /gene="bcl-x" CDS 1..684 /gene="bcl-x" /codon_start=1 /product="Bcl-x beta" /db_xref="PID:g1622941" /translation="MSQSNRELVVDFLSYKLSQKGYSWSQFSDVEENRTEAPEGTESE METPSAINGNPSWHLADSPAVNGATGHSSSLDAREVIPMAAVKQALREAGDEFELRYR RAFSDLTSQLHITPGTAYQSFEQVVNELFRDGVNWGRIVAFFSFGGALCVESVDKEMQ VLVSRIAAWMATYLNDHLEPWIQENGGWVRTKPLVCPFSLASGQRSPTALLLYLFLLC WVIVGDVDS" BASE COUNT 166 a 180 c 217 g 160 t ORIGIN 1 atgtctcaga gcaaccggga gctggtggtt gactttctct cctacaagct ttcccagaaa 61 ggatacagct ggagtcagtt tagtgatgtg gaagagaaca ggactgaggc cccagaaggg 121 actgaatcgg agatggagac ccccagtgcc atcaatggca acccatcctg gcacctggca 181 gacagccccg cggtgaatgg agccactggc cacagcagca gtttggatgc ccgggaggtg 241 atccccatgg cagcagtaaa gcaagcgctg agggaggcag gcgacgagtt tgaactgcgg 301 taccggcggg cattcagtga cctgacatcc cagctccaca tcaccccagg gacagcatat 361 cagagctttg aacaggtagt gaatgaactc ttccgggatg gggtaaactg gggtcgcatt 421 gtggcctttt tctccttcgg cggggcactg tgcgtggaaa gcgtagacaa ggagatgcag 481 gtattggtga gtcggatcgc agcttggatg gccacttacc tgaatgacca cctagagcct 541 tggatccagg agaacggcgg ctgggtaaga accaagcccc ttgtgtgtcc cttttctttg 601 gcctctggtc agagatcccc aacagccctt cttctgtatc tctttctgtt gtgttgggtg 661 attgttggag acgttgatag ttgaggaaac ctgactggcc tcatttcacc acaagaggtt 721 aac // LOCUS HSU73192 2896 bp DNA PRI 08-JAN-1997 DEFINITION Human inward rectifier potassium channel Kir1.2 (Kir1.2) gene, complete cds. ACCESSION U73192 NID g1765986 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2896) AUTHORS Shuck,M.E., Piser,T.M., Bock,J.H., Slightom,J.L., Lee,K.S. and Bienkowski,M.J. TITLE Cloning and characterization of two K+ inward rectifier (Kir) 1.1 potassium channel homologs from human kidney (Kir1.2 and Kir1.3) JOURNAL J. Biol. Chem. 272 (1), 586-593 (1997) MEDLINE 97150765 REFERENCE 2 (bases 1 to 2896) AUTHORS Shuck,M.E., Piser,T.M., Bock,J.H., Slightom,J.L., Lee,K.S. and Bienkowski,M.J. TITLE Direct Submission JOURNAL Submitted (27-SEP-1996) Molecular Biology, Pharmacia & Upjohn, 301 Henriette Street, Kalamazoo, MI 49007, USA FEATURES Location/Qualifiers source 1..2896 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /tissue_type="kidney" gene 439..1578 /gene="Kir1.2" CDS 439..1578 /gene="Kir1.2" /codon_start=1 /product="inward rectifier potassium channel Kir1.2" /db_xref="PID:g1765987" /translation="MTSVAKVYYSQTTQTESRPLMGPGIRRRRVLTKDGRSNVRMEHI ADKRFLYLKDLWTTFIDMQWRYKLLLFSATFAGTWFLFGVVWYLVAVAHGDLLELDPP ANHTPCVVQVHTLTGAFLFSLESQTTIGYGFRYISEECPLAIVLLIAQLVLTTILEIF ITGTFLAKIARPKKRAETIRFSQHAVVASHNGKPCLMIRVANMRKSLLIGCQVTGKLL QTHQTKEGENIRLNQVNVTFQVDTASDSPFLILPLTFYHVVDETSPLKDLPLRSGEGD FELVLILSGTVESTSATCQVRTSYLPEEILWGYEFTPAISLSASGKYIADFSLFDQVV KVASPSGLRDSTVRYGDPEKLKLEESLREQAEKEGSALSVRISNV" BASE COUNT 675 a 815 c 658 g 748 t ORIGIN 1 ctgatctgat cttatcttct ctcttctttt ctttgagtgt gaattttcct gtttccccca 61 ggctggagtg cagtggcgcg atgtcggctc actgcaacct ctgtctcccg ggttcaagcg 121 attctcctgc ctcagcctcc tgagtagctg ggactacagg cgcatgccac catgcccagc 181 taatttttgt atttttagta gagacagggt tttgccttgt tggccaggct ggtcttgaac 241 tcctgacctc aggcgatcca cccgcctcgg cccctgcaca gtgcctggca catagcaagt 301 gctcaataaa tatttggtaa gacaagaaca cataagcgac attcaaatga atgtcaattc 361 ctccctccca tggggtgagg gttaggagtc agctggattt ctacgataac ctccattatg 421 ctgtcttgct ccctccagat gacgtcagtt gccaaggtgt attacagtca gaccactcag 481 acagaaagcc ggcccctaat gggcccaggg atacgacggc ggagagtcct gacaaaagat 541 ggtcgcagca acgtgagaat ggagcacatt gccgacaagc gcttcctcta cctcaaggac 601 ctgtggacaa ccttcattga catgcagtgg cgctacaagc ttctgctctt ctctgcgacc 661 tttgcaggca catggttcct ctttggcgtg gtgtggtatc tggtagctgt ggcacatggg 721 gacctgctgg agctggaccc cccggccaac cacaccccct gtgtggtaca ggtgcacaca 781 ctcactggag ccttcctctt ctcccttgaa tcccaaacca ccattggcta tggcttccgc 841 tacatcagtg aggaatgtcc actggccatt gtgcttctta ttgcccagct ggtgctcacc 901 accatcctgg aaatcttcat cacaggtacc ttcctggcga agattgcccg gcccaagaag 961 cgggctgaga ccattcgttt cagccagcat gcagttgtgg cctcccacaa tggcaagccc 1021 tgcctcatga tccgagttgc caatatgcgc aaaagcctcc tcattggctg ccaggtgaca 1081 ggaaaactgc ttcagaccca ccaaaccaag gaaggggaga acatccggct caaccaggtc 1141 aatgtgactt tccaagtaga cacagcctct gacagcccct tccttattct accccttacc 1201 ttctatcatg tggtagatga gaccagtccc ttgaaagatc tccctcttcg cagtggtgag 1261 ggtgactttg agctggtgct gatcctaagt gggacagtgg agtccaccag tgccacctgt 1321 caggtgcgca cttcctacct gccagaggag atcctttggg gctacgagtt cacacctgcc 1381 atctcactgt cagccagtgg taaatacata gctgacttta gcctttttga ccaagttgtg 1441 aaagtggcct ctcctagtgg cctccgtgac agcactgtac gctacggaga ccctgaaaag 1501 ctcaagttgg aggagtcatt aagggagcaa gctgagaagg agggcagtgc ccttagtgtg 1561 cgcatcagca atgtctgatg acctgttccc actcccccat tcctctggtc tcttttcctc 1621 tcttccaatg ccctggtaag gaatactacc cgggtttact ggagatcccc cgaagcaccc 1681 atcctccact ccctcttctt taacccagtg gcctgttggt agcttaggcc aactggagtc 1741 caggttcgcc tcccactgtc ccctttccac ttccccagct tctgccccaa tacacatacc 1801 tcccttaagc caggatgggg gaaagagtgg gattaggctg aagtggctta gaaggcctca 1861 gccatgcttg gatactcaca ttaggaggac catgtggttg gaaggataga ctgcccccta 1921 cctcccacca ccaccatgaa gtttggtgac ttgaggctgg agctccctct gttacctttc 1981 catctagcaa gttcccaaag gcaagactct ctctgatggt cactttgtgg tctgtgcttt 2041 cagaaataca ggaatctgat atcaacatat cctagggttt ctaccaatct ctgttgaaag 2101 aagccagggt ttgccactgt gaagcttgat ttctgctggt gacttctgac cataagctag 2161 aaccatggtc gccactgttt tccctctgta gtttctcaag tgaacactct caggataccc 2221 agttccctca tagcctctgt tctcagagaa ttggagttgg cccaagaaac ataaacatat 2281 aaccacccat atctatcctg gattctgaac tcttcaattt ggagtgacta acacaagttg 2341 ttatctaaac ctttaaacct atcttccagg cagcccagag aagatctgtt tccctgtgtc 2401 ctgtgaatgg aaggacccaa gccaatatgt tcctttgaaa agagtccagt acccaggccc 2461 catggaaagg tctgaaaata atattccaga ttacactgta cctggcttct cttcttcctt 2521 tcctgctcag cctagatcct tcttccttaa ccccaactct ttgggagaag ggagggaaaa 2581 tgcaagggcc ttcctctctt aacacggatg ctcaagtaaa actagattca cagggcacag 2641 attccccaga aagttaacac aatcccacca tgagggatgg gtaaattctc agatttccaa 2701 actgctgtac agagcctctg agaattggtg atgctttgtt aaggtttggg caggagcaga 2761 actctgtggc tggcagccac tattctcagt tacacctccc agtgcccttc tgaaaagtgc 2821 cagctatttc attaggcaat gctggaagga aatgaaatta taccttctga tcaaataacc 2881 atggcttccc tcagcc // LOCUS HSU73304 5665 bp DNA PRI 05-NOV-1996 DEFINITION Human CB1 cannabinoid receptor (CNR1) gene, complete cds. ACCESSION U73304 NID g1657840 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5665) AUTHORS Hoehe,M.R., Caenazzo,L., Martinez,M.M., Hsieh,W.T., Modi,W.S., Gershon,E.S. and Bonner,T.I. TITLE Genetic and physical mapping of the human cannabinoid receptor gene to chromosome 6q14-q15 JOURNAL New Biol. 3 (9), 880-885 (1991) MEDLINE 92031291 REFERENCE 2 (bases 1 to 5665) AUTHORS Bonner,T.I. TITLE The coding exon of the human CB1 cannabinoid receptor JOURNAL Unpublished REFERENCE 3 (bases 1 to 5665) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (03-OCT-1996) Lab of Cell Biology, NIMH, Bldg. 36, Rm 3A-17, MSC 4090, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..5665 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q14-15" intron <1..58 /note="5 exons located 19-23 kb upstream" mRNA <59..5530 /gene="CNR1" gene 59..5530 /gene="CNR1" CDS 122..1540 /gene="CNR1" /note="G protein-coupled receptor" /codon_start=1 /product="CB1 cannabinoid receptor" /db_xref="PID:g1657841" /translation="MKSILDGLADTTFRTITTDLLYVGSNDIQYEDIKGDMASKLGYF PQKFPLTSFRGSPFQEKMTAGDNPQLVPADQVNITEFYNKSLSSFKENEENIQCGENF MDIECFMVLNPSQQLAIAVLSLTLGTFTVLENLLVLCVILHSRSLRCRPSYHFIGSLA VADLLGSVIFVYSFIDFHVFHRKDSRNVFLFKLGGVTASFTASVGSLFLTAIDRYISI HRPLAYKRIVTRPKAVVAFCLMWTIAIVIAVLPLLGWNCEKLQSVCSDIFPHIDETYL MFWIGVTSVLLLFIVYAYMYILWKAHSHAVRMIQRGTQKSIIIHTSEDGKVQVTRPDQ ARMDIRLAKTLVLILVVLIICWGPLLAIMVYDVFGKMNKLIKTVFAFCSMLCLLNSTV NPIIYALRSKDLRHAFRSMFPSCEGTAQPLDNSMGDSDCLHKHANNAASVHRAAESCI KSTVKIAKVTMSVSTDTSAEAL" polyA_signal 5499..5504 /gene="CNR1" polyA_site 5530 /gene="CNR1" /note="location established by comparison with ESTs with GenBank Accession Numbers R20626, R42346, H06205, H10202" BASE COUNT 1603 a 1185 c 1118 g 1759 t ORIGIN 1 tttgttttta ttcttcctgt ttctcaccat tcggcttatt tgttttccct cctcttagga 61 ttgccccctg tgggtcactt tctcagtcat tttgagctca gcctaatcaa agactgaggt 121 tatgaagtcg atcctagatg gccttgcaga taccaccttc cgcaccatca ccactgacct 181 cctgtacgtg ggctcaaatg acattcagta cgaagacatc aaaggtgaca tggcatccaa 241 attagggtac ttcccacaga aattcccttt aacttccttt aggggaagtc ccttccaaga 301 gaagatgact gcgggagaca acccccagct agtcccagca gaccaggtga acattacaga 361 attttacaac aagtctctct cgtccttcaa ggagaatgag gagaacatcc agtgtgggga 421 gaacttcatg gacatagagt gtttcatggt cctgaacccc agccagcagc tggccattgc 481 agtcctgtcc ctcacgctgg gcaccttcac ggtcctggag aacctcctgg tgctgtgcgt 541 catcctccac tcccgcagcc tccgctgcag gccttcctac cacttcatcg gcagcctggc 601 ggtggcagac ctcctgggga gtgtcatttt tgtctacagc ttcattgact tccacgtgtt 661 ccaccgcaaa gatagccgca acgtgtttct gttcaaactg ggtggggtca cggcctcctt 721 cactgcctcc gtgggcagcc tgttcctcac agccatcgac aggtacatat ccattcacag 781 gcccctggcc tataagagga ttgtcaccag gcccaaggcc gtggtagcgt tttgcctgat 841 gtggaccata gccattgtga tcgccgtgct gcctctcctg ggctggaact gcgagaaact 901 gcaatctgtt tgctcagaca ttttcccaca cattgatgaa acctacctga tgttctggat 961 cggggtcacc agcgtactgc ttctgttcat cgtgtatgcg tacatgtata ttctctggaa 1021 ggctcacagc cacgccgtcc gcatgattca gcgtggcacc cagaagagca tcatcatcca 1081 cacgtctgag gatgggaagg tacaggtgac ccggccagac caagcccgca tggacattag 1141 gttagccaag accctggtcc tgatcctggt ggtgttgatc atctgctggg gccctctgct 1201 tgcaatcatg gtgtatgatg tctttgggaa gatgaacaag ctcattaaga cggtgtttgc 1261 attctgcagt atgctctgcc tgctgaactc caccgtgaac cccatcatct atgctctgag 1321 gagtaaggac ctgcgacacg ctttccggag catgtttccc tcttgtgaag gcactgcgca 1381 gcctctggat aacagcatgg gggactcgga ctgcctgcac aaacacgcaa acaatgcagc 1441 cagtgttcac agggccgcag aaagctgcat caagagcacg gtcaagattg ccaaggtaac 1501 catgtctgtg tccacagaca cgtctgccga ggctctgtga gcctgatgcc tccctggcag 1561 cacaggaaaa gaattttttt ttttaagctc aaaatctaga agagtctatt gtctccttgg 1621 ttatattttt ttaactttac catgctcaat gaaaaggtga ttgtcaccat gatcacttat 1681 cagtttgcta atgtttccat agtttaggta ctcaaactcc attctccagg ggtttacagt 1741 gaagaaagcc tgttgtttaa gtgactgaac gatccttcaa agtctcaatg aaataggagg 1801 gaaacctttg gctacacaat tggaagtcta agaacccatg gaaaaatgcc atcaaatgaa 1861 taatgccttt gtaaccacaa ctttcactat aatgtgaaat gtaactgtcc gtagtatcag 1921 agatgtccat ttttacaagt tatagtacta gagatatttt gtaaaatgta ttatgtcctg 1981 tgagatgtgt atcagtgttt atgtgctatt aatatttgtt tagttcagca aaactgaaag 2041 gtagactttt atgagaacaa tggacaagca gtggatacgt gtcaatgtgt gcactttttt 2101 tctatattat tgcccatgat ataactttag aaataaacct taatatttct tcaaatatct 2161 ctatttaatt ttgacactga aataaccgta aaggtttatt tttctgttac ctcaacaaga 2221 agaatttgaa gacttcaaaa tattgagcag aattcattca tacttaaaaa tttattagcc 2281 ctgcattttc ataggaagac acattatctt ctggactata gctgttctaa tggattataa 2341 tcagaatgga agagagaaag catattgact ttttttgagc gacatctctg actttcttta 2401 gtctttagct attactggat ctcttaagac agcatgtgtt aatcttaatg tatatcgtta 2461 tcactgtgca gttgctgttt acttgaatag tattgtgttc ctatattcca ggtttaagta 2521 gatttcatgc ctgggtggcc aaacaacagt cttcattttt tttaattgaa aagaagtagt 2581 gtctggatca gtaaaattat actgtgtgtg agtgtgaata taaatgtgtg tatgtgtgtt 2641 tctgtccgta actgttacag taatgtcata aagtgagaaa actgtgacca agtataaact 2701 tttaccactt gctgcactct tgcacatgga ttcagtttct aaaattgagt tcttcctgta 2761 atcttgttga taaaaatact gactccaacc attcaaaaat ttcaccccat ccctccttaa 2821 gagattggat caagtattac taaattgacc tttaggtatt acacaagacc agtgcttagc 2881 aaaaaataat gacaggcatc caaggaaggg atgtatttgt agtgttattg ccaggaaagg 2941 agagtacttt ggtttctgag caccgaatat tgagcaatat gtcagtcact aaaaggaaga 3001 cagttctaca gaaaaacaaa tggtaacatt tttcaatagc gtgtgtagat agtatgcact 3061 atatacatca cgttaaagta ggactatcac acccagccca tgtggctaaa aaagctgaat 3121 cagacagtgg atgagacaca caacggcagt gaagaaccga tacacttggc attgacgtct 3181 agctatgctg tatctgtgct ttgcccacat gcccttggtg acagctgagc acccagctct 3241 gtcttggtag gtttgggcta aggaacaaat ctctcctttg ctcgtggtta gcaagataca 3301 ctcaagcatg aagataaaca cagctgcttt cttcttacac cccggtctca tgctccttaa 3361 tggcgccatg ggtgcttgtt gggccttttt ccagtaagga atgatattgc tgaagaatct 3421 acttaaccct gacaaatttt aattataatc tcttcttata cagataaaac atgactccta 3481 caaggcccca aggtttacat agtctgaagt gaagtacaga gctggcatct atctggtgat 3541 ttctagctct cgagataccc aagcagcctg atggggcagt tccccttctt acggttcacg 3601 ctctaaggca ggatgtggct tatgagatac tttgcattgt ctgtctgcac accttgaatc 3661 tgcctgctgg ctcccttact ttacctctct gtcatgtgca gatgaaggct cagggtgcta 3721 gaggattagt aagatctctt tctaaagaca ggagagatta tttacaagaa gaactcacca 3781 gggtttagtt tgcatttaag aattgccagt cttttgtcct gcatcatctt gaacattaat 3841 ccacatgttt cagagctcac caggcagtac caatgctctt ttcacagcta tgaagagcta 3901 gagaaattct tgttatggta gaaaaatttc acggttcatt tttgaaactg catttgtgcg 3961 tatgcagtgt agattttata gtgtgttgtg ctttcaagat ctaaatcata tataataaat 4021 taagggacaa tggggctgac agcactaaac ttggtgctta ttgatattct aagaaatatc 4081 tgtgaaatat catcacgtat gttatacaac cttcatttaa aaaggtttaa aactagttag 4141 attcactttg acacttttca tatcatttct taacccaagt gacgaaaaca ttgtccccaa 4201 tgaatatact cattagaatt accatttgtt aatatcactc attaattaac cccataatta 4261 gatccattaa tttaaatgat ttaaatttaa gtaagtttta taaggtctga catcagaggt 4321 atcttacttt cctctgagga tgatgtactt gccctgacca tgcattttac catcacacat 4381 gttcagaaag ggccaaattc ccaacctgct catttttttt tttatcagag tcatgatgaa 4441 tcagtcctag aatgtttcat ttgcacaagt agggctgcct ccaagaggaa cctctgattt 4501 attttgtatg aaatatatgt gaaaggatat gaatctgaga gatgctgtag acatctgtcc 4561 tacacttgag atgatttcca agcctctctg gcactttgag ttaagtctat ctggtattaa 4621 atgccaagga ccttttgctg cctaaatcca ctctgcagga aataggccca accaccagat 4681 gagaattagg ccctggatga gtagcgctat agttactgtc ctgttgatta atttctgcca 4741 tttcatgtcc ataaaagaga ccacccatat catgcacaca attagatttc tcacactcta 4801 actgtatatt tgtatgatat tttaaaatct cctaaatgct gggcaatggc tattaacaat 4861 taattgtctt gcactggcct tctgatgaaa tgttaacaat gcctattgta atatagaaaa 4921 aaacattcta tctactgatt tgggctgaat gtatgtaaat aggtttctaa aaagtcagat 4981 gtttgagcag tggcctacaa atcagtaatt ttcgggtggg agagtttctt tacattgccg 5041 tggcatctta aaagctatct tcatgtaaat tgactgtact aggcctactg gggatcagag 5101 ttcccaagaa aggaaacctt ttcttgtatc tggattcaaa tttatttcca atgtttcaag 5161 cgggaaacat gactctttat tgtctgtaaa tctaacatta ttacttttcc tcttagaaga 5221 atattgtatt gttagatgtt tgttgagctg gtaacatcgt tgcaaccact gcaatatctt 5281 cgttagtaat ctgtataata ctttgtatac aagtactggt aagattgtta ttaaatgtag 5341 cttcagtcat taaattacta tagcaaagta gtacttcttc tgtaatattt acaatgtatt 5401 aagcccacag tatattttat ttcaatgtaa ttaaactgtt aacttattca aagagaaaac 5461 atctcatcat gtctattgtc caaagttacc tggaatcaaa taaaaattct agattaccat 5521 gaagaacata aaatgccttt gaactctgcc ttatttcaca gtctgatggc aaaatactaa 5581 ggatttaatt tctaaaagat tgctgaacta atttattcct caaaaagcac taatgactac 5641 ttgaaaagtg gggacatatt ggatt // LOCUS HSU77589 768 bp DNA PRI 02-APR-1997 DEFINITION Human MHC class II HLA-DQ-alpha chain (HLA-DQA1*0104 allele) mRNA, complete cds. ACCESSION U77589 NID g1916744 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 768) AUTHORS Lardy,N.M., Horst,A.R., Otting,N., Bontrop,R.E. and Waal de,L.P. TITLE Full length cDNA nucleotide sequence of a serological silent HLA-DQA1*0104 allele JOURNAL Unpublished REFERENCE 2 (bases 1 to 768) AUTHORS Lardy,N.M., Horst,A.R., Otting,N., Bontrop,R.E. and Waal de,L.P. TITLE Direct Submission JOURNAL Submitted (06-NOV-1996) Central Lab of the Dutch Red Cross Blood Transfusion Service, Plesmanlaan 125, Amsterdam 1066CX, the Netherlands FEATURES Location/Qualifiers source 1..768 /organism="Homo sapiens" /isolate="I-2" /db_xref="taxon:9606" /chromosome="6" /cell_line="EBV" /tissue_type="PBL" /haplotype="HLA-A2, B7, DR14, DQ'blanc'" gene 1..768 /gene="HLA-DQ-alpha" CDS 1..768 /gene="HLA-DQ-alpha" /function="binds to T cell receptor" /note="HLA-DQA1*0104 allele" /codon_start=1 /product="MHC class II HLA-DQ-alpha chain" /db_xref="PID:g1916745" /translation="MILNKALLLGALALTTMMSPCGGEGIVADHVASCGVNLYQFYGP SGQYTHEFDGDEEFYVDLERKETAWRWPEFSKFGGFDPQGALRNMAVAKHNLNIMIKC YNSTAATNEVPEVTVFSKSPVTLGQPNTLICLVDNIFPPVVNITWLSNGQSVTEGVSE TSFLSKSDHSFFKISYLTFLPSADEIYDCKVEHWGLDQPLLKHWEPEIPAPMSELTET VVCTLGLSVGLVGIVVGTVFIIQGLRSVGASRHQGPL" BASE COUNT 166 a 200 c 203 g 199 t ORIGIN 1 atgatcctaa acaaagctct gctgctgggg gccctcgctc tgaccaccat gatgagccct 61 tgtggaggtg aaggcattgt ggctgaccac gttgcctctt gtggtgtaaa cttgtaccag 121 ttttacggtc cctctggcca gtacacccat gaatttgatg gagatgagga gttctacgtg 181 gacctggaga ggaaggagac tgcctggcgg tggcctgagt tcagcaaatt tggaggtttt 241 gacccgcagg gtgcactgag aaacatggct gtggcaaaac acaacttgaa catcatgatt 301 aaatgctaca actctaccgc tgctaccaat gaggttcctg aggtcacagt gttttccaag 361 tctcccgtga cactgggtca gcccaacacc ctcatttgtc ttgtggacaa catctttcct 421 cctgtggtca acatcacatg gctgagcaat gggcagtcag tcacagaagg tgtttctgag 481 accagcttcc tctccaagag tgatcattcc ttcttcaaga tcagttacct caccttcctc 541 ccttctgctg atgagattta tgactgcaag gtggagcact ggggcctgga ccagcctctt 601 ctgaaacact gggagcctga gattccagcc cctatgtcag agctcacaga gactgtggtc 661 tgcaccctgg ggttgtctgt gggcctcgtg ggcattgtgg tgggcactgt cttcatcatc 721 caaggcctgc gttcagttgg tgcttccaga caccaagggc cattgtga // LOCUS HSU77629 1140 bp DNA PRI 26-NOV-1997 DEFINITION Homo sapiens Achaete-Scute homologue 2 (ASCL2) gene, complete cds. ACCESSION U77629 NID g2642464 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1140) AUTHORS Alders,M., Hodges,M., Hadjantonakis,A.-K., Postmus,J., van Wijk,I., Bliek,J., de Meulemeester,M., Westerveld,A., Guillemot,F., Oudejans,C., Little,P. and Mannens,M. TITLE The human Achaete-Scute homologue 2 (ASCL2,HASH2) maps to chromosome 11p15.5, close to IGF2 and is expressed in extravillus trophoblasts JOURNAL Hum. Mol. Genet. 6 (6), 859-867 (1997) MEDLINE 97318794 REFERENCE 2 (bases 1 to 1140) AUTHORS Alders,M. TITLE Direct Submission JOURNAL Submitted (06-NOV-1996) Human Genetics, University of Amsterdam, Meibergdreef 15, Amsterdam 1105AZ, The Netherlands FEATURES Location/Qualifiers source 1..1140 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" mRNA 209..1140 /gene="ASCL2" gene 209..1140 /note="HASH2" /gene="ASCL2" CDS 545..1126 /gene="ASCL2" /codon_start=1 /product="Achaete-Scute homologue 2" /db_xref="PID:g2642465" /translation="MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAE TGGGAAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRAL QRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGS PRSAYSSDDSGCEGALSPAERELLDFSSWLGGY" BASE COUNT 161 a 394 c 431 g 154 t ORIGIN 1 gtaccttgct ttgggggcgc actaagtacc tgccgggagc agggggcgca ccgggaactc 61 gcagatttcg ccagttgggc gcactgggga tctgtggact gcgtccgggg gatgggctag 121 ggggacatgc gcacgctttg ggccttacag aatgtgatcg cgcgaggggg agggcgaagc 181 gtggcgggag ggcgaggcga aggaaggagg gcgtgagaaa ggcgacggcg gcggcgcgga 241 ggagggttat ctatacattt aaaaaccagc cgcctgcgcc gcgcctgcgg agacctggga 301 gagtccggcc gcacgcgcgg gacacgagcg tcccacgctc cctggcgcgt acggcctgcc 361 accactaggc ctcctatccc cgggctccag acgacctagg acgcgtgccc tggggagttg 421 cctggcggcg ccgtgccaga agcccccttg gggcgccaca gttttccccg tcgcctccgg 481 ttcctctgcc tgcaccttcc tgcggcgcgc cgggacctgg agcgggcggg tggatgcagg 541 cgcgatggac ggcggcacac tgcccaggtc cgcgccccct gcgccccccg tccctgtcgg 601 ctgcgctgcc cggcggagac ccgcgtcccc ggaactgttg cgctgcagcc ggcggcggcg 661 accggccacc gcagagaccg gaggcggcgc agcggccgta gcgcggcgca atgagcgcga 721 gcgcaaccgc gtgaagctgg tgaacttggg cttccaggcg ctgcggcagc acgtgccgca 781 cggcggcgcc agcaagaagc tgagcaaggt ggagacgctg cgctcagccg tggagtacat 841 ccgcgcgctg cagcgcctgc tggccgagca cgacgccgtg cgcaacgcgc tggcgggagg 901 gctgaggccg caggccgtgc ggccgtctgc gccccgcggg ccgccaggga ccaccccggt 961 cgccgcctcg ccctcccgcg cttcttcgtc cccgggccgc gggggcagct cggagcccgg 1021 ctccccgcgt tccgcctact cgtcggacga cagcggctgc gaaggcgcgc tgagtcctgc 1081 ggagcgcgag ctactcgact tctccagctg gttagggggc tactgagcgc cctcgaccta // LOCUS HSU83908 1740 bp DNA PRI 07-FEB-1997 DEFINITION Human nuclear antigen H731 mRNA, complete cds. ACCESSION U83908 NID g1825561 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1740) AUTHORS Matsuhashi,S., Yoshinaga,H., Yatsuki,H., Tsugita,A. and Hori,K. TITLE Isolation of a novel gene from a human cell line with Pr-28 MAb which recognizes a nuclear antigen involved in the cell cycle JOURNAL Res. Commun. Biochem. Cell Mol. Biol. 1, 109-120 (1997) REFERENCE 2 (bases 1 to 1740) AUTHORS Yoshinaga,H., Matsuhashi,S., Kondo,T. and Hori,K. TITLE Expression of the human H731 gene product in Escherichia coli inhibits DNA synthesis JOURNAL Unpublished REFERENCE 3 (bases 1 to 1740) AUTHORS Matsuhashi,S. TITLE Direct Submission JOURNAL Submitted (06-JAN-1997) Biochemistry, Saga Medical School, Nabeshima, Saga 849, Japan FEATURES Location/Qualifiers source 1..1740 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="glioma cell line" CDS 235..1611 /function="has a role in the cell cycle" /codon_start=1 /product="nuclear antigen H731" /db_xref="PID:g1825562" /translation="MTKYPDNLSDSLFSGDEENAGTEEVKNEINGNWISASSINEARI NAKAKRRLRKNSSRDSGRGDSVSESGSDALRSGLTVPTSPKGRLLDRRSRSGKGRGLP KKGGAGGKGVWGTPGQVYDVEEVDVKDPNYDDDQENCVYETVVLPLDERAFEKTLTPI IQEYFEHGDTNEVAEMLRDLNLGEMKSGVPVLAVSLALEGKASHREMTTKLLSDLCGT VMSTTDVEKSFDKLLKDLPELALDTPRAPQLVGQFIARAVGDGILCNTYIDSYKGTVD CVQARAALDKATVLLSMSKGGKRKDSVWGSGGGQQSVNHLVKEIDMLLKEYLLSGDIS EAEHCLKELEVPHFHHELVYEAIIMVLESTGESTFKMILDLLKSLWKSSTITVDQMKR GYERIYNEIPDINLDVPHSYSVLERFVEECFQAGIISKQLRDLCPSRGRKRFVSEGDG GRLKPESY" BASE COUNT 535 a 263 c 452 g 490 t ORIGIN 1 ggggtcgggg ccggctgacc aggaacctgg gcgagcagcg gcgggggccc gagggattct 61 gaaggaagat ttccattagg taatttgttt aatcagtgca agcgaaatta agggaaaatg 121 gatgtagaaa atgagcagat actgaatgta aaccctgcag ggtattttcc ctaattctcc 181 atggtgcttc aatagcatgt tattatcata aaaatgaaca gttttgtgga atagatgacc 241 aaatatcctg ataacttaag tgactctctc ttttccggtg atgaagaaaa tgctgggact 301 gaggaagtaa agaatgaaat aaatggaaat tggatttcag catcctccat taacgaagct 361 agaattaatg ccaaggcaaa aaggcgacta aggaaaaact catcccggga ctctggcaga 421 ggcgattcgg tcagcgagag tgggagtgac gcccttagaa gtggattaac tgtgccaacc 481 agtccaaagg gaaggttgct ggataggcga tccagatctg ggaaaggaag gggactacca 541 aagaaaggtg gtgcaggagg caaaggtgtc tggggtacac ctggacaggt gtatgatgtg 601 gaggaggtgg atgtgaaaga tcctaactat gatgatgacc aggagaactg tgtttatgaa 661 actgtagttt tgcctttgga tgaaagggca tttgagaaga ctttaacacc aatcatacag 721 gaatattttg agcatggaga tactaatgaa gttgcggaaa tgttaagaga tttaaatctt 781 ggtgaaatga aaagtggagt accagtgttg gcagtatcct tagcattgga ggggaaggct 841 agtcatagag agatgactac taagcttctt tctgaccttt gtgggacagt aatgagcaca 901 actgatgtgg aaaaatcatt tgataaattg ttgaaagatc tacctgaatt agcactggat 961 actcctagag caccacagtt ggtgggccag tttattgcta gagctgttgg agatggaatt 1021 ttatgtaata cctatattga tagttacaaa ggaactgtag attgtgtgca ggctagagct 1081 gctctggata aggctaccgt gcttctgagt atgtctaaag gtggaaagcg taaagatagt 1141 gtgtggggct ctggaggtgg gcagcaatct gtcaatcacc ttgttaaaga gattgatatg 1201 ctgctgaaag aatatttact ctctggagac atatctgaag ctgaacattg ccttaaggaa 1261 ctggaagtac ctcattttca ccatgagctt gtatatgaag ctattataat ggttttagag 1321 tcaactggag aaagtacatt taagatgatt ttggatttat taaagtccct ttggaagtct 1381 tctaccatta ctgtagacca aatgaaaaga ggttatgaga gaatttacaa tgaaattccg 1441 gacattaatc tggatgtccc acattcatac tctgtgctgg agcggtttgt agaagaatgt 1501 tttcaggctg gaataatttc caaacaactc agagatcttt gtccttcaag gggcagaaag 1561 cgttttgtaa gcgaaggaga tggaggtcgt cttaaaccag agagctactg aatataagaa 1621 ctcttgcagt cttagatgtt ataaaaatat atatctgaat tgtaagagtt gttagcacaa 1681 gttttttttt tttttttttt aagcacttgt tttgggtaca aggcatttct gacattttat // LOCUS HSU88629 1923 bp DNA PRI 22-APR-1997 DEFINITION Human RNA polymerase II elongation factor ELL2, complete cds. ACCESSION U88629 NID g1946346 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1923) AUTHORS Shilatifard,A., Duan,D.R., Haque,D., Florence,C., Schubach,W.H., Conaway,J.W. and Conaway,R.C. TITLE ELL2, a new member of an ELL family of RNA polymerase II elongation factors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (8), 3639-3643 (1997) MEDLINE 97268622 REFERENCE 2 (bases 1 to 1923) AUTHORS Shilatifard,A., Duan,D.R., Haque,D., Florence,C., Schubach,W.H., Conaway,J.W. and Conaway,R.C. TITLE Direct Submission JOURNAL Submitted (05-FEB-1997) Molecular and Cell Biology, Oklahoma Medical Research Foundation, 825 NE 13th Street, Oklahoma City, OK 73104, USA FEATURES Location/Qualifiers source 1..1923 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1923 /codon_start=1 /product="RNA polymerase II elongation factor ELL2" /db_xref="PID:g1946347" /translation="MAAGGTGGLREEQRYGLSCGRLGQDNITVLHVKLTETAIRALET YQSHKNLIPFRPSIQFQGLHGLVKIPKNDPLNEVHNFNFYLSNVGKDNPQGSFDCIQQ TFSSSGASQLNCLGFIQDKITVCATNDSYQMTRERMTQAEEESRNRSTKVIKPGGPYV GKRVQIRKAPQAVSDTVPERKRSTPMNPANTIRKTHSSSTISQRPYRDRVIHLLALKA YKKPELLARLQKDGVNQKDKNSLGAILQQVANLNSKDLSYTLKDYVFKELQRDWPGYS EIDRRSLESVLSRKLNPSQNATGTSRSESPVCSSRDAVSSPQKRLLDSEFIDPLMNKK ARISHLTNRVPPTLNGHLNPTSEKSAAGLPLPPAAAAIPTPPPLPSTYLPISHPPQIV NSNSNSPSTPEGRGTQDLPVDSFSQNDSIYEDQQDKYTSRTSLETLPPGSVLLKCPKP MEENHSMSHKKSKKKSKKHKEKDQIKKHDIETIEEKEEDLKREEEIAKLNNSSPNSSG GVKEDCTASMEPSAIELPDYLIKYIAIVSYEQRQNYKDDFNAEYDEYRALHARMETVA RRFIKLDAQRKRLSPGSKEYQNVHEEVLQEYQKIKQSSPNYHEEKYRCEYLHNKLAHI KRLIGEFDQQQAESWS" BASE COUNT 630 a 462 c 410 g 421 t ORIGIN 1 atggcggcgg gggggacagg gggcctgcgg gaggagcagc gctatgggct gtcgtgcgga 61 cggctggggc aggacaacat caccgtactg catgtgaagc tcaccgagac ggcgatccgg 121 gcgctcgaga cttaccagag ccacaagaat ttaattcctt ttcgaccttc aatccagttc 181 caaggactcc acgggcttgt caaaattccc aaaaatgatc ccctcaatga agttcataac 241 tttaactttt atttgtcaaa tgtgggcaaa gacaaccctc agggcagctt tgactgcatc 301 cagcaaacat tctccagctc tggagcctcc cagctcaatt gcctgggatt tatacaagat 361 aaaattacag tgtgtgcaac aaacgactcg tatcagatga cacgagaaag aatgacccag 421 gcagaggagg aatcccgcaa ccgaagcaca aaagttatca aacccggtgg accatatgta 481 gggaaaagag tgcaaattcg gaaagcacct caagctgttt cagatacagt tcctgagagg 541 aaaaggtcaa cccccatgaa ccctgcaaat acaattcgaa agacacatag cagcagcacc 601 atctctcaga ggccatacag ggacagggtg attcacttac tggccctgaa ggcctacaag 661 aaaccggagc tacttgctag actccagaaa gatggtgtca atcaaaaaga caagaactcc 721 ctgggagcaa ttctgcaaca ggtagccaat ctgaattcta aggacctctc atatacctta 781 aaggattatg tttttaaaga gcttcaaaga gactggcctg gatacagtga aatagacaga 841 cggtcattgg agtcagtgct ctctagaaaa ctaaatccgt ctcagaatgc tacaggcacc 901 agccgttcag aatctcctgt atgttctagt agagatgctg tatcttctcc tcagaaacgg 961 cttttggatt cagagtttat tgatccttta atgaataaaa aagcccgaat atctcacctg 1021 acgaacagag taccaccaac actaaatggt catttgaatc ccaccagtga aaaatcggct 1081 gcaggcctcc cactgccccc tgcggctgct gccatcccca cccctccacc gctgccttca 1141 acctatctgc ccatctcaca tcctcctcag attgtaaatt ctaactccaa ctcccctagc 1201 actccagaag gccgggggac tcaagaccta cctgttgaca gttttagtca aaacgatagt 1261 atctatgagg accagcaaga caaatatacc tctaggactt ctctggaaac cttaccccct 1321 ggttccgttc tactaaagtg tccaaagcct atggaagaaa accattcaat gtctcacaaa 1381 aagtccaaaa agaagtctaa aaaacataag gaaaaggacc aaataaaaaa gcacgacatt 1441 gagactattg aggaaaagga ggaagatctt aagagagaag aggaaattgc caagctaaat 1501 aactccagtc caaattccag tggaggagtt aaagaggatt gcactgcctc catggaacct 1561 tcagcaattg aactcccaga ttatttgata aaatatatcg ctatcgtctc ctatgagcaa 1621 cgccagaatt ataaggatga cttcaatgca gagtatgatg agtacagagc tttgcatgcc 1681 aggatggaga ctgtagctag aagatttatc aaactagatg cacaaagaaa gcgcctttct 1741 ccaggctcaa aagagtatca gaatgttcat gaagaagtct tacaagaata tcagaagata 1801 aagcagtcta gtcccaatta ccatgaagaa aaatacagat gtgaatatct tcataacaag 1861 ctggctcaca tcaaaaggct aataggtgaa tttgaccaac agcaagcaga gtcatggtcc 1921 tag // LOCUS HSU90724 3428 bp DNA PRI 16-JAN-1998 DEFINITION Human aminopeptidase P gene, complete cds. ACCESSION U90724 NID g2772609 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3428) AUTHORS Venema,R.C., Ju,H., Zou,R., Venema,V.J. and Ryan,J.W. TITLE Cloning and tissue distribution of human membrane-bound aminopeptidase P JOURNAL Biochim. Biophys. Acta 1354 (1), 45-48 (1997) MEDLINE 98041638 REFERENCE 2 (bases 1 to 3428) AUTHORS Venema,R.C., Ju,H., Zou,R., Venema,V.J. and Ryan,J.W. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) Vascular Biology Center, Medical College of Georgia, 1120 15th Street, Augusta, GA 30912, USA FEATURES Location/Qualifiers source 1..3428 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 265..2286 /EC_number="3.4.11.9" /codon_start=1 /product="aminopeptidase P" /db_xref="PID:g2772610" /translation="MARAHWGCCPWLVLLCACAWGHTKPLDLGGQDVRNCSTNPPYLP VTVVNTTMSLTALRQQMQTQNLSAYIIPGTDAHMNEYIGQHDERRAWITGFTGSAGTA VVTMKKAAVWTDSRYWTQAERQMDCNWELHKEVGTTPIVTWLLTEIPAGGRVGFDPFL LSIDTWESYDLALQGSNRQLVSITTNLVDLVWGSERPPVPNQPIYALQEAFTGSTWQE KVSGVRSQMQKHQKVPTAVLLSALEETAWLFNLRASDIPYNPFFYSYTLLTDSSIRLF ANKSRFSSETLSYLNSSCTGPMCVQIEDYSQVRDSIQAYSLGDVRIWIGTSYTMYGIY EMIPREKLVTDTYSPVMMTKAVKNSKEQALLKASHVRDAVAVIRYLVWLEKNVPKGTV DEFSGAEIVDKFRGEEQFSSGPSFETISASGLNAALAHYSPTKELNRKLSSDEMYLLD SGGQYWDGTTDITRTVHWGTPSAFQKEAYTRVLIGNIDLSRLIFPAATSGRMVEAFAR RALWDAGLNYGHGTGHGIGNFLCVHEWPVGFQSNNIAMAKGMFTSIEPGYYKDGEFGI RLEDVALVVEAKTKYPGELPDLVVSFVPYDRNLIDVSLLSPEHLQYLNRYYQTIREKV GPELQRRQLLEEFEWLQQHTEPLAARAPDTASWASVLVVSTLAILGWSV" BASE COUNT 808 a 1032 c 859 g 729 t ORIGIN 1 caccctatcc tacactacta ggaacttgca cagtccgcct cgggcagccc aaagctcctc 61 tgcccaccct ggctcccaaa accctccaaa acaaaagacc agaaaagcac tctccaccca 121 gcagccaaac gcctccttct tgacgccagc ccccaccctc tgtctgctcg agcccaggaa 181 aggcctgaag gaacaggccg gggaaggagc cctccctctc tcccttgtcc ctccatccac 241 ccagcgccgg catctggaga ccctatggcc cgggctcact ggggctgctg cccctggctg 301 gtcctcctct gtgcttgtgc ctggggccac acaaagccac tggaccttgg agggcaggat 361 gtgagaaatt gttccaccaa ccccccttac cttccagtta ctgtggtcaa taccacaatg 421 tcactcacag ccctccgcca gcagatgcag acccagaatc tctcagccta catcatccca 481 ggcacagatg ctcacatgaa cgagtacatc ggccaacatg acgagaggcg tgcgtggatt 541 acaggcttta cagggtctgc aggaactgca gtggtgacta tgaagaaagc agctgtctgg 601 accgacagtc gctactggac tcaggctgag cggcaaatgg actgtaattg ggagctccat 661 aaggaagttg gcaccactcc tattgtcacc tggctcctca ccgagattcc cgctggaggg 721 cgtgtgggtt ttgacccctt cctcttgtcc attgacacct gggagagtta tgatctggcc 781 ctccaaggct ctaacagaca gctggtgtcc atcacaacca atcttgtgga cctggtatgg 841 ggatcagaga ggccaccggt tccaaatcaa cccatttatg ccctgcagga ggcattcaca 901 gggagcactt ggcaggagaa agtatctggc gtccgaagcc agatgcagaa gcatcaaaag 961 gtcccgactg ccgtccttct gtcggcgctt gaggagacgg cctggctctt caaccttcga 1021 gccagtgaca tcccctataa ccccttcttc tattcctaca cgctgctcac agactcttct 1081 attaggttgt ttgcaaacaa gagtcgcttt agctccgaaa ccttgagcta tctgaactcc 1141 agttgcacag gccccatgtg tgtgcaaatc gaggattaca gccaagttcg tgacagcatc 1201 caggcctact cattgggaga tgtgaggatc tggattggga ccagctatac catgtatggg 1261 atctatgaaa tgataccaag ggagaaactc gtgacagaca cctactcccc agtgatgatg 1321 accaaggcag tgaagaacag caaggagcag gccctcctca aggccagcca cgtgcgggac 1381 gctgtggctg tgatccggta cttggtctgg ctggagaaga acgtgcccaa aggcacagtg 1441 gatgagtttt cgggggcaga gatcgtggac aagttccgag gagaagaaca gttctcctcc 1501 ggacccagtt ttgaaaccat ctctgctagt ggtttgaatg ctgccctggc ccactacagc 1561 ccgaccaagg agctgaaccg caagctgtcc tcagatgaga tgtacctgct ggactctggg 1621 gggcagtact gggacgggac cacagacatc accagaacag tccactgggg caccccctct 1681 gcctttcaga aggaggcata tacccgtgtg ctgataggaa atattgacct gtccaggctc 1741 atctttcccg ctgctacatc agggcgaatg gtggaggcct ttgcccgcag agccttgtgg 1801 gatgctggtc tcaattatgg tcatgggaca ggccacggca ttggcaactt cctgtgtgtg 1861 catgagtggc cagtgggatt ccagtccaac aacatcgcta tggccaaggg catgttcact 1921 tccattgaac ctggttacta taaggatgga gaatttggga tccgtctcga agatgtggct 1981 ctcgtggtag aagcaaagac caagtaccca ggggagctac ctgaccttgt ggtatcattt 2041 gtgccctatg accggaacct catcgatgtc agcctgctgt ctcccgagca tctccagtac 2101 ctgaatcgct actaccagac catccgggag aaggtgggtc cagagctgca gaggcgccag 2161 ctactagagg agttcgagtg gcttcaacag cacacagagc ccctggccgc cagggcccca 2221 gacaccgcct cctgggcctc tgtgttagtg gtctccaccc ttgccatcct tggctggagt 2281 gtctagaggc tccagactct cctgttaacc ctccatctag atggggggct cccttgctta 2341 gctcccctca ccctgcactg aacatacccc aagagcccct gctggcccat tgcctagaaa 2401 cctttgcatt catcctcctt ctccaagacc tatggagaag gtcccaggcc ccaggaaaca 2461 cagggcttct tggccccaga tggcacctcc ctgcaccccg gggttgtata ccacaccctg 2521 ggcccctaat cccaggcccc gaaataggaa agccagctag tctcttctct tctgtgatct 2581 cagtaggcct aacctataac ctaacacaga ctgctacagc tgctcccctc ccgccaaaca 2641 aagccccaag aaaacaatgc ccctaccacc caagggtgcc atggtcccgg gaaaacccaa 2701 cctgtcaccg cgtgttgggc gtaaccagaa ctgttccccc ccaccagggc ttaaaaatcg 2761 cccccacttt ttaaccatcg tccattaacc acctggtggg catagccaga gctgttcgaa 2821 cccagccagg gatgaaaaat caacccccga catggaaccc atgattccta aacccggggt 2881 aggttccatg ccaagtaaca gcagagggag ttaagccata ggaatttggc tgtggagtaa 2941 gagggaatgc ggtgaggcag tgtggaatat gaccctacca gaggttggag aacaaacttg 3001 ggcagccgga acccgtcact attttagatt cctggcattc gaggagccct ttgaactttc 3061 caaagtgcag ccacagctac aatgctgtta aatcctccca catttcttgg atgccccttc 3121 accttgtgtg gacagtgtct ggtttcccca ttttacagac aggaaaactg agcttcagac 3181 agggggtggg ctttgcctaa ggacacacaa atttggttgg gagttgatgg ggccagatga 3241 gccagcattc cagctgtttc acccttcagc aacatgcaga gtccctgagc ccacctccca 3301 gccctctcct cattctctga acccactgtg gtgagaagaa tttgctccgg ccaaattggc 3361 cgttagccac ctgggtccac atcctgctaa gacgtttaaa acagcctaac aaagacactt 3421 gcctgtgg // LOCUS HSU91934 2267 bp DNA PRI 17-MAR-1997 DEFINITION Human retina-derived POU-domain factor-1 mRNA, complete cds. ACCESSION U91934 NID g1890301 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2267) AUTHORS Zhou,H., Yoshioka,T. and Nathans,J. TITLE Retina-derived POU-domain factor-1: a complex POU-domain gene implicated in the development of retinal ganglion and amacrine cells JOURNAL J. Neurosci. 16 (7), 2261-2274 (1996) MEDLINE 96180940 REFERENCE 2 (bases 1 to 2267) AUTHORS Zhou,H., Yoshioka,T. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (04-MAR-1997) MBG, HHMI/JHU, 725 North Wolfe Street, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..2267 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retina" CDS 108..2162 /codon_start=1 /product="retina-derived POU-domain factor-1" /db_xref="PID:g1890302" /translation="MIAGQVSKPLLSVRSEMNAELRGEDKAATSDSELNEPLLAPVES NDSEDTPSKLFGARGNPALSDPGTPDQHQASQTHPPFPVGPQPLLTAQQLASAVAGVM PGGPPALNQPILIPFNMAGQLGGQQGLVLTLPTANLTNIQGLVAAAAAGGIMTLPLQN LQATSSLNSQLQQLQLQLQQQQQQQQQQQPPPSTNQHPQPAPQAPSQSQQQPLQPTPP QQPPPASQQPPAPTSQLQQAPQPQQHQPHSHSHNQNQPSPTQQSSSPPQKPSQSPGHG LPSPLTPPNPLQLVNNPLASQAAAAAAAMSSIASSQAFGNALSSLQGVTGQLVTNAQG QIIGTIPLMPNPGPSSQAASGTQGLQVQPITPQLLTNAQGQIIATVIGNQILPVINTQ GITLSPIKPGQQLHQPSQTSVGQAASQGNLLHLAHSQASMSQSPVRQASSSSSSSSSS SALSVGQLVSNPQTAAGEVDGVNLEEIREFAKAFKIRRLSLGLTQTQVGQALSATEGP AYSQSAICRHTILRSHFFLPQEAQENTIASSLTAKLNPGLLYPARFEKLDITPKSAQK IKPVLERWMAEAEARHRAGMQNLTEFIGSEPSKKRKRRTSFTPQALEILNAHFEKNTH PSGQEMTEIAEKLNYDREVVRVWFCNKRQALKNTIKRLKQHEPATAVPLEPLTDSLEE NS" exon 1658..1765 /note="absent in alternatively spliced variant, GenBank Accession Number U91935" BASE COUNT 596 a 755 c 517 g 399 t ORIGIN 1 ctgaccaatt acagcttgca tggtccccgt gtccctgctg cgctgagaga atgttcttat 61 aatgatccaa gaggaagtgg caaatgagtg ctcttcttca ggatccaatg atagctggac 121 aagtcagtaa gcccttgctg tcagtgcgga gtgaaatgaa tgcggagttg agaggtgagg 181 acaaggctgc tacttcagac agcgagctga atgagcccct gcttgcgcct gtggaatcaa 241 atgacagcga ggacactccc agcaagctct tcggggctag aggaaaccca gcattatcag 301 acccaggcac tcctgaccaa caccaggcca gtcagaccca ccccccattt ccagttgggc 361 cacagccact tctgacggca cagcagttag cttctgctgt ggccggcgtg atgccgggag 421 gccccccagc cctcaaccag ccaatcctca ttcccttcaa catggcggga cagctaggag 481 gccagcaagg actggttctc acactgccaa cagcgaatct caccaacatc caagggctgg 541 tggcagcagc tgcagccgga ggcattatga ctctgccact gcaaaatcta caagctacct 601 catccctgaa ctcccagctc cagcagctcc agctccagct ccagcagcag cagcagcagc 661 agcagcagca gcagcctccc ccgtcaacca accagcaccc gcaaccagcc ccacaggcgc 721 cctcgcagtc ccagcagcag ccgctgcagc ccaccccacc ccagcagcca ccacccgcct 781 ctcagcagcc gccagctcct acatctcagc tgcaacaggc gcctcagccc cagcagcacc 841 aaccccactc ccactcccac aaccagaacc aaccatctcc aacccagcag agctccagcc 901 ccccgcagaa acctagtcag tctccaggac atggcctgcc ttcaccgctc acgccaccca 961 atcctctaca gctggttaat aatccactag caagtcaggc tgcagcggct gcagcagcca 1021 tgagctccat agcaagctca caggcctttg gcaatgccct ctccagtctt cagggggtca 1081 caggtcaact agttactaat gcacaaggac agattatcgg gaccattcca ctgatgccta 1141 atccggggcc atcgagccaa gcagcaagcg gcactcaggg cttgcaagtg cagccaatca 1201 ccccccagct cctcacaaac gcccagggcc agatcatcgc cacagtcatt gggaaccaga 1261 tcctgcccgt gatcaacacc cagggcatca cgctgtcacc catcaagccc ggccagcagc 1321 tccaccaacc ctcccagacg tcagtgggtc aagcagcctc ccaaggcaac cttctgcacc 1381 tggctcacag ccaagcatcc atgtctcaaa gtcccgtccg gcaggcttcc tcttcttcct 1441 cctcatcctc ctcttcttca gctttgagcg tgggccagtt agtcagcaat cctcaaacgg 1501 cagcgggtga ggtggatggg gttaatctgg aggagatccg agaatttgcc aaagctttta 1561 aaatccggcg cctgtccctt ggcctgaccc agactcaggt gggacaggct ctcagtgcta 1621 cagagggccc cgcgtacagc cagtcggcca tctgcagaca caccatcctg agaagccact 1681 ttttcctacc acaggaagcc caagagaaca ctatagctag cagtctgaca gccaaactga 1741 accctggcct tttgtatcct gccaggtttg aaaagctgga catcacccct aaaagtgccc 1801 agaagatcaa gccggtgctt gagcggtgga tggctgaggc tgaggcccgc catcgagcag 1861 gtatgcagaa cctgaccgag tttatcggga gtgaaccatc caaaaagcgc aagcggcgca 1921 cctccttcac accccaggcc cttgagatcc tcaatgccca ctttgagaag aacacacacc 1981 cttctgggca ggaaatgacc gaaattgctg agaagctgaa ctatgaccga gaagtagtta 2041 gagtttggtt ctgcaataag aggcaagccc tgaagaacac aattaaacgc ttaaaacagc 2101 acgagccggc cacggcagtc cctttggagc ccttaacaga ctctctggaa gaaaactcct 2161 aaagagatgc ccacccataa tcagaagcaa aattcacaga aactaaactc cacccttggg 2221 actccacaac aacaacaaca acaaaattta atttaattta aaaatag // LOCUS HSU91939 1400 bp DNA PRI 24-MAR-1997 DEFINITION Human putative G protein-coupled receptor (GPR25) gene, complete cds. ACCESSION U91939 NID g1905877 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1400) AUTHORS Jung,B.P., Nguyen,T., Kolakowski,L.F. Jr., Lynch,K.R., Heng,H.H., George,S.R. and O'Dowd,B.F. TITLE Discovery of a novel human G protein-coupled receptor gene (GPR25) located on chromosome 1 JOURNAL Biochem. Biophys. Res. Commun. 230 (1), 69-72 (1997) MEDLINE 97148573 REFERENCE 2 (bases 1 to 1400) AUTHORS Jung,B.P., Nguyen,T., Kolakowski,L.F. Jr., Lynch,K.R., Heng,H.H.Q., George,S.R. and O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1400 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q32.1" gene 80..1162 /gene="GPR25" CDS 80..1162 /gene="GPR25" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g1905878" /translation="MAPTEPWSPSPGSAPWDYSGLDGLEELELCPAGDLPYGYVYIPA LYLAAFAVGLLGNAFVVWLLAGRRGPRRLVDTFVLHLAAADLGFVLTLPLWAAAAARR PWPFGDGLCKLSTFALAGTRSAGALLLAGMSVDRYLAVVKLLEARPLRTPRCAVASCC GVWAVALLAGLPSLVYRGLQPLPGGQDSQCGEEPSHAFQGLSLLLLLLTFVLPLVVTL FCYCRISRRLRRPPHVGRARRNSLRIIFAIESTFVGSWLPFSALRAVFHLARLGALPL PCPLLLALRWGLTIATCLAFVNSCANPLIYLLLDRSFRARALDGACGRTGRLARRISS ASSLSRDDSSVFRCRAQAANTASASW" BASE COUNT 154 a 524 c 474 g 248 t ORIGIN 1 tgaagagcaa accccctcct gctcagagct cgtgccgcct gccccagggc tcgactccgc 61 gcaggcctca tagccagcca tggcccccac agagccctgg agccccagcc cggggtcagc 121 gccctgggac tactcggggt tggacggcct ggaggagctg gagctgtgtc cggccgggga 181 cctgccctac ggctacgtct acatccccgc gctctacctg gcggccttcg ccgtgggcct 241 gctgggcaac gcctttgtgg tgtggctgct ggccgggcgg cggggcccgc ggcggctggt 301 ggataccttc gtgctgcacc tggcggcagc tgacctgggc ttcgtgctca cgctgccgct 361 gtgggccgcg gcggcggcta ggcggccgtg gccgttcggc gatggcctct gcaagctcag 421 cacgttcgcg ctggcgggca cgcgctcggc gggcgcgctg ctgctggcgg gcatgagcgt 481 ggaccgctac ctggccgtgg tgaagctgct cgaggcgagg ccactgcgca ccccgcgctg 541 cgccgtggcc tcgtgctgcg gcgtctgggc cgtggcgctg ctggccggcc tgccctccct 601 ggtctaccgg gggttgcagc ccctgcctgg gggccaggac agccagtgcg gcgaggagcc 661 ctcccacgcc ttccagggcc tcagcttgct gctgctgctg ctgaccttcg tgctgcccct 721 ggtcgtcacc ctcttctgct actgccgcat ctcgcgccgc ctgcgacggc cgccgcacgt 781 gggtcgggcc cggaggaact cgctgcgcat catcttcgcc atcgagagca cgtttgtggg 841 ctcctggctg cccttcagcg ccctgcgggc cgtcttccac ctggcgcgtc tgggggcgct 901 gccgctgccg tgccccctgc tgctggcgct gcgctggggc ctcaccattg ccacctgcct 961 ggccttcgtc aacagctgcg ccaacccgct catctacctc ctgctggacc gctcattccg 1021 agcccgggcg ctggacgggg cctgcgggcg caccggccgc ctggcgcgaa ggatcagctc 1081 agcctcctcg ctctccaggg acgacagttc cgtgttccgt tgccgggccc aggccgcgaa 1141 cactgcctcg gcctcctggt agctgccccg ggccgctgga ggtgggcggc agcggagcat 1201 cgagaggagg ccagatgtcc cggaggggac tgagctcccc agacgcgcct gttctggcgg 1261 cagcaagctg ctcgggccgg catcgcattt cctcgcgcgc tgcctggact cccaaggcct 1321 cctccatcgg tttccccgga acctcagaac aattgaactc ccctaaacca ggctcctgtg 1381 actagctgtt ccctctcagc // LOCUS HSU93563 6019 bp DNA PRI 08-MAY-1997 DEFINITION Human L1 element L1.6 putative p150 gene, complete cds. ACCESSION U93563 NID g2072947 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6019) AUTHORS Sassaman,D.M., Dombroski,B.A., Moran,J.V., Kimberland,M.L., Naas,T.P., DeBerardinis,R.J., Gabriel,A., Swergold,G.D. and Kazazian,H.H. Jr. TITLE Many human L1 elements are capable of retrotransposition JOURNAL Nature Genet. 16 (1), 37-43 (1997) MEDLINE 97285120 REFERENCE 2 (bases 1 to 6019) AUTHORS Sassaman,D.M., Dombroski,B.A., Moran,J.V., Kimberland,M.L., Naas,T.P., DeBerardinis,R.J., Gabriel,A., Swergold,G.D. and Kazazian,H.H. Jr. TITLE Direct Submission JOURNAL Submitted (14-MAR-1997) Genetics, U. Pennsylvania School of Medicine, 415 Curie Blvd., Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..6019 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone_lib="see Dombroski, et al., Science 254:1805-1808(1991)" repeat_region 1..6019 /note="LINE; L1.6; full length L1 element; bicistronic" /rpt_family="L1" /rpt_type=dispersed misc_feature 906..1922 /note="ORF1; contains an internal stop codon causing premature truncation; there is no experimental evidence that this interval is expressed" misc_feature 1293..1295 /note="stop codon within ORF1 which causes premature truncation" CDS 1986..5813 /note="ORF2" /codon_start=1 /product="putative p150" /db_xref="PID:g2072948" /translation="MTGSNSHITILTLNINGLNSAIKRHRRASWIKSQDPSVCCIQET HLTCRDTHRLKIKGWRKIYQANGKQKKAGVAILVSDKTDFKPTKIKRDKEGHYIMVKG SIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTRQK VNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFLAPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFET NENKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKA SRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDT IKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLN RPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNS FYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGF IPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDG TYFKIIRAIYDKPTANIRLNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQA FLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNK WKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACI AKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNY LIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPK TIKTLEENLGITIQDIGTGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVN RQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKNWAKDMNRHFSKEDIYAA KKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWD CKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPNDYKSCCYKDTCTRMFIAALFTI AKTWNQPKCPTMIDWIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLETIIVSKLSQEQ KTKHRIFSLIGGN" polyA_site 6019 /note="20 A nucleotides sequenced" BASE COUNT 2311 a 1329 c 1224 g 1155 t ORIGIN 1 gggggaggag ccaagatggc cgaataggaa cagctccggt ctacagctcc cagcgtgagc 61 aacgcagaag acgggtgatt tctgcatttc catctgaggt accgggttca tctcactagg 121 gagtgccaga cagtgggcgc aggtcagtgg gtgcgtgcac cgtgcgcgag cgaagcaggg 181 cgaggcattg cctcacctgg gaagcgcaag gggtcaggga gttccctttc cgagtcaaag 241 aaaggggtga cggacgcacc tggaaaatcg ggtcactccc acccgaatat tgcgcttttc 301 agaccggctt aaaaaacggc gcaccaggag actatatccc acacctggct cagagggtcc 361 tacacccacg gaatctcgct gattgctagc acagcagtct gagatcaaac tgcaaggcgg 421 caacgaggct gggggagggg cgcccgccat tgcccaggct tgcttaggta aacaaagcag 481 ccgggaagct cgaactgggt ggagcccacc acagctcaag gaggcctgcc tgcctctgta 541 ggctccacct ctgggggcag ggcacagaca aacaaaaaga cagcagtaac ctctgcagac 601 tcaagtgtcc ctgtctgaca gctttgaaga gagcagtggt tctcccagca cgcagctgga 661 gatctgagaa cgggcagact gcatcctcaa gtgggtccct gacccctgac ccccgagcag 721 cctaactggg aggcaccccc cagcaggggc acactgacac ctcacacggc agggtattcc 781 aacagacctg cagctgaggg tcctgtctgt tagaaggaaa actaacaacc agaaaggaca 841 tccacaccaa aaacccatct atacatcacc atcatcaaag accaaaagta gataaaacaa 901 caaagatggg gaaaaaacag aacagaaaaa ctggaaactc taaaacgcag agcgcctctc 961 ctcctccaaa ggaacgcagt tcctcaccag caacggaaca aagctggatg gagaatgatt 1021 ttgacgagct gagagaagaa ggcttcagac gatcaaatta ctctgagcta cgggaggaca 1081 ttcaaaccaa aggcaaagaa gttgaaaact ttgaaaaaaa tttagaagaa tgtataacta 1141 gaatatccaa tacagagaag tgcttaaagg agctgatgga gctgaaaacc aaggctcgag 1201 aactacgtga agaatgcaga agcctcagga gccgatgcga tcaactggaa gaaagggtat 1261 cagcaatgga agatgaaatg aatgaaatga agtgagaagg gaagtttaga gaaaaaagaa 1321 taaacagaaa tgagcaaagc ctccaagaaa tatgggacta tgtgaaaaga ccaaatctac 1381 gtctgattgg tgtacctgaa agtgatatgg agaatggaac caagttggaa aacactctgc 1441 aggatattat ccaggagaac ttccccaatc tagcaaggca ggccaacgtt cagattcagg 1501 aaatacagag aacgccacaa agatactcct cgagaagagc aactccaaga cacataattg 1561 tcagattcac caaagttgaa atgaaggaaa aaatgttaag ggcagccaga gagaaaggtc 1621 gggttaccct caaaggaaag cccatcagac taacagcgga tctctcggca gaaaccctac 1681 aagccagaag agagtggggg ccaatattca acattcttaa agaaaagaat tttcaaccca 1741 gaatttcata tccagccaaa ctaagcttca taagtgaagg agaaataaaa tactttatag 1801 acaagcaaat gctgagagat tttgtcacca cgaggcctgc cctaaaagag ctcctgaagg 1861 aagcgctaaa catggaaagg aacaaccggt accagccgct gcaaaatcat gccaaaatgt 1921 aaagaccatc gagactagga agaaactgca tcaactaacg agcaaaatca ccagctaaca 1981 tcataatgac aggatcaaat tcacacataa caatattaac tttaaatata aatggactaa 2041 attctgcaat taaaagacac agacgggcaa gttggataaa gagtcaagac ccatcagtgt 2101 gctgtattca ggaaacccat ctcacgtgca gagacacaca taggctcaaa ataaaaggat 2161 ggaggaagat ctaccaagca aatggaaaac aaaaaaaggc aggggttgca atcctagtct 2221 ctgataaaac agactttaaa ccaacaaaga tcaaaagaga caaagaaggc cattacataa 2281 tggtaaaggg atcaattcaa caagaggagc taactatcct aaatatttat gcacccaata 2341 caggagcacc cagattcata aagcaagtcc tgagtgacct acaaagagac ttagactccc 2401 acacattaat aatgggagac tttaacaccc cactgtcaac attagacaga tcaacgagac 2461 agaaagtcaa caaggatacc caggaattga actcagctct gcaccaagcg gacctaatag 2521 acatctacag aactctccac cccaaatcaa cagaatatac atttttttta gcaccacacc 2581 acacctattc caaaattgac cacatagttg gaagtaaagc tctcctcagc aaatgtaaaa 2641 gaacagaaat tataacaaac tatctctcag accacagtgc aatcaaacta gaactcagga 2701 ttaagaatct cactcaaagc cgctcaacta catggaaact gaacaacctg ctcctgaatg 2761 actactgggt acataatgaa atgaaggcag aaataaagat gttctttgaa accaacgaga 2821 acaaagacac cacataccag aatctctggg acgcattcaa agcagtgtgt agagggaaat 2881 ttatagcact aaatgcctac aagagaaagc aggaaagatc caaaattgac accctaacat 2941 cacaattaaa agaactagaa aagcaagagc aaacacattc aaaagctagc agaaggcaag 3001 aaataactaa aatcagagca gaactgaagg aaatagagac acaaaaaacc cttcaaaaaa 3061 tcaatgaatc caggagctgg ttttttgaaa ggatcaacaa aattgataga ccactagcaa 3121 gactaataaa gaaaaaaaga gagaagaatc aaatagacac aataaaaaat gataaagggg 3181 atatcaccac cgatcccaca gaaatacaaa ctaccatcag agaatactac aaacacctct 3241 atgcaaataa actagaaaat ctagaagaaa tggatacatt ccttgacaca tacactctcc 3301 caagactaaa ccaggaagaa gttgaatctc tgaatagacc aataacaggc tctgaaattg 3361 tggcaataat caatagttta ccaaccaaaa agagtccagg accagatgga ttcacagccg 3421 aattctacca gaggtacaag gaggaactgg taccattcct tctgaaacta ttccaatcaa 3481 tagaaaaaga gggaatcctc cctaactcat tttatgaggc cagcatcatt ctgataccaa 3541 agccgggcag agacacaacc aaaaaagaga attttagacc aatatccttg atgaacattg 3601 atgcaaaaat cctcaataaa atactggcaa accgaatcca gcagcacatc aaaaagctta 3661 tccaccatga tcaagtgggc ttcatccctg ggatgcaagg ctggttcaat atacgcaaat 3721 caataaatgt aatccagcat ataaacagag ccaaagacaa aaaccacatg attatctcaa 3781 tagatgcaga aaaagccttt gacaaaattc aacaaccctt catgctaaaa actctcaata 3841 aattaggtat tgatgggacg tatttcaaaa taataagagc tatctatgac aaaccaacag 3901 ccaatatcag actgaatggg caaaaactgg aagcattccc tttgaaaact ggcacaagac 3961 agggatgccc tctctcaccg ctcctattca acatagtgtt ggaagttctg gccagggcaa 4021 tcaggcagga gaaggaaata aagggtattc aattaggaaa agaggaagtc aaattgtccc 4081 tgtttgcaga cgacatgatt gtttatctag aaaaccccat cgtctcagcc caaaatctcc 4141 ttaagctgat aagcaacttc agcaaagtct caggatacaa aatcaatgta caaaaatcac 4201 aagcattctt atacaccaac aacagacaaa cagagagcca aatcatgagt gaactcccat 4261 tcacaattgc ttcaaagaga ataaaatacc taggaatcca acttacaagg gatgtgaagg 4321 acctcttcaa ggagaactac aaaccactgc tcaaggaaat aaaagaggac acaaacaaat 4381 ggaagaacat tccatgctca tgggtaggaa gaatcaatat cgtgaaaatg gccatactgc 4441 ccaaggtaat ttacagattc aatgccatcc ccatcaagct accaatgact ttcttcacag 4501 aattggaaaa aactacttta aagttcatat ggaaccaaaa aagagcctgc atcgccaagt 4561 caatcctaag ccaaaagaac aaagctggag gcatcacact acctgacttc aaactatact 4621 acaaggctac agtaaccaaa acagcatggt actggtacca aaacagagat atagatcaat 4681 ggaacagaac agagccctca gaaataacgc cgcatatcta caactatctg atctttgaca 4741 aacctgagaa aaacaagcaa tggggaaagg attccctatt taataaatgg tgctgggaaa 4801 actggctagc catatgtaga aagctgaaac tggatccctt ccttacacct tatacaaaaa 4861 tcaattcaag atggattaaa gatttaaacg ttagacctaa aaccataaaa accctagaag 4921 aaaacctagg cattaccatt caggacatag gcacgggcaa ggacttcatg tccaaaacac 4981 caaaagcaat ggcaacaaaa gccaaaattg acaaatggga tctaattaaa ctcaagagct 5041 tctgcacagc aaaagaaact accatcagag tgaacaggca acctacaaca tgggagaaaa 5101 ttttcgcaac ctactcatct gacaaagggc taatatccag aatctacaat gaactcaaac 5161 aaatttacaa gaaaaaaaca aacaacccca tcaaaaactg ggcaaaggac atgaacagac 5221 acttctcaaa agaagacatt tatgcagcca aaaaacacat gaaaaaatgc tcatcatcac 5281 tggccatcag agaaatgcaa atcaaaacca ctatgagata tcatctcaca ccagttagaa 5341 tggcaatcat taaaaagtca ggaaacaaca ggtgctggag aggatgtgga gaaataggaa 5401 cacttttaca ctgttggtgg gactgtaaac tagttcaacc attgtggaag tcagtgtggc 5461 gattcctcag ggatctagaa ctagaaatac catttgaccc agccatccca ttactgggta 5521 tatacccaaa tgactataaa tcatgctgct ataaagacac atgcacacgt atgtttattg 5581 cggcattatt cacaatagca aagacttgga accaacccaa atgtccaaca atgatagact 5641 ggattaagaa aatgtggcac atatacacca tggaatacta tgcagccata aaaaatgatg 5701 agttcatgtc ctttgtaggg acatggatga aattggaaac catcattgtc agtaaactat 5761 cgcaagaaca aaaaaccaaa caccgcatat tctcactcat aggtgggaat tgaacaatga 5821 gatcacatgg acacaggaag gggaatatca cactctgggg actgtggtgg ggttggggga 5881 ggggaggggg atagcattgg gagatatacc taatgctaga tgacacatta gtgggtgcag 5941 cacaccagca tggcacatgt atacatatgt aactaacctg cacaatgtgc acatgtaccc 6001 taaaacttaa agtataata // LOCUS HSX99050 3510 bp DNA PRI 09-OCT-1997 DEFINITION H.sapiens mRNA; UV Radiation Resistance Associated Gene. ACCESSION X99050 NID g2102666 KEYWORDS UV radiation resistance associated gene; UVRAG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3510) AUTHORS Canaani,D. TITLE Direct Submission JOURNAL Submitted (04-JUL-1996) D. Canaani, Tel-Aviv University, Biochemistry, Faculty of Life Sciences, Tel-Aviv University, Ramat-Aviv, Tel-Aviv 69978, ISRAEL REMARK Revised by [3] REFERENCE 2 (bases 1 to 3510) AUTHORS Canaani,D. TITLE Direct Submission JOURNAL Submitted (13-MAY-1997) D. Canaani, Tel-Aviv University, Biochemistry, Faculty of Life Sciences, Tel-Aviv University, Ramat-Aviv, Tel-Aviv 69978, ISRAEL FEATURES Location/Qualifiers source 1..3510 /organism="Homo sapiens" /note="between markers D11S906 and D11S916" /db_xref="taxon:9606" /cell_line="KCL22" /cell_type="hematopoetic" /chromosome="11" /map="q13" /clone="6a" /clone="7" /clone_lib="lambda ZAP II/KCL22" mRNA <1..>3510 gene 176..2122 /gene="UVRAG" CDS 176..2122 /gene="UVRAG" /note="UV Radiation Resistance Associated Gene" /codon_start=1 /evidence=experimental /product="p63 (processed form)" /db_xref="PID:e354226" /db_xref="PID:g2102667" /translation="MSASASVGGPVPQPPPGPAAALPPGSAARALHVELPSQQRRLRH LRNIAARNIVNRNGHQLLDTYFTLHLCSTEKIYKEFYRSEVIKNSLNPTWRSLDFGIM PDRLDTSVSCFVVKIWGGKENIYQLLIEWKVCLDGLKYLGQQIHARNQNEIIFGLNDG YYGAPFEHKGYSNAQKTILLQVDQNCVRNSYDVFSLLRLHRAQCAIKQTQVTVQKIGK EIEEKLRLTSTSNELKKKSECLQLKILVLQNELERQKKALGREVALLHKQQIALQDKG SAFSAEHLKLQLQKESLNELRKECTAKRELFLKTNAQLTIRCRQLLSELSYIYPIDLN EHKDYFVCGVKLPNSEDFQAKDDGSIAVALGYTAHLVSMISFFLQVPLRYPIIHKGSR STIKDNINDKLTEKEREFPLYPKGGEKLQFDYGVYLLNKNIAQLRYQHGLGTPDLRQT LPNLKNFMEHGLMVRCDRHHTSSAIPVPKRQSSIFGGADVGFSGGIPSPDKGHRKRAS SENERLQYKTPPPSYNSALAQPVTTVPSMGETERKITSLSSSLDTSLDFSKENKKKGE DLVGSLNGGHANVHPSQEQGEALSGHRATVNGTLLPSEQAGSASVQLPGEFHPVSEAE LCCTVEQAEEIIGLEAQVSPQVIS" polyA_signal 3489..3494 BASE COUNT 974 a 795 c 849 g 892 t ORIGIN 1 tccagcggcg gcaacggcgg cagcggcggc agcggcggcg gctactgtct gggctgagca 61 gtagtgcctc tcgggtggcg ggtttctagg ctgcaggggc ttggtaggtg gtggcaaggg 121 ggcggcggcg gatgccggaa gagtgcccgc cccgcttggc ggcccctgga tcgagatgag 181 cgcctccgcg tcggtcgggg gccccgtccc ccagccaccc ccgggcccgg ccgctgctct 241 gcctcccggt tctgccgcgc gggccctgca tgtggagctg ccgtctcagc agcggcgtct 301 tcgacatctt cggaacattg ctgcccggaa cattgttaat agaaatggcc atcagctcct 361 tgatacctac tttacacttc acttgtgtag tactgaaaag atatataaag aattttatag 421 aagtgaagtg attaagaatt ccttgaatcc cacgtggcga agtctcgatt ttggaattat 481 gccagaccgt cttgatacat ctgtgtcttg tttcgtggtg aagatatggg gtggaaagga 541 gaacatctac cagctgttga ttgaatggaa agtctgtttg gatgggctga aatacttggg 601 tcagcagatt catgcccgaa accaaaatga gataattttt gggctgaatg atggatacta 661 tggtgctcca tttgaacata agggttattc aaatgctcag aagactattc ttctgcaggt 721 ggatcagaac tgtgttcgca attcttacga tgtcttctct ttgctacggc ttcatagagc 781 ccagtgtgca attaaacaga ctcaggtaac tgttcagaaa attggaaagg aaattgaaga 841 aaaactaaga ctcacatcta caagcaatga actgaaaaaa aaaagtgaat gcctgcagtt 901 aaaaattttg gtgcttcaga atgaactgga acggcagaag aaagctttgg gacgggaggt 961 ggcattactg cataagcaac aaattgcatt acaagacaaa ggaagtgcat tttcagctga 1021 gcacctcaaa cttcaactcc agaaggaatc cctaaatgag ctgaggaagg agtgcactgc 1081 aaaaagagaa ctcttcttga agactaatgc tcagttgaca attcgttgca ggcagttact 1141 ctctgagctt tcctacattt accctattga tttgaatgaa cataaggatt actttgtatg 1201 cggtgtcaag ttgcctaatt ctgaggactt ccaagcaaaa gatgatggaa gcattgctgt 1261 tgcccttggt tatactgcac atctggtctc catgatttcc tttttcctac aagtgcccct 1321 cagatatcct ataattcata aggggtctag atcaacaatc aaagacaata tcaatgacaa 1381 actgacggaa aaggagagag agtttccact gtatccaaaa ggaggggaga agttgcagtt 1441 tgattatggt gtctatcttc tgaacaaaaa tatagcacag ctaagatatc aacatggact 1501 agggactcca gacttgcggc aaacccttcc caacctgaaa aacttcatgg agcatggact 1561 aatggtcagg tgtgacagac atcacacctc cagtgcaatc cctgttccta agagacaaag 1621 ctccatattt gggggtgcag atgtaggctt ctctgggggg atcccttcac cagacaaagg 1681 acatcgaaaa cgggccagct ctgagaatga gagacttcag tacaaaaccc ctcctcccag 1741 ttacaactca gcattagccc agcctgtgac caccgtcccc tccatgggag agaccgagag 1801 aaagataaca tctctatcct cctccttgga tacctccttg gacttctcca aagaaaacaa 1861 gaaaaaagga gaggatctag ttggcagctt aaacggaggc cacgcgaatg tgcaccctag 1921 ccaagaacaa ggagaagccc tctccgggca ccgggccaca gtcaatggca ctctcctacc 1981 cagcgagcag gccgggtccg ccagtgtcca gcttccaggc gagttccacc cagtctcaga 2041 agctgagctc tgctgtactg tggagcaagc agaagaaatc atcgggctgg aagcacaggt 2101 ttcgcctcag gtgatcagct agaagcattt aactgcatcc cagtggacag tgctgtggca 2161 gtagagtgtg acgaacaagt tctgggagaa tttgaagagt tctcccgaag gatctatgca 2221 ctgaatgaaa acgtatccag cttccgccgg ccgcgcagga gttccgataa gtgaagtgag 2281 caggtcaaca gtaggactgg ggcagaagct ctgcctaaaa tgaagtgaaa gctgcactta 2341 accctttgtg ataatgatga cacaaaatga atattaatgg aggatattcc tcggaaaaac 2401 agactttggg aatgaaggag ggactcagga tcattgttat cagtgggcca aagttagatt 2461 ttgctttcaa gatttgcttt tcgggcctga tgattttaaa gcaaaaatca ccctctagtt 2521 gaaagagctt acagctcgag tcacctttta gctatttgtc tgctttttat ttacccttgt 2581 atgttatcct cagagggaag atgataatat ataataatat aatgaacaca cccttagttt 2641 ctcataagca tttgccctca ccatggttta taaaactttg ggaaaacgga atattcagaa 2701 ataggtttcc gccatgtact gaaaggtctg tggccatctg tgaggtagat gaagaagcag 2761 catagtggtc tccttacatc taggcctaac tgtccctctt cctgcccccg ggtaccacag 2821 tccaccttta gaccctactg tcgccccatc ttctccgtgg atgggccatg cgttcctgaa 2881 aacaggacat caagattcac tggttctgta acccagtagc tgtgacgttc catctcttct 2941 aaccagccat ggccttcccc tcctctgcca tacccttaat gcggccctca gattagatga 3001 aaaacttgct cctggtggat cccaagggac cctcaaggac ctcgaggtta ctgcagtcag 3061 atgccatctc atccctgtgg gggccaaagt ttttatgtgg gcagatgctg tggtcaggaa 3121 ctaggcatgc tttctggcaa tgcactcacc agacaaaaat ccttgatgta aatcccatgt 3181 taatttatta aatttagtca gaaggtcagc atttacatga cagaatgtat gtagagagtt 3241 ggggtgtctg gtaggcaaac tgcaaggcag ttgagatagt tggattaaga ggctagacga 3301 gacatagaat actattggtg atgtgtgcaa tttcatgaat attaaattat gtttcgaagt 3361 ccagttgtca ttcccgcatt cagatttcat ttgctgatga ctttatacgt tacgtaccca 3421 aggacattgc ctcagggttg caaactcttt aaaggcaaaa tttatccata tatccatgta 3481 ttatatagaa taaaaattga agtttacttc // LOCUS HSY08564 1737 bp DNA PRI 01-MAY-1997 DEFINITION H.sapiens GalNAc-T4 gene. ACCESSION Y08564 NID g1934911 KEYWORDS GalNAc-T4 gene; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1737) AUTHORS Bennett,E.P. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1737) AUTHORS Bennett,E.P. TITLE Direct Submission JOURNAL Submitted (01-OCT-1996) E.P. Bennett, Dental School, University Of Copenhagen, Norre Alle 20, 2200 Copenhagen N, DENMARK REMARK Revised by [3] REFERENCE 3 (bases 1 to 1737) AUTHORS Bennett,E.P. TITLE Direct Submission JOURNAL Submitted (10-APR-1997) E.P. Bennett, Dental School, University Of Copenhagen, Norre Alle 20, 2200 Copenhagen N, DENMARK FEATURES Location/Qualifiers source 1..1737 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="genomic P1 library, #6212" gene 1..1737 /gene="GalNAc-T4" CDS 1..1737 /gene="GalNAc-T4" /note="fourth member of the GalNAc transferase gene family" /codon_start=1 /product="UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase" /db_xref="PID:e307951" /db_xref="PID:g1934912" /translation="MAVRWTWAGKTCLLLAFLTVAYIFVELLVSTFHASAGAGRAREL GSRRLSDLQKNTEDLSRPLYKKPPADSRALGEWGKASKLQLNEDELKQQEELIERYAI NIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLE TSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDRVRLIRTNKREGLVRARLIGATFA TGDVLTFLYCHCECNSGWLEPLLERIGRYETAVVCPVIDTIDWNTFEFYMQIGEPMIG GFDWRLTFQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVW GGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRS RGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFNSVTELCAEVPE QKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEGRPDVQMRTCD ALDKNQIWSFEK" BASE COUNT 482 a 354 c 437 g 464 t ORIGIN 1 atggcggtga ggtggacttg ggcaggcaag acctgcctgc tgctggcgtt tttaacagtg 61 gcctatatct tcgtggagct cttggtctct acttttcatg cctccgcagg agccggccgt 121 gccagggagc tggggtcaag aaggctctca gacctccaga aaaatacgga ggatttgtct 181 cgaccgcttt ataagaagcc ccctgcagat tcccgtgcac ttggggagtg ggggaaagcc 241 agcaaactcc agctcaacga ggatgaactg aagcagcaag aagaactcat tgagagatac 301 gccatcaata tttacctcag tgacaggatt tccctgcatc gacacataga ggataaaaga 361 atgtatgagt gtaagtccca gaagttcaac tataggacac ttcctaccac ctctgttatc 421 attgctttct ataacgaagc ctggtcgact ttgctccgta ccattcacag tgttttagaa 481 acttctcctg cagttctttt gaaagagatc atcttggtgg atgacttgag tgacagagtt 541 tatttgaaga cacaacttga aacttacatc agcaatcttg atagagtacg cttgattagg 601 accaataagc gagaggggct ggttagggcc cgtctgattg gggccacttt cgccactggg 661 gacgtcctca ctttcctgta ttgtcactgt gagtgtaatt ccggttggct ggaaccgctt 721 ttggaaagga ttgggagata tgaaacagca gttgtgtgtc ctgttataga cacaattgat 781 tggaatactt ttgaattcta tatgcagata ggggagccca tgattggtgg gtttgactgg 841 cgtttaacat ttcagtggca ttctgtcccc aaacaggaaa gggacaggcg gatatcaaga 901 attgacccca tcagatcacc taccatggct ggaggactgt ttgctgtcag caagaaatat 961 tttcagtacc ttggaacgta tgacacagga atggaagtgt ggggaggtga aaaccttgag 1021 ctgtctttta gggtgtggca gtgtggtggc aaattggaga tccacccgtg ttcccacgtg 1081 ggccatgtgt tccccaagcg ggcaccatat gctcgcccca atttcctaca gaatactgct 1141 cgggcagcag aagtttggat ggatgaatac aaagagcact tctacaatag aaaccctcca 1201 gcaagaaaag aagcttatgg tgatatttct gaaagaaaat tactacgaga gcggttgaga 1261 tgcaagagct ttgactggta tttgaaaaac gtttttccta atttacatgt tccagaggat 1321 agaccaggct ggcatggggc tattcgcagt agagggatct cgtctgaatg tttagattat 1381 aattctcctg acaacaaccc cacaggtgct aacctttcac tgtttggatg ccatggtcaa 1441 ggaggcaatc aattctttga atatacttca aacaaagaaa taaggtttaa ttctgtgaca 1501 gagttatgtg cagaggtacc tgagcaaaaa aattatgtgg gaatgcaaaa ttgtcccaaa 1561 gatgggttcc ctgtaccagc aaacattatt tggcatttta aagaagatgg aactattttt 1621 cacccacact caggactgtg tcttagtgct tatcggacac cggagggccg acctgatgta 1681 caaatgagaa cttgtgatgc tctagataaa aatcaaattt ggagttttga gaaatag // LOCUS HSYB1 2550 bp DNA PRI 08-JUL-1996 DEFINITION H.sapiens YB-1 gene promoter region. ACCESSION X96666 NID g1403348 KEYWORDS promoter region; Y box binding protein; YB-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2550) AUTHORS Makino,Y., Ohga,T., Toh,S., Koike,K., Okumura,K., Wada,M., Kuwano,M. and Kohno,K. TITLE Structural and functional analysis of the human Y-box binding protein (YB-1) gene promoter JOURNAL Nucleic Acids Res. 24 (10), 1873-1878 (1996) MEDLINE 96226173 REFERENCE 2 (bases 1 to 2550) AUTHORS Yoshinari,M. TITLE Direct Submission JOURNAL Submitted (15-MAR-1996) Yoshinari M., Department of Biochemistry, Kyushu University School of Medicine, Maidashi, Fukuoka, 812-82, JAPAN FEATURES Location/Qualifiers source 1..2550 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 1..1855 /gene="YB-1" gene 1..2411 /gene="YB-1" exon 1856..2352 /gene="YB-1" /number=1 repeat_region 1904..1914 /rpt_type=INVERTED protein_bind 1957..1962 /gene="YB-1" /bound_moiety="SP-1" protein_bind 2001..2006 /gene="YB-1" /bound_moiety="SP-1" protein_bind 2024..2029 /gene="YB-1" /bound_moiety="SP-1" protein_bind 2033..2038 /gene="YB-1" /bound_moiety="SP-1" repeat_region 2034..2044 /rpt_type=INVERTED protein_bind 2100..2105 /gene="YB-1" /bound_moiety="SP-1" CDS 2187..2411 /gene="YB-1" /codon_start=1 /db_xref="PID:e229758" /db_xref="PID:g1403349" /translation="MSSEAETQQPPAAPPAAPALSAADTKPGTTGSGAGSGGPGGLTS AAPAGGDKKVIGEDRQGRGWGPRAAQQRNR" BASE COUNT 611 a 708 c 660 g 571 t ORIGIN 1 gtttgctacc tctgctcctg cagtatcctg cagccagtag ttacaagcac ctggaagttt 61 ggacagggaa tacaagagaa gactctaaga cttctcttag aagtcataga gtccagtaaa 121 ccacattatg gaatctggac taccctaagg gtaatggtag tcactggaag attttaagta 181 agacagtttc atgatcagat agatgttctc aaaagaactt tctggctgtt gtggtgggta 241 atattagagt aagaaagagg tagagaggtg tggttagtag ccaacagcac tgctctatgc 301 taggggtcag ccgactacac tccgtttttc tgtggctcat gagctaagaa tacaaatggt 361 ttttaaaatg tttaaatagt tgaaaatatg taaagattcg tgtcatgtaa aaattaaggc 421 tgttttcaag ctccaaacct gagtagtaag tcagaaccat atggtcggcc aagcttatat 481 tattaactat ttggctctca cagaaaaagg ttcctacccc ttgctctaag caattggtga 541 tgatggccta cagtagtgac agtaggaatg aaaagactct gaattgacat ttagagggtt 601 taggactgac tcaagaacgc ctttaggagg tggaactcac aggcctagac ggcattggta 661 ggggtaagga atactgcatt agcctcctaa ctgctttctc tacttccatt ccttgcccct 721 ctgcaaccca ttctccactc cgcagccatt tttaaaaaga tgcccctccc tacttatgac 781 tctaaaattg ctcttctcac tcttcccctc aggatatatt tccaattaaa tatacctaag 841 tgactgccca cctctgcaac ccaatgtcac attcgagtct tactgaacta cttgactgca 901 tttcccgaga tctcacctct tctcgcctgt accctgtgcg cggaaagtca gccctccacc 961 ttctccctgc ttccactccc aaaatacttc gtggttttgc agctctggag tatttaccgt 1021 gttggctgtt taaatttctg cctccatcag aaggcagaaa ctgactcgcg aactattcca 1081 tccccagccg atagtagacg cttaaaaaag aacggaagaa ggtgggtggg aggacttcag 1141 taacatcagg tggcagcctc aattttatcg tttgtgaaac gtggatagta atccctctat 1201 cacgtggctg ttgcaggaat aaagtgaaaa aacaaaacag gctagcttgt tcaataaatg 1261 tgagttgaat taaatctgat ttgtggtcag tagaaaaaga tgtgaatact tggaaaggaa 1321 gacacatttt tttaaatata tgcctggtaa aacggatcag aaggcaggtc cccatggagc 1381 acaccctcgc cctaaacatg ctgaacccgg gctgccatag cctgcgtggt ccctccaagg 1441 tgactgctcc gacaaaaggg tacgctcttc aaacgcatac gtttaaggca attccagaaa 1501 ccctcggctg tgccgcgact acacggccat taaagaaaag acgactctat gcccgccgta 1561 atgttctcag atcacaggga ccgtatttgg agctgggagg gagggaagcc ttttcttcac 1621 ggggggctaa ggcgtcttcg agcccccttc caatcccggg tccggccggg taatccctgc 1681 ccagcgttcg ggcgtgcctt tttttcagcc gagacacaac cctgaacgtg ggggcccgcc 1741 agcccggcgg ctgcctcgtg gaagtcacgt tccttctgcc cgtcctctcg ggtactctat 1801 ggttttcgtg gccgactact ctaattctag ttccggtctc tatggcggcc ggcggaggca 1861 ggaacggttg taggtcgact gaattagccg ccaaaggtcc aatgagaatg gaggactgat 1921 aaaatattag ccaatagaag ctagggattg gggtcaggtg ggcagattga cagtaccact 1981 ggccagtgaa caacgcctag ggcgggtcgc tcgtagggct tatcccgcct gtcccgccat 2041 tctcgctagt tcgatcggta gcgggagcgg agagcggacc ccagagagcc ctgagcagcc 2101 ccaccgccgc cgccggccta gttaccatca caccccggga ggagccgcag ctgccgcagc 2161 cggccccagt caccatcacc gcaaccatga gcagcgaggc cgagacccag cagccgcccg 2221 ccgccccccc cgccgccccc gccctcagcg ccgccgacac caagcccggc actacgggca 2281 gcggcgcagg gagcggtggc ccgggcggcc tcacatcggc ggcgcctgcc ggcggggaca 2341 agaaggtcat cggtgaggac cgacagggac gggggtgggg ccctcgggca gcccagcagc 2401 ggaaccgtta gccggagctg ggcgagccgg cgggcgcgcg gccggtgggc acccactccg 2461 cggcggcggc ccgccatccc ccccgtgccc ccctcaatcc ctctcgcggg gaccgcccgg 2521 cagtgcgcgc gcactgctcc cgctccccct // LOCUS HUM215MBP 622 bp DNA PRI 19-JAN-1996 DEFINITION Homo sapiens synthetic myelin basic protein 21.5 kDa isoform gene, complete cds. ACCESSION L41657 NID g1162921 KEYWORDS myelin basic protein; synthetic DNA; synthetic gene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kamholz,J., de Ferra,F., Puckett,C. and Lazzarini,R. TITLE Identification of three forms of human myelin basic protein by cDNA cloning JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (13), 4962-4966 (1986) MEDLINE 86259714 REFERENCE 2 (bases 1 to 622) AUTHORS Nye,S.H., Pelfrey,C.M., Burkwit,J.J., Voskuhl,R.R., Lenardo,M.J. and Mueller,J.P. TITLE Purification of immunologically active recombinant 21.5 kDa isoform of human myelin basic protein JOURNAL Mol. Immunol. 32 (14-15), 1131-1141 (1995) MEDLINE 96128281 COMMENT Sequence M13577 overlaps this sequence. FEATURES Location/Qualifiers source 1..622 /organism="Homo sapiens" /note="synthetically derived in the laboratory with oligonucleotides" /db_xref="taxon:9606" /map="18q22-qter" mRNA 1..622 gene 4..615 /gene="MBP" CDS 4..615 /gene="MBP" /note="21.5 kDa isoform; bp 595-612: histidine tag" /codon_start=1 /db_xref="GDB:G00-119-379" /product="myelin basic protein" /db_xref="PID:g1162922" /translation="MASQKRPSQRHGSKYLATASTMDHARHGFLPRHRDTGILDSIGR FFGGDRGAPKRGSGKVPWLKPGRSPLPSHARSQPGLCNMYKDSHHPARTAHYGSLPQK SHGRTQDENPVVHFFKNIVTPRTPPPSQGKGRGLSLSRFSWGAEGQRPGFGYGGRASD YKSAHKGFKGVDAQGTLSKIFKLGGRDSRSGSPMARRHHHHHH" BASE COUNT 120 a 220 c 167 g 115 t ORIGIN 1 catatggcgt ctcagaaacg tccgtcccag cgtcacggct ccaaatacct ggccaccgcc 61 agcaccatgg accatgcccg tcatggcttc ctgccgcgtc accgtgacac cggcatcctg 121 gactccatcg gccgcttctt cggcggtgac cgtggtgcgc cgaaacgtgg ctctggcaaa 181 gtgccgtggc tgaaaccggg ccgtagcccg ctgccgtctc atgcccgtag ccagccgggc 241 ctgtgcaaca tgtacaaaga ctcccaccac ccggctcgta ccgcgcacta tggctccctg 301 ccgcagaaat cccacggccg tacccaggat gaaaacccgg tggtgcactt cttcaaaaac 361 attgtgaccc cgcgtacccc gccgccgtct cagggcaaag gccgtggcct gtccctgagc 421 cgtttcagct ggggcgccga aggccagcgt ccgggcttcg gctacggcgg ccgtgcgtcc 481 gactataaat ctgctcacaa aggcttcaaa ggcgtggatg cccagggcac cctgtccaaa 541 attttcaaac tgggcggccg tgatagccgt tctggctctc cgatggctag acgtcatcac 601 catcaccatc actaataagc tt // LOCUS HUM5HT1DA 1506 bp DNA PRI 23-MAR-1992 DEFINITION Human 5-HT1D-type serotonin receptor gene, complete cds. ACCESSION M89955 NID g177771 KEYWORDS 5-HT1D-type serotonin receptor. SOURCE Homo sapiens (library: lambda FIX II, stratagene #946203) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1506) AUTHORS Hamblin,M.W. and Metcalf,M.A. TITLE Primary structure and functional characterization of a human 5-HT1D-type serotonin receptor JOURNAL Mol. Pharmacol. 40, 143-148 (1991) MEDLINE 91342595 FEATURES Location/Qualifiers source 1..1506 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda FIX II, stratagene #946203" CDS 271..1404 /note="RDC4 homologue; putative" /codon_start=1 /product="5-HT1D-type serotonin receptor" /db_xref="PID:g177772" /translation="MSPLNQSAEGLPQEASNRSLNATETSEAWDPRTLQALKISLAVV LSVITLATVLSNAFVLTTILLTRKLHTPANYLIGSLATTDLLVSILVMPISIAYTITH TWNFGQILCDIWLSSDITCCTASILHLCVIALDRYWAITDALEYSKRRTAGHAATMIA IVWAISICISIPPLFWRQAKAQEEMSDCLVNTSQISYTIYSTCGAFYIPSVLLIILYG RIYRAARNRILNPPSLYGKRFTTAHLITGSAGSSLCSLNSSLHEGHSHSAGSPLFFNH VKIKLADSALERKRISAARERKATKILGIILGAFIICWLPFFVVSLVLPICRDSCWIH PALFDFFTWLGYLNSLINPIIYTVFNEEFRQAFQKIVPFRKAS" BASE COUNT 309 a 457 c 322 g 418 t ORIGIN 1 agaccttaac taccagctgg tagttgtctc agcattcttc aaatagtccg gtcttgttta 61 atattattat tattattgtt atttaatttt attttattgc aactgtactt agagaatagt 121 ctggttcttg agaccttttc actgtggtct gttctggtgt acggctccca ccagtgtgaa 181 gcagaaggat gactttgctc tgttgtcagg acaaccttga aggaaggagc caaatgtgtg 241 gaggtctgtg ggaagagaga gccacctagc atgtccccac tgaaccagtc agcagaaggc 301 cttccccagg aggcctccaa cagatccctg aatgccacag aaacctcaga ggcttgggat 361 cccaggaccc tccaggcgct caagatctcc cttgccgtgg tcctttccgt catcacactg 421 gccacagtcc tctccaatgc ctttgtactc accaccatct tactcaccag gaagctccac 481 acccctgcca actacctgat tggctccctg gccaccaccg acctcttggt ttccatcttg 541 gtaatgccca tcagcatcgc ctataccatc acccacacct ggaactttgg ccaaatcttg 601 tgtgacatct ggctgtcctc tgacatcacg tgctgcacag cctccatcct gcatctctgt 661 gtcattgctc tggacaggta ctgggcaatc acagatgccc tggaatacag taaacgcagg 721 acggctggcc acgcggccac catgatcgcc attgtctggg ccatctccat ctgcatctcc 781 atccccccgc tcttctggcg gcaggccaag gcccaggagg agatgtcgga ctgtctggtg 841 aacacctctc agatctccta caccatctac tccacctgtg gggccttcta cattccctcg 901 gtgttgctca tcatcctata tggccggatc taccgggctg cccggaaccg catcctgaat 961 ccaccctcac tctatgggaa gcgcttcacc acggcccacc tcatcacagg ctctgccggg 1021 tcctcgctct gctcgctcaa ctccagcctc catgaggggc actcgcactc ggctggctcc 1081 cctctctttt tcaaccacgt gaaaatcaag cttgctgaca gtgccctgga acgcaagagg 1141 atttctgctg ctcgagaaag gaaagccact aaaatcctgg gcatcattct gggggccttt 1201 atcatctgct ggctgccctt cttcgtggtg tctctggtcc tccccatctg ccgggactcc 1261 tgctggatcc acccggcgct ctttgacttc ttcacctggc taggctattt aaactccctc 1321 atcaatccaa taatctacac tgtgtttaat gaagagtttc ggcaagcttt tcagaaaatt 1381 gtccctttcc ggaaggcctc ctagtcttat tcggtgatga ctcttgttat cttttgtgtc 1441 ctgtaacctc atcgggattg tctttttttt ttttaattat tttctgagac ttggattaat 1501 tcatgg // LOCUS HUM5HTR1E 1221 bp DNA PRI 31-DEC-1994 DEFINITION Human serotonin receptor (5-HTR1E) gene, complete cds. ACCESSION M92826 NID g177777 KEYWORDS 5-hydroxytryptamine receptor; serotonin receptor. SOURCE Homo sapiens (tissue library: lambda DASHII, Stratagene) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1221) AUTHORS Zgombick,J.M., Schechter,L.E., Macchi,M., Hartig,P.R., Branchek,T.A. and Weinshank,R.L. TITLE Human gene S31 encodes the pharmacologically defined serotonin 5-hydroxytryptamine1E receptor JOURNAL Mol. Pharmacol. 42 (2), 180-185 (1992) MEDLINE 92382553 FEATURES Location/Qualifiers source 1..1221 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="placenta" /tissue_lib="lambda DASHII, Stratagene" 5'UTR 1..75 /gene="5HTR1E" gene 1..1221 /gene="5HTR1E" CDS 76..1173 /gene="5HTR1E" /note="putative" /codon_start=1 /product="serotonin receptor" /db_xref="PID:g177778" /translation="MNITNCTTEASMAIRPKTITEKMLICMTLVVITTLTTLLNLAVI MAIGTTKKLHQPANYLICSLAVTDLLVAVLVMPLSIIYIVMDRWKLGYFLCEVWLSVD MTCCTCSILHLCVIALDRYWAITNAIEYARKRTAKRAALMILTVWTISIFISMPPLFW RSHRRLSPPPSQCTIQHDHVIYTIYSTLGAFYIPLTLILILYYRIYHAAKSLYQKRGS SRHLSNRSTDSQNSFASCKLTQTFCVSDFSTSDPTTEFEKFHASIRIPPFDNDLDHPG ERQQISSTRERKAARILGLILGAFILSWLPFFIKELIVGLSIYTVSSEVADFLTWLGY VNSLINPLLYTSFNEDFKLAFKKLIRCREHT" 3'UTR 1174..1221 /gene="5HTR1E" BASE COUNT 303 a 343 c 268 g 307 t ORIGIN 1 tcgaggctac atagttttca gccaaaggaa aataaccaac agcttctcca cagtgtagac 61 tgaaacaagg gaaacatgaa catcacaaac tgtaccacag aggccagcat ggctataaga 121 cccaagacca tcactgagaa gatgctcatt tgcatgactc tggtggtcat caccaccctc 181 accacgttgc tgaacttggc tgtgatcatg gctattggca ccaccaagaa gctccaccag 241 cctgccaact acctaatctg ttctctggcc gtgacggacc tcctggtggc agtgctcgtc 301 atgcccctga gcatcatcta cattgtcatg gatcgctgga agcttgggta cttcctctgt 361 gaggtgtggc tgagtgtgga catgacctgc tgcacctgct ccatcctcca cctctgtgtc 421 attgccctgg acaggtactg ggccatcacc aatgctattg aatacgccag gaagaggacg 481 gccaagaggg ccgcgctgat gatccttacc gtctggacca tctccatttt catctccatg 541 ccccctctgt tctggagaag ccaccgccgc ctaagccctc cccctagtca gtgcaccatc 601 cagcacgacc atgttatcta caccatttac tccacgctgg gtgcgtttta tatccccttg 661 actttgatac tgattctcta ttaccggatt taccacgcgg ccaagagcct ttaccagaaa 721 aggggatcaa gtcggcactt aagcaacaga agcacagata gccagaattc ttttgcaagt 781 tgtaaactta cacagacttt ctgtgtgtct gacttctcca cctcagaccc taccacagag 841 tttgaaaagt tccatgcctc catcaggatc ccccccttcg acaatgatct agatcaccca 901 ggagaacgtc agcagatctc tagcaccagg gaacggaagg cagcacgcat cctggggctg 961 attctgggtg cattcatttt atcctggctg ccatttttca tcaaagagtt gattgtgggt 1021 ctgagcatct acaccgtgtc ctcggaagtg gccgactttc tgacgtggct cggttatgtg 1081 aattctctga tcaaccctct gctctatacg agttttaatg aagactttaa gctggctttt 1141 aaaaagctca ttagatgccg agagcatact tagactgtaa aaagctaaaa ggcacgactt 1201 tttccagagc ctcatgagtg g // LOCUS HUMA1ADAR 2077 bp DNA PRI 08-OCT-1996 DEFINITION Human DNA for alphalA/D adrenergic receptor, complete cds. ACCESSION D29952 NID g914933 KEYWORDS alpha1A/D adrenergic receptor. SOURCE Homo sapiens placenta (DNA) and prostate (mRNA) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Esbenshade,T.A., Hirasawa,A., Tsujimoto,G., Tanaka,T., Yano,J., Minneman,K.P. and Murphy,T.J. TITLE Cloning of the human alpha 1d-adrenergic receptor and inducible expression of three human subtypes in SK-N-MC cells JOURNAL Mol. Pharmacol. 47 (5), 977-985 (1995) MEDLINE 95265059 REFERENCE 2 (bases 1 to 2077) AUTHORS Yano,J. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 2077) AUTHORS Yano,J. TITLE Direct Submission JOURNAL Submitted (13-APR-1994) to the DDBJ/EMBL/GenBank databases. Junichi Yano, Nippon Shinyaku Co., Ltd., Dept. of Molecular Biology Division; Nishiohji, Hachijo, Minami-ku, Kyoto, Kyoto 601, Japan (Tel:075-321-1111(ex.7884), Fax:075-314-3269) COMMENT Suquence updated (08-JUL-1995) by: Junichi Yono. FEATURES Location/Qualifiers source 1..2077 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta(genomic DNA) and prostate(mRNA)" CDS 5..1723 /codon_start=1 /product="alpha1A/D adrenergic receptor" /db_xref="PID:d1006786" /db_xref="PID:g914934" /translation="MTFRDLLSVSFEGPRPDSSAGGSSAGGGGGSAGGAAPSEGPAVG GVPGGAGGGGGVVGAGSGEDNRSSAGEPGSAGAGGDVNGTAAVGGLVVSAQGVGVGVF LAAFILMAVAGNLLVILSVACNRHLQTVTNYFIVNLAVADLLLSATVLPFSATMEVLG FWAFGRAFCDVWAAVDVLCCTASILSLCTISVDRYVGVRHSLKYPAIMTERKAAAILA LLWVVALVVSVGPLLGWKEPVPPDERFCGITEEAGYAVFSSVCSFYLPMAVIVVMYCR VYVVARSTTRSLEAGVKRERGKASEVVLRIHCRGAATGADGAHGMRSAKGHTFRSSLS VRLLKFSREKKAAKTLAIVVGVFVLCWFPFFFVLPLGSLFPQLKPSEGVFKVIFWLGY FNSCVNPLIYPCSSREFKRAFLRLLRCQCRRRRRRRPLWRVYGHHWRASTSGLRQDCA PSSGDAPPGAPLALTALPDPDPEPPGTPEMQAPVASRRKPPSAFREWRLLGPFRRPTT QLRAKVSSLSHKIRAGGAQRAEAACAQRSEVEAVSLGVPHEVAEGATCQAYELADYSN LRETDI" BASE COUNT 302 a 719 c 703 g 353 t ORIGIN 1 tgagatgact ttccgcgatc tcctgagcgt cagtttcgag ggaccccgcc cggacagcag 61 cgcagggggc tccagcgcgg gcggcggcgg gggcagcgcg ggcggcgcgg ccccctcgga 121 gggcccggcg gtgggcggcg ttccgggggg cgcgggcggc ggcggcggcg tggtgggcgc 181 aggcagcggc gaggacaacc ggagctccgc gggggagccg gggagcgcgg gcgcgggcgg 241 cgacgtgaat ggcacggcgg ccgtcggggg actggtggtg agcgcgcagg gcgtgggcgt 301 gggcgtcttc ctggcagcct tcatccttat ggccgtggca ggtaacctgc ttgtcatcct 361 ctcagtggcc tgcaaccgcc acctgcagac cgtcaccaac tatttcatcg tgaacctggc 421 cgtggccgac ctgctgctga gcgccaccgt actgcccttc tcggccacca tggaggttct 481 gggcttctgg gcctttggcc gcgccttctg cgacgtatgg gccgccgtgg acgtgctgtg 541 ctgcacggcc tccatcctca gcctctgcac catctccgtg gaccggtacg tgggcgtgcg 601 ccactcactc aagtacccag ccatcatgac cgagcgcaag gcggccgcca tcctggccct 661 gctctgggtc gtagccctgg tggtgtccgt agggcccctg ctgggctgga aggagcccgt 721 gccccctgac gagcgcttct gcggtatcac cgaggaggcg ggctacgctg tcttctcctc 781 cgtgtgctcc ttctacctgc ccatggcggt catcgtggtc atgtactgcc gcgtgtacgt 841 ggtcgcgcgc agcaccacgc gcagcctcga ggcaggcgtc aagcgcgagc gaggcaaggc 901 ctccgaggtg gtgctgcgca tccactgtcg cggcgcggcc acgggcgccg acggggcgca 961 cggcatgcgc agcgccaagg gccacacctt ccgcagctcg ctctccgtgc gcctgctcaa 1021 gttctcccgt gagaagaaag cggccaagac tctggccatc gtcgtgggtg tcttcgtgct 1081 ctgctggttc cctttcttct ttgtcctgcc gctcggctcc ttgttcccgc agctgaagcc 1141 atcggagggc gtcttcaagg tcatcttctg gctcggctac ttcaacagct gcgtgaaccc 1201 gctcatctac ccctgttcca gccgcgagtt caagcgcgcc ttcctccgtc tcctgcgctg 1261 ccagtgccgt cgtcgccggc gccgccgccc tctctggcgt gtctacggcc accactggcg 1321 ggcctccacc agcggcctgc gccaggactg cgccccgagt tcgggcgacg cgccccccgg 1381 agcgccgctg gccctcaccg cgctccccga ccccgacccc gaacccccag gcacgcccga 1441 gatgcaggct ccggtcgcca gccgtcgaaa gccacccagc gccttccgcg agtggaggct 1501 gctggggccg ttccggagac ccacgaccca gctgcgcgcc aaagtctcca gcctgtcgca 1561 caagatccgc gccgggggcg cgcagcgcgc agaggcagcg tgcgcccagc gctcagaggt 1621 ggaggctgtg tccctaggcg tcccacacga ggtggccgag ggcgccacct gccaggccta 1681 cgaattggcc gactacagca acctacggga gaccgatatt taaggacccc agagctaggc 1741 cgcggagtgt gctgggcttg ggggtaaggg ggaccagaga ggcgggctgg tgttctaaga 1801 gcccccgtgc aaatcggaga cccggaaact gatcagggca gctgctctgt gacatccctg 1861 aggaactggg cagagcttga ggctggagcc cttgaaaggt gaaaagtagt ggggccccct 1921 gctggactca ggtgcccaga actcttttct tagaagggag aggctgcggg ctccgtgggg 1981 ccttttgctc ccaatcccta tttgagaaac actgccccat cctccatgcc ctgaaccctg 2041 agtagacagc cccaagcatg gccaggaagg cctgccc // LOCUS HUMACHRM2 2210 bp DNA PRI 30-OCT-1994 DEFINITION Human m2 muscarinic acetylcholine receptor gene. ACCESSION M16404 NID g177989 KEYWORDS acetylcholine receptor; m2 muscarinic acetylcholine receptor; neurotransmitter. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2210) AUTHORS Bonner,T.I., Buckley,N.J., Young,A.C. and Brann,M.R. TITLE Identification of a family of muscarinic acetylcholine receptor genes [published erratum appears in Science 1987 Sep 25;237(4822):237] JOURNAL Science 237 (4814), 527-532 (1987) MEDLINE 87263421 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.I.Bonner, 17-JUL-1987. FEATURES Location/Qualifiers source 1..2210 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q35-qter" intron <1..148 /note="ACHR-m2 intron A" prim_transcript <1..2171 /note="ACHR-m2 pre-mRNA" gene 195..1595 /gene="CHRM2" CDS 195..1595 /gene="CHRM2" /note="muscarinic acetylcholine receptor m2" /codon_start=1 /db_xref="GDB:G00-125-214" /db_xref="PID:g177990" /translation="MNNSTNSSNNSLALTSPYKTFEVVFIVLVAGSLSLVTIIGNILV MVSIKVNRHLQTVNNYFLFSLACADLIIGVFSMNLYTLYTVIGYWPLGPVVCDLWLAL DYVVSNASVMNLLIISFDRYFCVTKPLTYPVKRTTKMAGMMIAAAWVLSFILWAPAIL FWQFIVGVRTVEDGECYIQFFSNAAVTFGTAIAAFYLPVIIMTVLYWHISRASKSRIK KDKKEPVANQDPVSPSLVQGRIVKPNNNNMPSSDDGLEHNKIQNGKAPRDPVTENCVQ GEEKESSNDSTSVSAVASNMRDDEITQDENTVSTSLGHSKDENSKQTCIRIGTKTPKS DSCTPTNTTVEVVGSSGQNGDEKQNIVARKIVKMTKQPAKKKPPPSREKKVTRTILAI LLAFIITWAPYNVMVLINTFCAPCIPNTVWTIGYWLCYINSTINPACYALCNATFKKT FKHLLMCHYKNIGATR" BASE COUNT 657 a 467 c 474 g 612 t ORIGIN 19 bp upstream of AccI site. 1 acatgggaat taggcaggta gacacagtaa tcatgcaggg gaagggagat ttgggagaaa 61 ataatgtggt ttaaaaggag aaacaacatt atgtatttta aaccaatgtt tatattatgt 121 ttgttaattt tattctattt ccttgcaggt ttaaatgttt atttgctact tggctactga 181 ttagagaacg caaaatgaat aactcaacaa actcctctaa caatagcctg gctcttacaa 241 gtccttataa gacatttgaa gtggtgttta ttgtcctggt ggctggatcc ctcagtttgg 301 tgaccattat cgggaacatc ctagtcatgg tttccattaa agtcaaccgc cacctccaga 361 ccgtcaacaa ttacttttta ttcagcttgg cctgtgctga ccttatcata ggtgttttct 421 ccatgaactt gtacaccctc tacactgtga ttggttactg gcctttggga cctgtggtgt 481 gtgacctttg gctagccctg gactatgtgg tcagcaatgc ctcagttatg aatctgctca 541 tcatcagctt tgacaggtac ttctgtgtca caaaacctct gacctaccca gtcaagcgga 601 ccacaaaaat ggcaggtatg atgattgcag ctgcctgggt cctctctttc atcctctggg 661 ctccagccat tctcttctgg cagttcattg taggggtgag aactgtggag gatggggagt 721 gctacattca gtttttttcc aatgctgctg tcacctttgg tacggctatt gcagccttct 781 atttgccagt gatcatcatg actgtgctat attggcacat atcccgagcc agcaagagca 841 ggataaagaa ggacaagaag gagcctgttg ccaaccaaga ccccgtttct ccaagtctgg 901 tacaaggaag gatagtgaag ccaaacaata acaacatgcc cagcagtgac gatggcctgg 961 agcacaacaa aatccagaat ggcaaagccc ccagggatcc tgtgactgaa aactgtgttc 1021 agggagagga gaaggagagc tccaatgact ccacctcagt cagtgctgtt gcctctaata 1081 tgagagatga tgaaataacc caggatgaaa acacagtttc cacttccctg ggccattcca 1141 aagatgagaa ctctaagcaa acatgcatca gaattggcac caagacccca aaaagtgact 1201 catgtacccc aactaatacc accgtggagg tagtggggtc ttcaggtcag aatggagatg 1261 aaaagcagaa tattgtagcc cgcaagattg tgaagatgac taagcagcct gcaaaaaaga 1321 agcctcctcc ttcccgggaa aagaaagtca ccaggacaat cttggctatt ctgttggctt 1381 tcatcatcac ttgggcccca tacaatgtca tggtgctcat taacaccttt tgtgcacctt 1441 gcatccccaa cactgtgtgg acaattggtt actggctttg ttacatcaac agcactatca 1501 accctgcctg ctatgcactt tgcaatgcca ccttcaagaa gacctttaaa caccttctca 1561 tgtgtcatta taagaacata ggcgctacaa ggtaaaatat ctttgaaaaa gatagaaggt 1621 gggcaagggg agcttgagaa gaataaaagg gataaacgag ctcctagttt taaaatctct 1681 gccattgcac tttatagtct gattacaaaa cgtgcaattc aggagcccag cagtgacaca 1741 cttatcacgc ctaggctcca gtttgcaaaa attgcacctt ataaactgtc agtattagga 1801 gcaatgagac aatgaaagaa acatgttggg atcgtggatt taagaaacta tacactgttt 1861 ctcataatct cttgaagaag ggcttctgat tctacaattt tatcagtctc tgcacaagag 1921 gaataacctt gttccttttt tgttactttt gttgttgttg ttctcatgtg tccttaagag 1981 aaggaatgcc acagttacaa ggtaaacatg gagacttaaa cataaagaaa taggcactat 2041 acaatgggga cataaaaaaa gaaaatgaaa gaaggatgca gaaatttgtc tccggagtgt 2101 taagcatatt ttattctttt gttacggtcc tatttagagg attggaatgt aataaatgct 2161 tattttttgc ctttcttttt cccaccatga agagaaagca aacaaacaga // LOCUS HUMACHRM4 2595 bp DNA PRI 30-OCT-1994 DEFINITION Human m4 muscarinic acetylcholine receptor gene. ACCESSION M16405 NID g177991 KEYWORDS acetylcholine receptor; muscarinic acetylcholine receptor; neurotransmitter. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2595) AUTHORS Bonner,T.I., Buckley,N.J., Young,A.C. and Brann,M.R. TITLE Identification of a family of muscarinic acetylcholine receptor genes [published erratum appears in Science 1987 Sep 25;237(4822):237] JOURNAL Science 237 (4814), 527-532 (1987) MEDLINE 87263421 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.I.Bonner, 17-JUL-1987. FEATURES Location/Qualifiers source 1..2595 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p12-p11.2" intron <1..771 /note="ACHR-m4 intron A" prim_transcript <1..2595 /note="ACHR-m4 pre-mRNA" gene 801..2237 /gene="CHRM4" CDS 801..2237 /gene="CHRM4" /note="muscarinic acetylcholine receptor m4" /codon_start=1 /db_xref="GDB:G00-125-216" /db_xref="PID:g177992" /translation="MANFTPVNGSSGNQSVRLVTSSSHNRYETVEMVFIATVTGSLSL VTVVGNILVMLSIKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTVYIIKGYWPLGA VVCDLWLALDYVVSNASVMNLLIISFDRYFCVTKPLTYPARRTTKMAGLMIAAAWVLS FVLWAPAILFWQFVVGKRTVPDNHCFIQFLSNPAVTFGTAIAAFYLPVVIMTVLYIHI SLASRSRVHKHRPEGPKEKKAKTLAFLKSPLMKQSVKKPRPGGRPGGLRNGKLEEAPP PALPPPPRPVADKDTSNESSSGSATQNTKERPATELSTTEATTPAMPAPPLQPRALNP ASRWSKIQIVTKQTGNECVTAIEIVPATPAGMRPAANVARKFASIARNQVRKKRQMAA RERKVTRTIFAILLAFILTWTPYNVMVLVNTFCQSCIPDTVWSIGYWLCYVNSTINPA CYALCNATFKKTFRHLLLCQYRNIGTAR" BASE COUNT 528 a 839 c 674 g 552 t 2 others ORIGIN 1 bp upstream of XbaI site. 1 tctagaccac cagcctggac aacataccaa gaccctgtct ctacaaataa atagataaat 61 aaatagacac tttttttaag tgtcaaaagt gcttggcact tagtagacca tcagtgttag 121 gtgctcatac ataccccgat tattgccttg tcccagtgtc ttgtacaggg gttggagagn 181 aggtgttaag aaatgaccga atgggtaaat ggatgaacag aacacctccc tccagagccc 241 acatgctcgt gggcctctgg gaccactctc ctcctcctct tgcttccctg agctccccca 301 gcatggcctc tgtccaggcc ttgcgctgcc tccaggcctt tgctgtggct actgcccctg 361 gagcgccatn tccacagctc ctcctgtggc tggctcctca tcacccagat gacctggtgg 421 gtgaggccac ctagcaagga gtcatgcctg tcctgccttc tgactcactc tctcatcacc 481 ctgccttttt tttcttttgt ggctcacgtg tttgcatgtc tccccccatg aggcaggggg 541 ccatgtgtgt cttattcact tctgtagcca cagcaccctg agcaatgctt gccacatagt 601 aggtgctcaa ttaatgttga atgaatgggc aaaatgcggg atggcgggac agagttctct 661 caaggcattc tgccagagaa tgtccctctg tcaccttgaa tccagtgtac ctccagatga 721 ctcccccatt ccctcctgta gttcatgctt ttctctcccc ttcctcccca gacacggcct 781 acccacccct ggcaaccaac atggccaact tcacacctgt caatggcagc tcgggcaatc 841 agtccgtgcg cctggtcacg tcatcatccc acaatcgcta tgagacggtg gaaatggtct 901 tcattgccac agtgacaggc tccctgagcc tggtgactgt cgtgggcaac atcctggtga 961 tgctgtccat caaggtcaac aggcagctgc agacagtcaa caactacttc ctcttcagcc 1021 tggcgtgtgc tgatctcatc ataggcgcct tctccatgaa cctctacacc gtgtacatca 1081 tcaagggcta ctggcccctg ggcgccgtgg tctgcgacct gtggctggcc ctggactacg 1141 tggtgagcaa cgcctccgtc atgaaccttc tcatcatcag ctttgaccgc tacttctgcg 1201 tcaccaagcc tctcacctac cctgcccggc gcaccaccaa gatggcaggc ctcatgattg 1261 ctgctgcctg ggtactgtcc ttcgtgctct gggcgcctgc catcttgttc tggcagtttg 1321 tggtgggtaa gcggacggtg cccgacaacc actgcttcat ccagttcctg tccaacccag 1381 cagtgacctt tggcacagcc attgctgcct tctacctgcc tgtggtcatc atgacggtgc 1441 tgtacatcca catctccctg gccagtcgca gccgagtcca caagcaccgg cccgagggcc 1501 cgaaggagaa gaaagccaag acgctggcct tcctcaagag cccactaatg aagcagagcg 1561 tcaagaagcc ccgcccggga ggccgcccgg gaggactgcg caatggcaag ctggaggagg 1621 cccccccgcc agcgctgcca ccgccaccgc gccccgtggc tgataaggac acttccaatg 1681 agtccagctc aggcagtgcc acccagaaca ccaaggaacg cccagccaca gagctgtcca 1741 ccacagaggc caccactccc gccatgcccg cccctcccct gcagccgcgg gccctcaacc 1801 cagcctccag atggtccaag atccagattg tgacgaagca gacaggcaat gagtgtgtga 1861 cagccattga gattgtgcct gccacgccgg ctggcatgcg ccctgcggcc aacgtggccc 1921 gcaagttcgc cagcatcgct cgcaaccagg tgcgcaagaa gcggcagatg gcggcccggg 1981 agcgcaaagt gacacgaacg atctttgcca ttctgctagc cttcatcctc acctggacgc 2041 cctacaacgt catggtcctg gtgaacacct tctgccagag ctgcatccct gacacggtgt 2101 ggtccattgg ctactggctc tgctacgtca acagcaccat caaccctgcc tgctatgctc 2161 tgtgcaacgc cacctttaaa aagaccttcc ggcacctgct gctgtgccag tatcggaaca 2221 tcggcactgc caggtaggca ggcaggagtg ccctaggagg tgcggtgtgc gtgcgtgtgc 2281 tgggggacca cacggctcac ttgctgtggg gaagagtgca ggcaccattc tgcgttcacg 2341 tttgctgagg aggaagttca gaagaggctc tgtggctgca ttcagagacc agatctctgc 2401 tcacccgtga ggaggctcac cccagggagt gtctgaactg gggctgcctg gcccacctct 2461 gtggccctgc ttcagcgagc tgcggggcac tggcctgggt gggcacctgc ccactgtgac 2521 caaccatcag cagtgctgga agaatggaga tctggatggg ggccgaagcc cagggccccc 2581 tcaggaagaa caaag // LOCUS HUMADHVII 1125 bp DNA PRI 11-DEC-1996 DEFINITION Homo sapiens alcohol dehydrogenase VII (ADH7) gene, complete cds. ACCESSION L47166 NID g975603 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1125) AUTHORS Yokoyama,H., Baraona,E. and Lieber,C.S. TITLE Molecular cloning and chromosomal localization of the ADH7 gene encoding human class IV (sigma) ADH JOURNAL Genomics 31 (2), 243-245 (1996) MEDLINE 96422193 FEATURES Location/Qualifiers source 1..1125 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="DuPont Merck Phanaceutical Co (DMPC-HFF#1)" /cell_type="fibroblast" /tissue_type="foreskin" /map="4q23-24" gene 1..1125 /gene="ADH7" CDS 1..1125 /gene="ADH7" /codon_start=1 /db_xref="GDB:G00-362-911" /product="alcohol dehydrogenase 7" /db_xref="PID:g975604" /translation="MGTAGKVIKCKAAVLWEQKQPFSIEEIEVAPPKTKEVRIKILAT GICRTDDHVIKGTMVSKFPVIVGHEATGIVESIGEGVTTVKPGDKVIPLFLPQVRECN ACRNPDGNLCIRSDITGRGVLADGTTRFTCKGKPVHHFMNTSTFTVITVVDESSEAKI DDAAPPEKVCLIGCGFSTGYGAAVKTGKVKPGSTCVVFGLGGVGVSVIMGCKSAGASR IIGIDLNKDKFEKAMAVGATECISPKDSTKPISEVLSEMTGNNVGYTFEVIGHLETMI DALASCHMNYGTSVVVGVPPSAKMLTYDPMLLFTGRTWKGCVFGGLKSRDDVPKLVTE FLAKKFDLDQLITHVLPFKKISEGFELLNSGQSIRTVLTF" BASE COUNT 314 a 237 c 291 g 283 t ORIGIN 1 atgggcactg ctggaaaagt tattaagtgc aaagcagctg tgctttggga gcagaagcaa 61 cccttctcca ttgaggaaat agaagttgcc ccaccaaaga ctaaagaagt tcgcattaag 121 attttggcca caggaatctg tcgcacagat gaccatgtga taaaaggaac aatggtgtcc 181 aagtttccag tgattgtggg acatgaggca actgggattg tagagagcat tggagaagga 241 gtgactacag tgaaaccagg tgacaaagtc atccctctct ttctgccaca agtgagagaa 301 tgcaatgctt gtcgcaaccc agatggcaac ctttgcatta ggagcgatat tactggtcgt 361 ggagtactgg ctgatggcac caccagattt acatgcaagg gcaaaccagt ccaccacttc 421 atgaacacca gtacatttac cgtgatcaca gtggtggatg aatcttctga agctaagatt 481 gatgatgcag ctcctcctga gaaagtctgt ttaattggct gtgggttttc cactggatat 541 ggcgctgctg ttaaaactgg caaggtcaaa cctggttcca cttgcgtcgt ctttggcctg 601 ggaggagttg gcgtgtcagt catcatgggc tgtaagtcag ctggtgcatc taggatcatt 661 gggattgacc tcaacaaaga caaatttgag aaggccatgg ctgtaggtgc cactgagtgt 721 atcagtccca aggactctac caaacccatc agtgaggtgc tgtcagaaat gacaggcaac 781 aacgtgggat acacctttga agttattggg catcttgaaa ccatgattga tgccctggca 841 tcctgccaca tgaactatgg gaccagcgtg gttgtaggag ttcctccatc agccaagatg 901 ctcacctatg acccgatgtt gctcttcact ggacgcacat ggaagggatg tgtctttgga 961 ggtttgaaaa gcagagatga tgtcccaaaa ctagtgactg agttcctggc aaagaaattt 1021 gacctggacc agttgataac tcatgtttta ccatttaaaa aaatcagtga aggatttgag 1081 ctgctcaatt caggacaaag cattcgaacg gtcctgacgt tttga // LOCUS HUMADRA 1521 bp DNA PRI 30-OCT-1994 DEFINITION Human platelet alpha-2-adrenergic receptor gene, complete cds. ACCESSION M18415 NID g178191 KEYWORDS alpha-2-adrenergic receptor; alpha-adrenergic receptor. SOURCE Human (lambda-EMBL 3 library) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1521) AUTHORS Kobilka,B.K., Matsui,H., Kobilka,T.S., Yang-Feng,T.L., Francke,U., Caron,M.G., Lefkowitz,R.J. and Regan,J.W. TITLE Cloning, sequencing, and expression of the gene coding for the human platelet alpha 2-adrenergic receptor JOURNAL Science 238 (4827), 650-656 (1987) MEDLINE 88042789 FEATURES Location/Qualifiers source 1..1521 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q23-q25" gene 59..1411 /gene="ZNF32" CDS 59..1411 /gene="ZNF32" /note="alpha-2-adrenergic receptor old gene name 'ADRA2R'" /codon_start=1 /db_xref="GDB:G00-125-339" /db_xref="PID:g178192" /translation="MGSLQPDAGNASWNGTEAPGGGARATPYSLQVTLTLVCLAGLLM LLTVFGNVLVIIAVFTSRALKAPQNLFLVSLASADILVATLVIPFSLANEVMGYWYFG KTWCEIYLALDVLFCTSSIVHLCAISLDRYWSITQAIEYNLKRTPRRIKAIIITCWVI SAVISFPPLISIEKKGGGGGPQPAEPRCEINDQKWYVISSCIGSFFAPCLIMILVYVR IYQIAKRRTRVPPSRRGPDAVAAPPGGTERRPNGLGPERSAGPGGAEAEPLPTQLNGA PGEPAPAGPRDTDALDLEESSSSDHAERPPGPRRPERGPRGKGKARASQVKPGDSLRG AGRGRRGSGRRLQGRGRSASGLPRRRAGAGGQNLEKRFTFVLAVVIGVFVVCWFPFFF TYTLTAVGCSVPRTLFKFFFWFGYCNSSLNPVIYTIFNHDFRRAFKKILCRGDRKRIV " BASE COUNT 223 a 546 c 499 g 253 t ORIGIN Chromosome 10q23-q25. 1 cccgccttca tcttccgcca ggaggccaag gccgttggcc gagggcagct ttgcgcccat 61 gggctccctg cagccggacg cgggcaacgc gagctggaac gggaccgagg cgccgggggg 121 cggcgcccgg gccacccctt actccctgca ggtgacgctg acgctggtgt gcctggccgg 181 cctgctcatg ctgctcaccg tgttcggcaa cgtgctcgtc atcatcgccg tgttcacgag 241 ccgcgcgctc aaggcgcccc aaaacctctt cctggtgtct ctggcctcgg ccgacatcct 301 ggtggccacg ctcgtcatcc ctttctcgct ggccaacgag gtcatgggct actggtactt 361 cggcaagact tggtgcgaga tctacctggc gctcgacgtg ctcttctgca cgtcgtccat 421 cgtgcacctg tgcgccatca gcctggaccg ctactggtcc atcacacagg ccatcgagta 481 caacctgaag cgcacgccgc gccgcatcaa ggccatcatc atcacctgtt gggtcatctc 541 ggccgtcatc tccttcccgc cgctcatctc catcgagaag aagggcggcg gcggcggccc 601 gcagccggcc gagccgcgct gcgagatcaa cgaccagaag tggtacgtca tctcgtcgtg 661 catcggctcc ttcttcgctc cctgcctcat catgatcctg gtctacgtgc gcatctacca 721 gatcgccaag cgtcgcaccc gcgtgccacc cagccgccgg ggtccggacg ccgtcgccgc 781 gccgccgggg ggcaccgagc gcaggcccaa cggtctgggc cccgagcgca gcgcgggccc 841 ggggggcgca gaggccgaac cgctgcccac ccagctcaac ggcgcccctg gcgagcccgc 901 gccggccggg ccgcgcgaca ccgacgcgct ggacctggag gagagctcgt cttccgacca 961 cgccgagcgg cctccagggc cccgcagacc cgagcgcggt ccccggggca aaggcaaggc 1021 ccgagcgagc caggtgaagc cgggcgacag cctgcgcggc gcgggccggg ggcgacgggg 1081 atcgggacgc cggctgcagg gccgggggag gagcgcgtcg gggctgccaa ggcgtcgcgc 1141 tggcgcgggc gggcagaacc tcgagaagcg cttcacgttc gtgctggccg tggtcatcgg 1201 agtgttcgtg gtgtgctggt tccccttctt cttcacctac acgctcacgg ccgtcgggtg 1261 ctccgtgcca cgcacgctct tcaaattctt cttctggttc ggctactgca acagctcgtt 1321 gaacccggtc atctacacca tcttcaacca cgatttccgc cgcgccttca agaagatcct 1381 ctgtcggggg gacaggaagc ggatcgtgtg aggtttccgc tggcgcccgc gtagactcac 1441 gctgactgca ggcagcgggg ggcatcgagg ggtgcttagc ccgagggcac tcagaaaccc 1501 gggcgctgct gctctgcgtt t // LOCUS HUMAGG 4668 bp DNA PRI 30-OCT-1994 DEFINITION Human angiogenin gene, complete cds, and three Alu repetitive sequences. ACCESSION M11567 NID g178249 KEYWORDS Alu repeat; angiogenin; repeat region. SOURCE Human DNA, library of Maniatis et al., clone lambda-HAG1, and liver, cDNA to mRNA, clone pHAG1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4668) AUTHORS Kurachi,K., Davie,E.W., Strydom,D.J., Riordan,J.F. and Vallee,B.L. TITLE Sequence of the cDNA and gene for angiogenin, a human angiogenesis factor JOURNAL Biochemistry 24 (20), 5494-5499 (1985) MEDLINE 86077688 COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by K.Kurachi, 29-MAY-1986. There is only one gene encoding angiogenin in the human genome. The signal peptide could start at position 1815 instead of 1809. FEATURES Location/Qualifiers source 1..4668 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q11" repeat_region 1082..1398 /note="Alu repeat copy A" mRNA 1697..2425 /note="angiogenin mRNA" CDS 1809..2252 /gene="ANG" /codon_start=1 /db_xref="GDB:G00-119-679" /product="angiogenin" /db_xref="PID:g178250" /translation="MVMGLGVLLLVFVLGLGLTPPTLAQDNSRYTHFLTQHYDAKPQG RDDRYCESIMRRRGLTSPCKDINTFIHGNKRSIKAICENKNGNPHRENLRISKSSFQV TTCKLHGGSPWPPCQYRATAGFRNVVVACENGLPVHLDQSIFRRP" sig_peptide 1809..1880 /gene="ANG" /note="angiogenin signal peptide; G00-119-679" gene 1809..2252 /gene="ANG" mat_peptide 1881..2249 /gene="ANG" /note="angiogenin; G00-119-679" repeat_region 2524..2855 /note="Alu repeat copy B" repeat_region 3351..3663 /note="Alu repeat copy C" BASE COUNT 1247 a 1091 c 982 g 1348 t ORIGIN 132 bp upstream of XbaI site. 1 tgtttgcatt aagttcatag attataattt gtaatggaat caacaccaaa tgcaaattag 61 aaagagagcc cactttgctc acccagtcac gtcttcccat gtaaccatag aacgttgggg 121 tcctgtgtct ttctagatcc acagtcttgc tctcagaaca ggctagccac accacaggcc 181 tagtgccagg acccatggcc tttttttaag ctcagactcc cttctgtgaa cagcaatatc 241 cccacaactt gtacaacatt ggtgcttcct gcaagggcta cagaactatt tgatacgaaa 301 atgttcattg acttacacac aagagaagca caaaataaaa aattaataat taatttaatg 361 tctttgaaaa tgtaccattt atttttacat ttggggtcat aagaattgta ttacacttaa 421 gaatgcaata caatttgaag atcagatttt tctccctttg tgagaatttc tcagtatgtg 481 tgatgactac caagaaatca tagccagtca taaattcagt gagttactca taaacgaaca 541 agaaccacct acttcttggg gaggtaggtc tgcttccctt caactcagga tacaactgct 601 ttcaactgct ttcttcacat tagctgacta attagctaga agcctgtcgt aaacaatttt 661 atggttgact ccttccctgg gctcagggtt ccctagaaca gagaggtccc caaatcccgg 721 tctgtggcct gtccgcctaa gctctgcctc ctgccagatc agcaggcagc attagattct 781 cataggagct ggacgcctat tgtgaactgc gcatgtgcgg gatccagatt gtgcactctt 841 tatgagaatc taactaatgc ttgatgatct atctgaacca gaacaatttc atcctgaaac 901 catcccccac caatccatag aaatactgtc ttccacaaaa atgatccctg gtgccaaaaa 961 tgttagagac cactccccta aaactctctt cttagctctc acctcctgta ttactatctc 1021 atctcagtac attgaagccc ccatcttttc cccatggatg cctcatttcc tattagggag 1081 gcattttttt attttttgtt tttatttttt tccgagacgg agtctcgctc tgtcgccaag 1141 gctggagtgc agtggcgcga tctcggctca ctgcaagctc cgcctcccgg gttcacgcca 1201 ttctcctgcc tcagcctccc aagtagctgg gactacaggc gcccgcacta cgcccggcta 1261 attttttgta tttttagtag agacggggtt tcaccgtggt agccaggatg gtctcgatct 1321 cctgacctcg tgatccgccc gccttggcct cccaaagtgc tgggattaca ggcgtgagac 1381 cgcgcccggc cgtcatttgg tatgtcttaa tgtgcctcag gacctagcac agtccctggt 1441 acccagtaga gacctatgta atgttcgtta ttcaataata aatacatgaa ttaaagagtg 1501 agagtggatt ttgtaatgtt acgactgata gagaaatact cagtgattct aagggatggg 1561 gaagaacggt tggagctaga ggttgtgctc aggaaactat taaatagacg ttccgcagga 1621 agggattgac gaagtgtgag gttaatgagg aagggaaaat agaatataaa atttggtggt 1681 ggaaaagatc tgattcatga tgccgtgtca gagagcaaag ctcctgtcct tttggcctaa 1741 tttggtgatg ctgttcttgg gtctaccaca cctccttttg ccctccgcag gagcctgtgt 1801 tggaagagat ggtgatgggc ctgggcgttt tgttgttggt cttcgtgctg ggtctgggtc 1861 tgaccccacc gaccctggct caggataact ccaggtacac acacttcctg acccagcact 1921 atgatgccaa accacagggc cgggatgaca gatactgtga aagcatcatg aggagacggg 1981 gcctgacctc accctgcaaa gacatcaaca catttattca tggcaacaag cgcagcatca 2041 aggccatctg tgaaaacaag aatggaaacc ctcacagaga aaacctaaga ataagcaagt 2101 cttctttcca ggtcaccact tgcaagctac atggaggttc cccctggcct ccatgccagt 2161 accgagccac agcggggttc agaaacgttg ttgttgcttg tgaaaatggc ttacctgtcc 2221 acttggatca gtcaattttc cgtcgtccgt aaccagcggg cccctggtca agtgctggct 2281 ctgctgtcct tgccttccat ttcccctctg cacccagaac agtggtggca acattcattg 2341 ccaagggccc aaagaaagag ctacctggac cttttgtttt ctgtttgaca acatgtttaa 2401 taaataaaaa tgtcttgata tcagtaagaa tcagagtctt ctcactgatt ctgggcatat 2461 tgatctttcc cccattttct ctacttggct gctccctgag aggactgcat aggatagaaa 2521 tgcctttttc ttttcttttc gttttttttt tttttttttt ttgagatgga gtctcactct 2581 gtcgcccagg cttaagtgca atggcacaat ctcggctcac tgcaacctct ctctcctggg 2641 ttcaagtgat tctcctgcct cagcctccca aatagctgag attacaggca tgcaccacca 2701 cacctggcta atttttgtgt ttttagtaga gacagggttt caccgttttg gccaggttgg 2761 tcttgaactc ctgacctcgg gagatccgcc caccttggcc tctctttgtg ctgggattac 2821 aggcatgagc cactgagccg ggccactttt tccttatcag tcagttttta caagtcatta 2881 gggaggtaga ctttacctct ctgtgaagga aagtatggta tgttgatcta cagagagaga 2941 tggaaaaatt ccagggctcg tagctactaa gcagaatttc caagataggc aaattgtttt 3001 ttctgtcaaa taataagcta atattacttc tacaaatatg agaccttgga gagaagtttc 3061 caaggaccaa gtaccaacat accaacagat tattatagtt tctctcactc ttacacacac 3121 acacacacat atacacatat gtaatccagc atgaatacca aaattcattc agggtagcca 3181 ccttttgtct taatcgagag ataattttga tgtttgaatg gaatgctccc aggatattct 3241 cttgtcatgg ttattttata taaaattcaa aaaccaatta cattatttcc tctgtaatct 3301 tttactttat caactaatgt ctggcaagtg tgatgttttg gggaagttat agaagattcc 3361 ggccaggcgc ttatctcacg cttgtaatcc agcactttgg gaagctgagg cggacagatc 3421 acgaggtcaa gagatcaaga ccatcctgga caacatggtg aaaccttgtc tctactaaaa 3481 atgtgaaaat tagctgggcg tggtggcaca cacctatagt cccagctact cgggaggctg 3541 aggcaggaga atcgcttgaa cctaggaggc ggaggttgca ctgagccgag atcacgccac 3601 tgcactccag cctgggcgac agagcgagac tccatctcaa aaaaaaaaaa aaaagaaaga 3661 tcccagttta tcccagttta tcccttattc ttcctcaatt ctcaagattt gtttttaagt 3721 taacataact taggttaaca cactctttgt aaaatacact gttcaatcta cagactcagt 3781 ggttagcttc ctgttaacta atttctgttg acaggtactt ggatatttta tttagaaagt 3841 ggttgccaat aaattagtta taagtcgcca gtttcactgc cttgtgaaca cataattatt 3901 gtggtctcag tattccctat ggtggcttct cctgctcctg gtattgccct gaaatgggcc 3961 aaaagccgtg gctccccaat gctcaggtta tagaacattg tccaggtacc acctaggaga 4021 gcccagcctc actgaaagta ttcaaattta ggaatgggtt tgagaagtag gtagctggta 4081 tgtgcttagc acaagaatct ctcttccttg ggttagtctg tttcaaaact gaaaacactg 4141 tcattcctta agaaaatagg aaaaagtatt ccaaacctct gtcactagaa aatttgccat 4201 attaccaaat ctcaaaaacc tctcaggaaa tgagaaagtc ccagtttctg gtaaactatt 4261 tgggcccttt tctcaagttc tccttccagt gctatttcct tgaggtgagg caaagttact 4321 caagatcatc gctgccactc aaggccttga tagggcaagt gaaaggcatg gaccattatt 4381 atattgatca cagcataagc tgtgaaaacc cacatcttct ccaaacatct gcttggagca 4441 ttatcatcgc atagtttgct ctggtgttca gggaaatcgc tgtttcatag gaaatcacat 4501 ggcagtggga tgggagtgtt tcctgacctg ccgatggtac tggcacctga gcaagcattc 4561 ctagtccttt ttggtctggg cctcttgttc tatcacaacc acaagctgtt taaaataaaa 4621 acgtcaagtc acaggcaggt cattttatcc tgcgtgaatc aattgaag // LOCUS HUMAIR 1816 bp DNA PRI 27-JAN-1996 DEFINITION Homo Sapiens angiotensin II receptor gene, complete cds. ACCESSION L48211 NID g1160612 KEYWORDS angiotensin; angiotensin II; angiotensin II receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1816) AUTHORS Razdan,K. and Kroll,M.H. TITLE Molecular cloning of a novel platelet protein showing homology to the angiotensin II receptor C-terminal domain JOURNAL J. Biol. Chem. 271 (4), 2221-2224 (1996) MEDLINE 96147204 FEATURES Location/Qualifiers source 1..1816 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1481..1696 /codon_start=1 /product="angiotensin II receptor" /db_xref="PID:g1160613" /translation="MHSSFCSLGDRAACSVITASELKSHRSPDSRSFLMHSSQSQLRQ YSRLYVLWQREADEHSFREKADGKPVS" BASE COUNT 477 a 415 c 461 g 463 t ORIGIN 1 gggtaaaacc atttgtttaa ttctaaatca aatcactttc acaacagtga aaattagtga 61 ctggttaagg tgtgccactg tacatatcat cattttctga ctggggtcag gacctggtcc 121 tagtccacaa gggtggcagg aggagggtgg aggctaagaa cacagaaaac acacaaaaga 181 aaggaaagct gccttggcag aaggatgagg tggtgagctt gccgagggat ggtgggaagg 241 gggctccctg ttggggccga gccaggagtc ccaagtcagc tctcctgcct tacttagctc 301 ctggcagagg gtgagtgggg acctacgagg ttcaaaatca aatggcattt ggccaggctg 361 gctttactaa caggttccca gagtgcctct gttggctgag ctctcctggg ctactcattt 421 cattgaagag tccaaatgat tcattttcct acccacaact tttcattatt cttctggaaa 481 cccatttctg ttgagtccat ctgacttaag tcctctctcc ctccactagt tggggccact 541 gcactgaggg gggtcccacc aattctctct agagaagaga cactccagag gcccctgcaa 601 ctttgcggat ttccagaagg tgataaaaag agcactcttg agtgggtgcc caggaatgtt 661 taaaatctat caggcacact ataaagctgg tggtttcttc ctaccaagtg gattcggcat 721 atgaaccacc tactcaatac tttatatttt gtctgtttaa acactgaact ctggtgttga 781 caggtacaag gagaagagat ggggactatg aagaggggag ggcttccctc atcttcctca 841 agatctttgt ttccacaaac tatgcagtca taatttgaga aaaagcaata gatggggctt 901 cctaccattt gttggttatt gctggggtta gccaggagca gtgtggatgg caaagtagga 961 gagaggccca gaggaaagcc catctccctc cagctttggg gtctccagaa agaggctgga 1021 tttctgggat gaagcctaga aggcagagca agaactgttc caccaggtga acagtcctac 1081 ctgcttggta ccatagtccc tcaataagat tcagaggaag aagcttatga aactgaaaat 1141 caaatcaagg tattgggaag aataatttcc cctcgattcc acaggaggga agaccacaca 1201 atatcattgt gctggggctc cccaaggccc tgccacctgg ctttacaaat catcaggggt 1261 tgcctgcttg gcagtcacat gcttccctgg ttttagcaca catacaagga gttttcaggg 1321 aactctatca agccatacca aaatcagggt cacatgtggg tttccccttt ccttgcctct 1381 tcataaaaga caacttggct tctgaggatg gtggtctttt gcatgcagtt gggctgacct 1441 gacaaagccc ccagtttcct gtggcaggtt ctgggagagg atgcattcaa gcttctgcag 1501 cctaggggac agggctgctt gttcagttat tactgcctcg gagctcaaat cccaccgaag 1561 tcctgactcc aggtctttcc taatgcacag tagtcagtct cagcttcggc agtattctcg 1621 gctgtatgtt ctctggcaga gagaggcaga tgaacatagt tttagggaga aagctgatgg 1681 gaaacctgtg agttaagcca catgtctcac caggaataat ttatgccagg aaaccaggaa 1741 gtcattcaag ttgttctctg aggccaaaga cactgagcac agcccagagc caataaaaga 1801 tctttgagtc tctggt // LOCUS HUMANONYMO 2754 bp DNA PRI 31-DEC-1994 DEFINITION Human anonymous gene, complete cds. ACCESSION L18972 NID g388011 KEYWORDS . SOURCE Homo sapiens (tissue library: Stratagene #936206) fetus DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2754) AUTHORS Xie,Y.G., Han,F.Y., Peyrard,M., Ruttledge,M.H., Fransson,I., DeJong,P., Collins,J., Dunham,I., Nordenskjold,M. and Dumanski,J.P. TITLE Cloning of a novel, anonymous gene from a megabase-range YAC and cosmid contig in the neurofibromatosis type 2/meningioma region on human chromosome 22q12 JOURNAL Hum. Mol. Genet. 2 (9), 1361-1368 (1993) MEDLINE 94061029 FEATURES Location/Qualifiers source 1..2754 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_lib="Stratagene #936206" /map="22q12" gene 35..2086 /gene="anonymous" CDS 35..2086 /gene="anonymous" /codon_start=1 /db_xref="PID:g388012" /translation="MSSESSKKRKPKVIRSDGAPAEGKRNRSDTEQEGKYYSEEAEVD LRDPGRDYELYKYTCQELQRLMAEIQDLKSRGGKDVAIEIEERRIQSCVHFMTLKKLN RLAHIRLKKGRDQTHEAKQKVDAYHLQLQNLLYEVMHLQKEITKCLEFKSKHEEIDLV SLEEFYKEAPPDISKAEVTMGDPHQQTLARLDWELEQRKRLAEKYRECLSNKEKILKE IEVKKEYLSSLQPRLNSIMQASLPVQEYLFMPFDQAHKQYETARHLPPPLYVLFVQAT AYGQACDKTLSVAIEGSVDEAKALFKPPEDSQDDESDSDAEEEQTTKRRRPTLGVQLD DKRKEMLKRHPLSVMLDLKCKDDSVLHLTFYYLMNLNIMTVKAKVTTAMELITPISAG DLLSPDSVLSCLYPGDHGKKTPNPANQYQFDKVGILTLSDYVLELGHPYLWVQKLGGL HFPKEQPQQTVIADHSLSASHMETTMKLLKTRVQSRLALHKQFASLEHGIVPVTSDCQ YLFPAKVVSRLVKWVTIAHEDYMELHFTKDIVDAGLAGDTNLYYMALIERGTAKLQAA VVLNPGYSSIPPIFQLCLNWKGEKTNSNDDNIRAMEGEVNVCYKELCGPWPSHQLLTN QLQRLCVLLDVYLETESHDDSVEGPKEFPQEKMCLRLFRGPSRMKPFKYNHPQGFFSH R" BASE COUNT 744 a 694 c 723 g 593 t ORIGIN 1 tgccaggttc ttggagctgt gaggaggaac aaccatgtca tcagaatcga gcaaaaaacg 61 gaagcccaaa gtgatccgaa gcgatggagc cccagctgaa ggaaagcgga atcgatctga 121 caccgagcag gaaggtaaat actacagtga ggaggccgag gtggatctgc gggaccctgg 181 cagagactat gagttataca agtacacctg ccaggagcta cagaggctga tggctgagat 241 ccaagacctg aagagcaggg gtggcaagga tgtggcaata gaaatagaag aacggaggat 301 ccagagctgt gtgcatttca tgactctaaa gaagcttaac cgattagccc acatcaggtt 361 gaagaaagga agagatcaga cccacgaggc taagcagaaa gtagatgcct atcatctgca 421 gctccagaac ctgttgtatg aggtgatgca cctacagaag gagatcacca aatgtttgga 481 gtttaagtca aagcatgaag aaattgatct ggtcagttta gaggagtttt ataaggaggc 541 tccaccagat atcagcaagg ccgaagtcac catgggagac cctcaccagc aaacactggc 601 acgtctggac tgggagctgg agcagcggaa aaggctggca gagaagtacc gagagtgcct 661 atctaacaag gagaagattc tcaaggagat tgaggtgaag aaggagtacc tgagcagcct 721 ccagccccgc ctcaacagca tcatgcaggc ttcccttccg gtgcaggagt acctgtttat 781 gccattcgac caggctcaca agcagtatga gacagccaga cacctgccgc ctcccctcta 841 tgtcctcttt gttcaggcca ctgcgtatgg gcaggcctgt gataagacgt tatctgtggc 901 aatcgaaggc agtgtggatg aagccaaggc tctgttcaaa cctccagagg actcccaaga 961 tgacgaaagt gactcagatg ccgaggagga gcagactacg aagcgccgga gacccacact 1021 gggggttcag ttggacgaca aacgcaagga gatgctgaag aggcacccac tgtctgtcat 1081 gctcgacctg aagtgcaaag atgacagtgt gcttcacctg actttctact acctcatgaa 1141 cctcaacatc atgacagtaa aagccaaagt gacaactgcc atggagctga tcacccccat 1201 cagtgcaggt gacttgctgt ctcctgactc agtcctgagt tgcttgtatc ctggggatca 1261 tggaaagaaa actccgaatc cagccaatca gtatcagttt gataaagttg gcatcctgac 1321 tttgagcgac tatgtacttg agctaggtca cccctatttg tgggtgcaga agctgggtgg 1381 cctccacttc cccaaagagc agccccagca aacagtgatt gctgaccact cgctgagcgc 1441 cagccacatg gagaccacca tgaaacttct gaagaccagg gtgcagtccc gcctggccct 1501 ccacaaacag tttgcatccc tagaacatgg cattgtgcca gttaccagtg attgccagta 1561 cctcttccct gccaaggttg tctctcgcct ggtgaaatgg gtgacaattg cccatgagga 1621 ttacatggag ctgcacttca ccaaagacat tgtggatgcg ggactggctg gggacaccaa 1681 tctctactac atggcgctca tcgaaagggg cacagccaaa ctgcaggccg ctgtggtgtt 1741 gaaccctggc tactcctcca tcccacctat tttccagctc tgtttgaact ggaaagggga 1801 gaaaaccaac agcaacgatg acaacattcg ggccatggag ggcgaagtca atgtgtgcta 1861 caaggagctg tgtggccctt ggcccagcca ccagctgttg accaaccagc tgcagcggct 1921 gtgtgtgctg ctggatgttt acctggagac cgagagccat gacgacagtg tggaggggcc 1981 caaggaattt ccccaggaga agatgtgtct gcggctcttc aggggtccta gcaggatgaa 2041 gccatttaaa tacaaccatc ctcagggatt cttcagccat cgctgatctc ccgcgcagac 2101 cgttgtttcc cccaaggcct caccctgagc actgggcttc tgctttctgc tctggcccac 2161 atgtgactct tgatattctc caaagacacc agccaattaa aaagcgtcac ctgaccagta 2221 gcctttgtct gtggttcctg gcaaggtggc tttgcagtct ggaagggcag gtgggagctg 2281 tgacacagtg tgaaaaagca tttgtagaga gactttttct cagcagccaa taaaagcaga 2341 gtggaaaaag attccaattc tgcagagaga tgctcacctc ttgtctacgc acaccctatt 2401 tgtgctttgc ggggtgaggt cctcatgatc ttgtatttat tatcccaagt tcctgctgtt 2461 aagaggtggt aggagaagcc aaaggcagca gagcacaaaa agcaaaactc ttccctcccc 2521 acccgctctt cccattagtc ctgtcagggt tgccgatgga caaattgtct ctgatcgttg 2581 gatgttataa atgtctgaca gtgcagtgca aacagaagac aaactcagtt gatccttgaa 2641 caactcaggg gttaggggca ccaacacccc ctgccctgca cagttgaaaa atccgtgtat 2701 aacttttgac tccctaaaaa cttaactaat agcctgctgt tgaccagtag tatg // LOCUS HUMASPA 584 bp DNA PRI 27-FEB-1996 DEFINITION Homo sapiens agouti signalling protein (ASP) gene, complete cds. ACCESSION L37019 NID g608647 KEYWORDS agouti signalling protein; homologue. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 584) AUTHORS Wilson,B.D., Ollmann,M.M., Kang,L., Stoffel,M., Bell,G.I. and Barsh,G.S. TITLE Structure and function of ASP, the human homolog of the mouse agouti gene JOURNAL Hum. Mol. Genet. 4 (2), 223-230 (1995) MEDLINE 95276734 FEATURES Location/Qualifiers source 1..584 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1..170 /gene="ASP" gene 1..584 /gene="ASP" CDS 11..409 /gene="ASP" /note="mouse homologue" /codon_start=1 /product="agouti signaling protein" /db_xref="PID:g608648" /translation="MDVTRLLLATLLVFLCFFTANSHLPPEEKLRDDRSLRSNSSVNL LDVPSVSIVALNKKSKPIGRKAAEKKRSSKKEASMKKVVRPRTPLSAPCVATRNSCKP PAPACCDPCASCQCRFFRSACSCRVLSLNC" exon 171..232 /gene="ASP" exon 233..584 /gene="ASP" BASE COUNT 117 a 191 c 169 g 107 t ORIGIN 1 gcctcctggg atggatgtca cccgcttact cctggccacc ctgctggtct tcctctgctt 61 cttcactgcc aacagccacc tgccacctga ggagaagctc cgagatgaca ggagcctgag 121 aagcaactcc tctgtgaacc tactggatgt cccttctgtc tctattgtgg cgctgaacaa 181 gaaatccaaa ccgatcggca gaaaagcagc agaaaagaaa agatcttcta agaaggaggc 241 ttcgatgaag aaagtggtgc ggccccggac ccccctatct gcgccctgcg tggccacccg 301 caacagctgc aagccgccgg cacccgcctg ctgcgacccg tgcgcctcct gccagtgccg 361 cttcttccgc agcgcctgct cctgccgcgt gctcagcctc aactgctgag cgcccccact 421 cccggccgcg agcaggcagg gcttcgggga cgcggggcgc ttctcgggcg ggtgatccct 481 aacagggcgg cttcccaggg ctgcaggcgg gcggaggttc caggagatgg gacttcaggg 541 agacctggct tgggctaaaa tcgaaataca atatatatag gctg // LOCUS HUMAT1A 1829 bp DNA PRI 21-JUL-1992 DEFINITION Human angiotensinogen II type-1A receptor gene, complete cds. ACCESSION M91464 NID g179121 KEYWORDS angiotensin II type-1A receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1829) AUTHORS Mauzy,C.A., Hwang,O., Egloff,A.M., Wu,L.-H. and Chung,F.-Z. TITLE Cloning, expression, and characterization of a gene encoding the human angiotensin II type1A receptor JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1829 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" 5'UTR 1..684 CAAT_signal 88..96 TATA_signal 144..149 CDS 685..1764 /codon_start=1 /product="angiotensinogen II type-1A receptor" /db_xref="PID:g179122" /translation="MILNSSTEDGIKRIQDDCPKAGRHNYIFVMIPTLYSIIFVVGIF GNSLVVIVIYFYMKLKTVASVFLLNLALADLCFLLTLPLWAVYTAMEYRWPFGNYLCK IASASVSFNLYASVFLLTCLSIDRYLAIVHPMKSRLRRTMLVAKVTCIIIWLLAGLAS LPAIIHRNVFFIENTNITVCAFHYESQNSTLPIGLGLTKNILGFLFPFLIILTSYTLI WKALKKAYEIQKNKPRNDDIFKIIMAIVLFFFFSWIPHQIFTFLDVLIQLGIIRDCRI ADIVDTAMPITICIAYFNNCLNPLFYGFLGKKFKRYFLQLLKYIPPKAKSHSNLSTKM STLSYRPSDNVSSSTKKPAPCFEVE" BASE COUNT 538 a 361 c 319 g 611 t ORIGIN 1 ctgcagaggc ctggaaaacc atggaaaaaa tcctattcct caaagtcgag ccctacctcc 61 tacgcctctt aaatcaattg ccacagtgac caatttgtca gtcacctaaa ggcagctatc 121 ctgaccactt aagggagaat acttataatc tttataccta agtaaagatt gtgagaggaa 181 agaaatggtt taaagaatat gaatgtgtca cacttctgag gttaatgata aatgaattgg 241 tcctgcttac ctcaggaaaa actttcaagt ctttctgaaa aactaattta attcagtagt 301 attttctaag atttaggtta tgtttttaat caatttggaa accaagattt acttatagaa 361 aaaaaggaaa aggacctgaa taggtttatt cacatagaat cccaatttca cttctctgga 421 tgataccatt ttctacaaaa gcaattatgt tctaaaattt aagtgtgctt tcttaggctt 481 tatcagttca cagtgtttcc ttaagaaata tgatccagta ttttttccta agactaaagt 541 tgagttacta cgtttatgac tgagaaatga atgtttgtta gtttgtttgt ttacaataag 601 aattttttct ttaccatttt atttttattt tccccaggtg tatttgatat agtgtttgca 661 acaaattcga cccaggtgat caaaatgatt ctcaactctt ctactgaaga tggtattaaa 721 agaatccaag atgattgtcc caaagctgga aggcataatt acatatttgt catgattcct 781 actttataca gtatcatctt tgtggtggga atatttggaa acagcttggt ggtgatagtc 841 atttactttt atatgaagct gaagactgtg gccagtgttt ttcttttgaa tttagcactg 901 gctgacttat gctttttact gactttgcca ctatgggctg tctacacagc tatggaatac 961 cgctggccct ttggcaatta cctatgtaag attgcttcag ccagcgtcag tttcaacctg 1021 tacgctagtg tgtttctact cacgtgtctc agcattgatc gatacctggc tattgttcac 1081 ccaatgaagt cccgccttcg acgcacaatg cttgtagcca aagtcacctg catcatcatt 1141 tggctgctgg caggcttggc cagtttgcca gctataatcc atcgaaatgt atttttcatt 1201 gagaacacca atattacagt ttgtgctttc cattatgagt cccaaaattc aaccctcccg 1261 atagggctgg gcctgaccaa aaatatactg ggtttcctgt ttccttttct gatcattctt 1321 acaagttata ctcttatttg gaaggcccta aagaaggctt atgaaattca gaagaacaaa 1381 ccaagaaatg atgatatttt taagataatt atggcaattg tgcttttctt tttcttttcc 1441 tggattcccc accaaatatt cacttttctg gatgtattga ttcaactagg catcatacgt 1501 gactgtagaa ttgcagatat tgtggacacg gccatgccta tcaccatttg tatagcttat 1561 tttaacaatt gcctgaatcc tcttttttat ggctttctgg ggaaaaaatt taaaagatat 1621 tttctccagc ttctaaaata tattccccca aaagccaaat cccactcaaa cctttcaaca 1681 aaaatgagca cgctttccta ccgcccctca gataatgtaa gctcatccac caagaagcct 1741 gcaccatgtt ttgaggttga gtgacatgtt cgaaacctgt ccataaagta attttgtgaa 1801 agaaggagca agagaacatt cctctgcag // LOCUS HUMATXT 3110 bp DNA PRI 24-MAY-1996 DEFINITION Human autotaxin-t (atx-t) gene, complete cds. ACCESSION L46720 NID g1160615 KEYWORDS autotaxin; motility factor; phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3110) AUTHORS Murata,J., Lee,H.Y., Clair,T., Krutzsch,H.C., Arestad,A.A., Sobel,M.E., Liotta,L.A. and Stracke,M.L. TITLE cDNA cloning of the human tumor motility-stimulating protein, autotaxin, reveals a homology with phosphodiesterases JOURNAL J. Biol. Chem. 269 (48), 30479-30484 (1994) MEDLINE 95074054 REFERENCE 2 (bases 1 to 3110) AUTHORS Lee,H.Y., Murata,J., Clair,T., Polymeropoulos,M.H., Torres,R., Manrow,R.E., Liotta,L.A. and Stracke,M.L. TITLE Cloning, chromosomal localization, and tissue expression of autotaxin from human teratocarcinoma cells JOURNAL Biochem. Biophys. Res. Commun. 218 (3), 714-719 (1996) MEDLINE 96158950 FEATURES Location/Qualifiers source 1..3110 /organism="Homo sapiens" /note="(vector lambda gt10)" /db_xref="taxon:9606" /cell_line="NTera2D1" /cell_type="teratocarcinoma" /dev_stage="adult" /sex="male" /tissue_type="testis" gene 60..2651 /gene="atx-t" CDS 60..2651 /gene="atx-t" /codon_start=1 /product="autotaxin-t" /db_xref="PID:g1160616" /translation="MARRSSFQSCQIISLFTFAVGVNICLGFTAHRIKRAEGWEEGPP TVLSDSPWTNISGSCKGRCFELQEAGPPDCRCDNLCKSYTSCCHDFDELCLKTARAWE CTKDRCGEVRNEENACHCSEDCLARGDCCTNYQVVCKGESHWVDDDCEEIKAAECPAG FVRPPLIIFSVDGFRASYMKKGSKVMPNIEKLRSCGTHSPYMRPVYPTKTFPNLYTLA TGLYPESHGIVGNSMYDPVFDATFHLRGREKFNHRWWGGQPLWITATKQGVKAGTFFW SVVIPHERRILTILQWLTLPDHERPSVYAFYSEQPDFSGHKYGPFGPEMTNPLREIDK IVGQLMDGLKQLKLHRCVNVIFVGDHGMEDVTCDRTEFLSNYLTNVDDITLVPGTLGR IRSKFSNNAKYDPKAIIANLTCKKPDQHFKPYLKQHLPKRLHYANNRRIEDIHLLVER RWHVARKPLDVYKKPSGKCFFQGDHGFDNKVNSMQTVFVGYGPTFKYKTKVPPFENIE LYNVMCDLLGLKPAPNNGTHGSLNHLLRTNTFRPTMPEEVTRPNYPGIMYLQSDFDLG CTCDDKVEPKNKLDELNKRLHTKGSTEERHLLYGRPAVLYRTRYDILYHTDFESGYSE IFLMPLWTSYTVSKQAEVSSVPDHLTSCVRPDVRVSPSFSQNCLAYKNDKQMSYGFLF PPYLSSSPEAKYDAFLVTNMVPMYPAFKRVWNYFQRVLVKKYASERNGVNVISGPIFD YDYDGLHDTEDKIKQYVEGSSIPVPTHYYSIITSCLDFTQPADKCDGPLSVSSFILPH RPDNEESCNSSEDESKWVEELMKMHTARVRDIEHLTSLDFFRKTSRSYPEILTLKTYL HTYESEI" polyA_signal 3065..3070 polyA_site 3094..3110 BASE COUNT 906 a 667 c 674 g 863 t ORIGIN 1 agtgcactcc gtgaaggcaa agagaacacg ctgcaaaagg ctttccaata atcctcgaca 61 tggcaaggag gagctcgttc cagtcgtgtc agataatatc cctgttcact tttgccgttg 121 gagtcaatat ctgcttagga ttcactgcac atcgaattaa gagagcagaa ggatgggagg 181 aaggtcctcc tacagtgcta tcagactccc cctggaccaa catctccgga tcttgcaagg 241 gcaggtgctt tgaacttcaa gaggctggac ctcctgattg tcgctgtgac aacttgtgta 301 agagctatac cagttgctgc catgactttg atgagctgtg tttgaagaca gcccgtgcgt 361 gggagtgtac taaggacaga tgtggggaag tcagaaatga agaaaatgcc tgtcactgct 421 cagaggactg cttggccagg ggagactgct gtaccaatta ccaagtggtt tgcaaaggag 481 agtcgcattg ggttgatgat gactgtgagg aaataaaggc cgcagaatgc cctgcagggt 541 ttgttcgccc tccattaatc atcttctccg tggatggctt ccgtgcatca tacatgaaga 601 aaggcagcaa agtcatgcct aatattgaaa aactaaggtc ttgtggcaca cactctccct 661 acatgaggcc ggtgtaccca actaaaacct ttcctaactt atacactttg gccactgggc 721 tatatccaga atcacatgga attgttggca attcaatgta tgatcctgta tttgatgcca 781 cttttcatct gcgagggcga gagaaattta atcatagatg gtggggaggt caaccgctat 841 ggattacagc caccaagcaa ggggtgaaag ctggaacatt cttttggtct gttgtcatcc 901 ctcacgagcg gagaatatta accatattgc agtggctcac cctgccagat catgagaggc 961 cttcggtcta tgccttctat tctgagcaac ctgatttctc tggacacaaa tatggccctt 1021 tcggccctga gatgacaaat cctctgaggg aaatcgacaa aattgtgggg caattaatgg 1081 atggactgaa acaactaaaa ctgcatcggt gtgtcaacgt catctttgtc ggagaccatg 1141 gaatggaaga tgtcacatgt gatagaactg agttcttgag taattaccta actaatgtgg 1201 atgatattac tttagtgcct ggaactctag gaagaattcg atccaaattt agcaacaatg 1261 ctaaatatga ccccaaagcc attattgcca atctcacgtg taaaaaacca gatcagcact 1321 ttaagcctta cttgaaacag caccttccca aacgtttgca ctatgccaac aacagaagaa 1381 ttgaggatat ccatttattg gtggaacgca gatggcatgt tgcaaggaaa cctttggatg 1441 tttataagaa accatcagga aaatgctttt tccagggaga ccacggattt gataacaagg 1501 tcaacagcat gcagactgtt tttgtaggtt atggcccaac atttaagtac aagactaaag 1561 tgcctccatt tgaaaacatt gaactttaca atgttatgtg tgatctcctg ggattgaagc 1621 cagctcctaa taatgggacc catggaagtt tgaatcatct cctgcgcact aataccttca 1681 ggccaaccat gccagaggaa gttaccagac ccaattatcc agggattatg taccttcagt 1741 ctgattttga cctgggctgc acttgtgatg ataaggtaga gccaaagaac aagttggatg 1801 aactcaacaa acggcttcat acaaaagggt ctacagaaga gagacacctc ctctatgggc 1861 gacctgcagt gctttatcgg actagatatg atatcttata tcacactgac tttgaaagtg 1921 gttatagtga aatattccta atgccactct ggacatcata tactgtttcc aaacaggctg 1981 aggtttccag cgttcctgac catctgacca gttgcgtccg gcctgatgtc cgtgtttctc 2041 cgagtttcag tcagaactgt ttggcctaca aaaatgataa gcagatgtcc tacggattcc 2101 tctttcctcc ttatctgagc tcttcaccag aggctaaata tgatgcattc cttgtaacca 2161 atatggttcc aatgtatcct gctttcaaac gggtctggaa ttatttccaa agggtattgg 2221 tgaagaaata tgcttcggaa agaaatggag ttaacgtgat aagtggacca atcttcgact 2281 atgactatga tggcttacat gacacagaag acaaaataaa acagtacgtg gaaggcagtt 2341 ccattcctgt tccaactcac tactacagca tcatcaccag ctgtctggat ttcactcagc 2401 ctgccgacaa gtgtgacggc cctctctctg tgtcctcctt catcctgcct caccggcctg 2461 acaacgagga gagctgcaat agctcagagg acgaatcaaa atgggtagaa gaactcatga 2521 agatgcacac agctagggtg cgtgacattg aacatctcac cagcctggac ttcttccgaa 2581 agaccagccg cagctaccca gaaatcctga cactcaagac atacctgcat acatatgaga 2641 gcgagattta actttctgag catctgcagt acagtcttat caactggttg tatattttta 2701 tattgttttt gtatttatta atttgaaacc aggacattaa aaatgttagt attttaatcc 2761 tgtaccaaat ctgacatatt atgcctgaat gactccactg tttttctcta atgcttgatt 2821 taggtagcct tgtgttctga gtagagcttg taataaatac tgcagcttga gtttttagtg 2881 gaagcttcta aatggtgctg cagatttgat atttgcattg aggaaatatt aattttccaa 2941 tgcacagttg ccacatttag tcctgtactg tatggaaaca ctgattttgt aaagttgcct 3001 ttatttgctg ttaactgtta actatgacag atatatttaa gccttataaa ccaatcttaa 3061 acataataaa tcacacattc agttttttct ggtaaaaaaa aaaaaaaaaa // LOCUS HUMB1LYM 1146 bp DNA PRI 15-JUL-1993 DEFINITION Human B-lymphocyte cell-surface antigen B1 (CD20). ACCESSION M27394 J03574 NID g179307 KEYWORDS B1 antigen; antigen. SOURCE Human tonsillar lymphocyte cDNA to mRNA, clone pB1-21. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1146) AUTHORS Tedder,T.F., Streuli,M., Schlossman,S.F. and Saito,H. TITLE Isolation and structure of a cDNA encoding the B1 (CD20) cell-surface antigen of human B lymphocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 208-212 (1988) MEDLINE 88124792 COMMENT Submitted in computer readable form by T.Tedder 23-NOV-1987. FEATURES Location/Qualifiers source 1..1146 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q12-13" CDS 134..1027 /codon_start=1 /product="cell surface antigen B1" /db_xref="PID:g179308" /translation="MTTPRNSVNGTFPAEPMKGPIAMQSGPKPLFRRMSSLVGPTQSF FMRESKTLGAVQIMNGLFHIALGGLLMIPAGIYAPICVTVWYPLWGGIMYIISGSLLA ATEKNSRKCLVKGKMIMNSLSLFAAISGMILSIMDILNIKISHFLKMESLNFIRAHTP YINIYNCEPANPSEKNSPSTQYCYSIQSLFLGILSVMLIFAFFQELVIAGIVENEWKR TCSRPKSNIVLLSAEEKKEQTIEIKEEVVGLTETSSQPKNEEDIEIIPIQEEEEEETE TNFPEPPQDQESSPIENDSSP" BASE COUNT 349 a 247 c 224 g 326 t ORIGIN Chromosome 11q12-13. 1 cctcaatgac actcatggag gaaatgctga gagaagcatt cagatgcatg acacaaggta 61 agactgccaa aaatcttgtt cttgctctcc tcattttgtt atttgtttta tttttaggag 121 ttttgagagc aaaatgacaa cacccagaaa ttcagtaaat gggactttcc cggcagagcc 181 aatgaaaggc cctattgcta tgcaatctgg tccaaaacca ctcttcagga ggatgtcttc 241 actggtgggc cccacgcaaa gcttcttcat gagggaatct aagactttgg gggctgtcca 301 gattatgaat gggctcttcc acattgccct ggggggtctt ctgatgatcc cagcagggat 361 ctatgcaccc atctgtgtga ctgtgtggta ccctctctgg ggaggcatta tgtatattat 421 ttccggatca ctcttggcag caacggagaa aaactctagg aagtgtttgg tcaaaggaaa 481 aatgataatg aattcattga gcctctttgc tgccatttct ggaatgattc tttcaatcat 541 ggacatactt aatattaaaa tttcccattt tttaaaaatg gagagtctga attttattag 601 agctcacaca ccatatatta acatatacaa ctgtgaacca gctaatccct ctgagaaaaa 661 ctccccatct acccaatact gttacagcat acaatctctg ttcttgggca ttttgtcagt 721 gatgctgatc tttgccttct tccaggaact tgtaatagct ggcatcgttg agaatgaatg 781 gaaaagaacg tgctccagac ccaaatctaa catagttctc ctgtcagcag aagaaaaaaa 841 agaacagact attgaaataa aagaagaagt ggttgggcta actgaaacat cttcccaacc 901 aaagaatgaa gaagacattg aaattattcc aatccaagaa gaggaagaag aagaaacaga 961 gacgaacttt ccagaacctc cccaagatca ggaatcctca ccaatagaaa atgacagctc 1021 tccttaagtg atttcttctg ttttctgttt ccttttttaa acattagtgt tcatagcttc 1081 caagagacat gctgactttc atttcttgag gtactctgca catacgcacc acatctctat 1141 ctggcc // LOCUS HUMBDNF 918 bp DNA PRI 31-OCT-1994 DEFINITION Human brain-derived neurotrophic factor (BDNF) gene, complete cds. ACCESSION M37762 NID g179402 KEYWORDS neurotrophic factor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 918) AUTHORS Jones,K.R. and Reichardt,L.F. TITLE Molecular cloning of a human gene that is a member of the nerve growth factor family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (20), 8060-8064 (1990) MEDLINE 91045937 COMMENT Draft entry and computer-readable sequence for [Proc. Natl. Acad. Sci. U.S.A. (1990) In press] kindly submitted by K.R.Jones, 13-AUG-1990. FEATURES Location/Qualifiers source 1..918 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" sig_peptide 76..123 /gene="NTF3" /note="G00-125-917; putative" /product="brain-derived neurotrophic factor" CDS 76..819 /gene="BDNF" /note="putative" /codon_start=1 /db_xref="GDB:G00-125-916" /product="brain-derived neurotrophic factor" /db_xref="PID:g179403" /translation="MTILFLTMVISYFGCMKAAPMKEANIRGQGGLAYPGVRTHGTLE SVNGPKAGSRGLTSLADTFEHVIEELLDEDQKVRPNEENNKDADLYTSRVMLSSQVPL EPPLLFLLEEYKNYLDAANMSMRVRRHSDPARRGELSVCDSISEWVTAADKKTAVDMS GGTVTVLEKVPVSKGQLKQYFYETKCNPMGYTKEGCRGIDKRHWNSQCRTTQSYVRAL TMDSKKRIGWRFIRIDTSCVCTLTIKRGR" gene 76..816 /gene="NTF3" /map="12p13" gene 76..819 /gene="BDNF" /map="11p13" mat_peptide 124..816 /gene="NTF3" /note="G00-125-917; putative" /product="brain-derived neurotrophic factor" BASE COUNT 269 a 192 c 237 g 220 t ORIGIN 1 ggtgaaagaa agccctaacc agttttctgt cttgtttctg ctttctccct acagttccac 61 caggtgagaa gagtgatgac catccttttc cttactatgg ttatttcata ctttggttgc 121 atgaaggctg cccccatgaa agaagcaaac atccgaggac aaggtggctt ggcctaccca 181 ggtgtgcgga cccatgggac tctggagagc gtgaatgggc ccaaggcagg ttcaagaggc 241 ttgacatcat tggctgacac tttcgaacac gtgatagaag agctgttgga tgaggaccag 301 aaagttcggc ccaatgaaga aaacaataag gacgcagact tgtacacgtc cagggtgatg 361 ctcagtagtc aagtgccttt ggagcctcct cttctctttc tgctggagga atacaaaaat 421 tacctagatg ctgcaaacat gtccatgagg gtccggcgcc actctgaccc tgcccgccga 481 ggggagctga gcgtgtgtga cagtattagt gagtgggtaa cggcggcaga caaaaagact 541 gcagtggaca tgtcgggcgg gacggtcaca gtccttgaaa aggtccctgt atcaaaaggc 601 caactgaagc aatacttcta cgagaccaag tgcaatccca tgggttacac aaaagaaggc 661 tgcaggggca tagacaaaag gcattggaac tcccagtgcc gaactaccca gtcgtacgtg 721 cgggccctta ccatggatag caaaaagaga attggctggc gattcataag gatagacact 781 tcttgtgtat gtacattgac cattaaaagg ggaagatagt ggatttatgt tgtatagatt 841 agattatatt gagacaaaaa ttatctattt gtatatatac ataacagggt aaattattca 901 gttaagaaaa aaataatt // LOCUS HUMBTFD 1350 bp DNA PRI 31-DEC-1994 DEFINITION Human BTF3 protein homologue gene, complete cds. ACCESSION M90356 NID g179575 KEYWORDS BTF3 protein homologue. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1350) AUTHORS Kanno,M., Chalut,C. and Egly,J.M. TITLE Genomic structure of the putative BTF3 transcription factor JOURNAL Gene 117 (2), 219-228 (1992) MEDLINE 92347696 FEATURES Location/Qualifiers source 1..1350 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" gene 517..1161 /gene="BTF3 homologue" CDS 517..1161 /gene="BTF3 homologue" /codon_start=1 /product="BTF3 homologue" /db_xref="PID:g179576" /translation="MVLSRGSLLNTRPRITSGSGSTGRPRSGSTCRSTDHEPGKLAKL QAQVRIGGKGTAHRKKKVFHRTATADDKKLQFSLKKLQVNNISGIEKVNMFTNQGTVI HFNNPKFQASLAVNTFTITGHAEAKQVTEMLPSVLSQLGADSLTSLRRLAEVLPKQPV DGKAPLATGGDDDDGVPELWRILMRLPGMRQAELSRLLKKIKLEEVTGSCYFLL" BASE COUNT 423 a 272 c 279 g 376 t ORIGIN 1 agaaaattaa gggaaaatcg attcctctta tctagttact tagatattgg ccttggcttt 61 atctcaatat tatatggatc atagctggca actaattcag tccagtaaat atcctcaata 121 gggaataata tatgcttccc attccatcgg gaaaaagttt tgttcaacac accaagctca 181 atcaactcac taatgtatgg gaattgtttt gatgtaacca catacttcct gccttcatta 241 agggctgcgc acaaaaccat agattgctct tctgtaaggt tttgaattac tgatcgcact 301 ttatcgtttt gcatcttaat gcgttttctt agcttaaatc gcttatatct ggcgctggca 361 atagctgata atcgatgcac attaattgct agcgaaaatg caagagcaaa gacgaaaaca 421 tgccacacat gaggaatacc gattctctca ttaacatatt caggccagtt atctgggctt 481 aaaagcagaa gtccaaccca gataacgatc atatacatgg ttctctccag aggttcatta 541 ctgaacactc gtccgagaat aacgagtgga tctgggtcga ccggtcgacc cagatctggg 601 tcgacctgca ggtcaacgga tcatgaacca ggaaaactcg ccaaactgca ggcacaagtg 661 cgcattggtg ggaaaggaac tgctcacaga aagaagaagg tcttccatag aacagccaca 721 gccgatgata aaaagcttca attctcctta aagaagttac aggtaaacaa tatctctggt 781 attgaaaagg tgaatatgtt tacaaaccaa ggaacagtga tccactttaa caaccctaaa 841 tttcaggcat cgctggcagt gaacactttc accataacag gtcatgctga ggcaaagcag 901 gtgacagaaa tgctacccag tgtcttaagc cagcttggtg cagacagtct gactagttta 961 aggagactgg ctgaagttct gcccaaacaa cctgtggatg gaaaagcacc acttgctact 1021 ggaggggatg atgatgatgg agttccagaa ttgtggagaa ttttgatgag gcttccagga 1081 atgaggcaag ctgaattgag tcgacttctg aagaagataa aacttgaaga agttactggg 1141 agctgctatt ttctattatg actgcttttt aagaaatttt ttgttcatgg atctgataaa 1201 atctagatct ctatacttct aagcccaagc cccttggaca ctgtagcact ttttagtttt 1261 cgcttataca taatcattct ttttagctaa ttaagctgca gaacgtggga aataaagttc 1321 gaaacaaagg ttaataaagt tctttgcctt // LOCUS HUMC2CNT 2204 bp DNA PRI 10-APR-1996 DEFINITION Homo sapiens core 2 beta-1,6-N-acetylglucosaminyltransferase (core 2 GnT) gene, complete cds. ACCESSION L41415 NID g886272 KEYWORDS beta-1,6-N-acetylglucosaminyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2204) AUTHORS Bierhuizen,M.F., Maemura,K., Kudo,S. and Fukuda,M. TITLE Genomic organization of core 2 and I branching beta-1,6-N-acetylglucosaminyltransferases. Implication for evolution of the beta-1,6-N-acetylglucosaminyltransferase gene family JOURNAL Glycobiology 5 (4), 417-425 (1995) MEDLINE 96078409 FEATURES Location/Qualifiers source 1..2204 /organism="Homo sapiens" /note="(vector lambda EMBL3)" /db_xref="taxon:9606" /map="chromosome 9" /tissue_type="placenta" intron <1..100 /number=1 exon 101..2204 /number=2 gene 244..1530 /gene="core 2 GnT" CDS 244..1530 /gene="core 2 GnT" /EC_number="2.4.1.102" /note="core 2" /codon_start=1 /product="beta-1,6-N-acetylglucosaminyltransferase" /db_xref="PID:g886273" /translation="MLRTLLRRRLFSYPTKYYFMVLVLSLITFSVLRIHQKPEFVSVR HLELAGENPSSDINCTKVLQGDVNEIQKVKLEILTVKFKKRPRWTPDDYINMTSDCSS FIKRRKYIVEPLSKEEAEFPIAYSIVVHHKIEMLDRLLRAIYMPQNFYCVHVDTKSED SYLAAVMGIASCFSNVFVASRLESVVYASWSRVQADLNCMKDLYAMSANWKYLINLCG MDFPIKTNLEIVRKLKLLMGENNLETERMPSHKEERWKKRYEVVNGKLTNTGTVKMLP PLETPLFSGSAYFVVSREYVGYVLQNEKIQKLMEWAQDTYSPDEYLWATIQRIPEVPG SLPASHKYDLSDMQAVARFVKWQYFEGDVSKGAPYPPCDGVHVRSVCIFGAGDLNWML RKHHLFANKFDVDVDLFAIQCLDEHLRHKALETLKH" BASE COUNT 641 a 414 c 498 g 651 t ORIGIN 1 gatttattgt gaaaaactct ctctctctct ctctctctgt atatatatat atatatatat 61 atatatttat ttatatttat aattgcttct tttatttcag tgctgctctt catttcaaga 121 tgccgttgca gctctgataa atgcaaactg acaaccttca aggccacgac ggagggaaaa 181 tcattggtgc ttggagcata gaagactgcc cttcacaaag gaaatccctg attattgttt 241 gaaatgctga ggacgttgct gcgaaggaga cttttttctt atcccaccaa atactacttt 301 atggttcttg ttttatccct aatcaccttc tccgttttaa ggattcatca aaagcctgaa 361 tttgtaagtg tcagacactt ggagcttgct ggggagaatc ctagtagtga tattaattgc 421 accaaagttt tacagggtga tgtaaatgaa atccaaaagg taaagcttga gatcctaaca 481 gtgaaattta aaaagcgccc tcggtggaca cctgacgact atataaacat gaccagtgac 541 tgttcttctt tcatcaagag acgcaaatat attgtagaac cccttagtaa agaagaggcg 601 gagtttccaa tagcatattc tatagtggtt catcacaaga ttgaaatgct tgacaggctg 661 ctgagggcca tctatatgcc tcagaatttc tattgcgttc atgtggacac aaaatccgag 721 gattcctatt tagctgcagt gatgggcatc gcttcctgtt ttagtaatgt ctttgtggcc 781 agccgattgg agagtgtggt ttatgcatcg tggagccggg ttcaggctga cctcaactgc 841 atgaaggatc tctatgcaat gagtgcaaac tggaagtact tgataaatct ttgtggtatg 901 gattttccca ttaaaaccaa cctagaaatt gtcaggaagc tcaagttgtt aatgggagaa 961 aacaacctgg aaacggagag gatgccatcc cataaagaag aaaggtggaa gaagcggtat 1021 gaggtcgtta atggaaagct gacaaacaca gggactgtca aaatgcttcc tccactcgaa 1081 acacctctct tttctggcag tgcctacttc gtggtcagta gggagtatgt ggggtatgta 1141 ctacagaatg aaaaaatcca aaagttgatg gagtgggcac aagacacata cagccctgat 1201 gagtatctct gggccaccat ccaaaggatt cctgaagtcc cgggctcact ccctgccagc 1261 cataagtatg atctatctga catgcaagca gttgccaggt ttgtcaagtg gcagtacttt 1321 gagggtgatg tttccaaggg tgctccctac ccgccctgcg atggagtcca tgtgcgctca 1381 gtgtgcattt tcggagctgg tgacttgaac tggatgctgc gcaaacacca cttgtttgcc 1441 aataagtttg acgtggatgt tgacctcttt gccatccagt gtttggatga gcatttgaga 1501 cacaaagctt tggagacatt aaaacactga ccattacggg caattttatg aacaagaaga 1561 aggatacaca aaacgtaccc ttatctgttt ccccttcctt gtcagcatcg ggaagatggt 1621 atgaagtcct ctttggggca gggactctag tagatcttct tgtcagagaa gctgcatggt 1681 ttctgcagag cacagttagc tagaaaggtg atagcattaa atgttcatct agagttaata 1741 gtgggaggag taaaggtagc cttgaggcca gagcaggtag caaggcattg tggaaagagg 1801 ggaccagggt ggctggggaa gaggccgatg cataaagtca gcctgttcaa agtgctcagg 1861 gacttagcaa aatgagaaga tgtgacctgt gccaaaacta ttttgagaat tttaaatgtg 1921 accatttttc tggtatgaat aaacttacag caacaaataa tcaaagatac aattaatctg 1981 atattatatt tgttgaaata gaaatttgat tgtactataa atgatttttg taaataattt 2041 atattctgct ctaatactgt actgtgtagt gtgtctccgt atgtcatctc agggagctta 2101 aaatgggctt gatttaacat tgtttttgtg ttatttttgc ttgaaacaac gcacacattt 2161 tcaacaacca aaaaatgaca atttctagtt tagttaattt ctac // LOCUS HUMCD43 5050 bp DNA PRI 01-NOV-1994 DEFINITION Human leukosialin (CD43) gene, complete cds. ACCESSION M61827 NID g180125 KEYWORDS leukosialin; sialoglycoprotein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5050) AUTHORS Kudo,S. and Fukuda,M. TITLE A short, novel promoter sequence confers the expression of human leukosialin, a major sialoglycoprotein on leukocytes JOURNAL J. Biol. Chem. 266 (13), 8483-8489 (1991) MEDLINE 91217090 COMMENT From EMBL entry HSCD43; dated 21-JUL-1991. FEATURES Location/Qualifiers source 1..5050 /organism="Homo sapiens" /db_xref="taxon:9606" /map="16p11.2" mRNA join(1799..1868,2247..4068) /gene="SPN" /note="G00-120-384" /product="leukosialin" gene join(1799..1868,2247..4068) /gene="SPN" exon 1799..1868 /gene="SPN" /number=1 /product="leukosialin" gene 1799..4068 /gene="CD43" exon 2247..4068 /gene="SPN" /number=2 /product="leukosialin" CDS 2281..3483 /gene="SPN" /codon_start=1 /db_xref="GDB:G00-120-384" /product="leukosialin" /db_xref="PID:g180126" /translation="MATLLLLLGVLVVSPDALGSTTAVQTPTSGEPLVSTSEPLSSKM YTTSITSDPKADSTGDQTSALPPSTSINEGSPLWTSIGASTGSPLPEPTTYQEVSIKM SSVPQETPHATSHPAVPITANSLGSHTVTGGTITTNSPETSSRTSGAPVTTAASSLET SRGTSGPPLTMATVSLETSKGTSGPPVTMATDSLETSTGTTGPPVTMTTGSLEPSSGA SGPQVSSVKLSTMMSPTTSTNASTVPFRNPDENSRGMLPVAVLVALLAVIVLVALLLL WRRRQKRRTGALVLSRGGKRNGVVDAWAGPAQVPEEGAVTVTVGGSGGDKGSGFPDGE GSSRRPTLTTFFGRRKSRQGSLAMEELKSGSGPSLKGEEEPLVASEDGAVDAPAPDEP EGGDGAAP" BASE COUNT 1087 a 1496 c 1353 g 1114 t ORIGIN 1 cccccctgca gaatgggcac cccgttacct ttctgagcca ctgtgcgcag aaaagagagc 61 atgttggcca ggctggtctc gaactcctga cctcaagtga tcagcctgcc ttacctccca 121 aagtcctggg attacaggcg tgaaccacca cgctcagcct ctgaatactt tgtactcaag 181 ccatttttca gtgctgtgtt tgcagtgagc acacccgagg gatgaagaca cgtctccctg 241 tgggaacctg ggcttaccag ggcccctaga ggaggggaat ctctcaagct cagagctcta 301 tggctgcggt gcaggcccac tgtgtgcatg gtgtcagtct gggcccttcc atgttgcccc 361 cgtgggactt ggggtaaggg gaactgatgc aaacatcacg ctgctgttgc ttggtgtgag 421 caattaattc ctgtggctct cacccaggag tctcatgtct ttgggtcaga caaactcatc 481 agcttgtaga aatggcacag tcccacgggc ctgttagaat cttctattgt gcacatgttg 541 ctcttaaaat atacaaatca gttttgattt taaaaaatta tttatttttt tagtgatagg 601 agttttgcta cgttgcccag gctggtttca aactcttggg ctcaggaggt cctcccactt 661 tggcctggac tgccagcata atgtatcacc acacccggga ctgattttcg tttttcaaga 721 acaaaaacca aaaacataca caaaccgaga gtcaaagctt gctaattaga ggaaagtcag 781 gaaatgggaa ccattcaaag aagaaaatac ccccacctcc tactctcacc tatccaaaga 841 caattaggtg aatcccttag tagatatctt tccagacggt tttccatata gattcccata 901 tctggccagg cgcggtggct cacacctgta atcctagcgc ttggggaggc tgaggcggat 961 ggaccacctg aggtcaggag ttcgagacca gcctgaccaa catggagaaa cctcgtctct 1021 acgaaaaata caaaattagc cgggcacagt ggtgcaagcc tgtaatccca gctactcagg 1081 aggccgaggc aggagaattg cttgaaccta ggaggcagac attgtgctga gccgagccaa 1141 gatcatgcca ttgcactaaa ctccgcctta aaaaaaaaaa aaaagattcc cacatcttta 1201 ctagtttgca gaaataagat cctagcatat gcagtgtgta ggaaccacct tggtttagcc 1261 acgtctctgt gactgggggc cactgtggtg acccccagct ccccggacag agtcaagagc 1321 tcaccagcct gcaaaggttt tcacggcccc cagccagact cgggggcttc ctcttgccct 1381 gctacttcct gggagctctg agggcaggaa atggcgccac tcagctcctg gcctaacagc 1441 ttggggacca caaatgcaaa ggaaaccacc ctcccctccc acctcctcct ctgcaccctt 1501 gagttctcag gctcacattc ccaccaccca cctctgagcc cagccctccc tagcatcacc 1561 acttccatcc cattcctcag ccaagagcca ggaatcctga ttccagatcc cacgcttccc 1621 tgcctccctc aggtgagccc cagaccccca ggcaccccgc tggcccctga aggagcaggt 1681 gatggtgctg tcttcgccca gcagctgtgg gagcaggcgg gtggggcagg atggaggggt 1741 gggtggggtg ggtggagcca gggcccactt cctttcccct tggggccctg tccttcccag 1801 tcttgcccca gcctcgggag gtggtggagt gacctggccc cagtgctgcg tccttatcag 1861 ccgagccggt aagagggtga gacttggtgg ggtaggggcc tcagtgggcc tgggaatgtg 1921 cctgtggctt gaaaagactc tgacaggtta tgatgggaag agattgggag ccattgggct 1981 gcacagggtc agggaaggcc aggaggggct ggtcactgct ggaatctaag ctgctgaggc 2041 tggagggagc ctcaggatgg ggctgatggg ggagctgcca gcatctgttc ctctgtcatt 2101 tctgataaca gtaaaagcca gcatggaaaa aaccgttaaa ccgcaggttg ggcctggccg 2161 ttggcaggga agtgggcaga ggggaggccc ggccaggtcc tccggcaact cccgcgtgtt 2221 ctgcttctcc ggctgcccac ctgcaggtcc cagctcttgc tcctgcctgt ttgcctggaa 2281 atggccacgc ttctccttct ccttggggtg ctggtggtaa gcccagacgc tctggggagc 2341 acaacagcag tgcagacacc cacctccgga gagcctttgg tctctactag cgagcccctg 2401 agctcaaaga tgtacaccac ttcaataaca agtgacccta aggccgacag cactggggac 2461 cagacctcag ccctacctcc ctcaacttcc atcaatgagg gatcccctct ttggacttcc 2521 attggtgcca gcactggttc ccctttacct gagccaacaa cctaccagga agtttccatc 2581 aagatgtcat cagtgcccca ggaaacccct catgcaacca gtcatcctgc tgttcccata 2641 acagcaaact ctctaggatc ccacaccgtg acaggtggaa ccataacaac gaactctcca 2701 gaaacctcca gtaggaccag tggagcccct gttaccacgg cagctagctc tctggagacc 2761 tccagaggca cctctggacc ccctcttacc atggcaactg tctctctgga gacttccaaa 2821 ggcacctctg gaccccctgt taccatggca actgactctc tggagacctc cactgggacc 2881 actggacccc ctgttaccat gacaactggc tctctggagc cctccagcgg ggccagtgga 2941 ccccaggtct ctagcgtaaa actatctaca atgatgtctc caacgacctc caccaacgca 3001 agcactgtgc ccttccggaa cccagatgag aactcacgag gcatgctgcc agtggctgtg 3061 cttgtggccc tgctggcggt catagtcctc gtggctctgc tcctgctgtg gcgccggcgg 3121 cagaagcggc ggactggggc cctcgtgctg agcagaggcg gcaagcgtaa cggggtggtg 3181 gacgcctggg ctgggccagc ccaggtccct gaggaggggg ccgtgacagt gaccgtggga 3241 gggtccgggg gcgacaaggg ctctgggttc cccgatgggg aggggtctag ccgtcggccc 3301 acgctcacca ctttctttgg cagacggaag tctcgccagg gctccctggc gatggaggag 3361 ctgaagtctg ggtcaggccc cagcctcaaa ggggaggagg agccactggt ggccagtgag 3421 gatggggctg tggacgcccc agctcctgat gagcccgaag ggggagacgg ggctgcccct 3481 taagtgtcgg tgaatagtga ggctggaggc cgcaatctca gccagcctcc agcaccttcc 3541 ctctcaccat cccactgccc cctcgctccc atgtttccac ccggcaccct gatcctcacc 3601 cgaatctcct tttttttttt cttttgagac agagtttcgc tttgtcgccc aggctggagt 3661 gcaatgcacg atctcagttc actgcaacct ctgcctccta agttcaggcg attctcctgc 3721 ctcagcttcc cgagtaactg agattacagg cacccaccac catgcccagc tgcttttttg 3781 tatttttggt agagatgggg tttcaccatg ttggctaggc tggtctcaaa ctcctgacct 3841 caggtgatct acctgcctca gcctcccaaa gtgctgagat tacagacatg agcctccgcg 3901 ccttgcctcc tcacccacct cttcactctg aatcctcatg aggcttctca gccctggatt 3961 tcctgctgcc atcctcaccc agcacccaca actagcgcct gggcagggca gggctggcac 4021 ctctcaacgt ctgtggactg aatgaataaa ccctcctcat ccacccctat ttatctccat 4081 caccatttcc ccctctttct tgttcctgga aacggctgct gagtctccat cggccaaact 4141 tatctgccct gtgatttctt tgacaattct ccttttcccc cagaacccac cctgggttga 4201 ccagagtctg ggaagaagga caagagaacc cggcaaactc cctcctagga ttaactttgt 4261 aaagcaccct tgccctgtag ctgcaagggc tgtggaacct gggcagcccg caaccacctt 4321 tagctctggg ccccccaggc cagcctggag catggctggg tggggccacc agcccatgct 4381 ctcaggcggg cctgtgatct ttcccagggc acatggactg taggctggcc ctggcccaca 4441 ccaccacact ctccccagcc atggacagag gcagccagag gcctcacggt ttctcctccg 4501 agtttctggc tgggtgtagt tctcagaaac cccagtgcct gcgtgtgtcc actcgtgggt 4561 gtggtttgtg tgcaagagct gaggatttgg cgatgcttgg gaggggtagt tgtgggtaca 4621 gacggtgtgg gggtgggaag tggtgcagag actgaagagg gtcaacctgg gcatggggga 4681 cacagggact gctgagaacg tgcgtgtcat ctttgctctg atggggtgga catagcagaa 4741 aatctaactc tgtctgtagc cccatacaga atgccagggt gagcacagtg gctggtgcct 4801 ttaatcccag cactttggaa agttgaggca ggaggatcgc ttgagcccag gagttcgagt 4861 ctgaagtgag ctgtgattgc accactgcac ttcagcctgg gcaacagagt gagcccctgt 4921 ctcaaaaaag aaaagaaaaa gaaagccagg cttcatggaa agatcgtatg tgtgacccaa 4981 tatgagttct tcagctcagc catggtaatc ccttccttga agtctccatt tctgcagtac 5041 acatgcatgt // LOCUS HUMCDR34 2412 bp DNA PRI 01-NOV-1994 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 NID g180188 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: a highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (8), 3077-3081 (1990) MEDLINE 90222173 COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990. FEATURES Location/Qualifiers source 1..2412 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq27.1-q27.2" gene 503..1174 /gene="CDR1" CDS 503..1174 /gene="CDR1" /note="cerebellar-degeneration-related antigen (CDR34)" /codon_start=1 /db_xref="GDB:G00-119-053" /db_xref="PID:g180189" /translation="MAWLEDVDFLEDVPLLEDIPLLEDVPLLEDVPLLEDTSRLEDIN LMEDMALLEDVDLLEDTDFLEDLDFSEAMDLREDKDFLEDMDSLEDMALLEDVDLLED TDFLEDPDFLEAIDLREDKDFLEDMDSLEDLRPLEDVDFLEDMAFLEDVDFQEDPNYP EDLDCWEDVDFLEDWRLLEDMDFLEDMDFLEDVDLQEDIYWLEDLDFFRKMWIDWKTW IWWKT" BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS HUMCMOS 1303 bp DNA PRI 01-NOV-1994 DEFINITION human humos gene homologous to transforming gene of mmsv. ACCESSION J00119 NID g180640 KEYWORDS c-myc proto-oncogene; mos oncogene. SOURCE human placental dna, clone lambdahm1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1303) AUTHORS Watson,R., Oskarsson,M. and Vande Woude,G.F. TITLE Human DNA sequence homologous to the transforming gene (mos) of Moloney murine sarcoma virus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79 (13), 4078-4082 (1982) MEDLINE 82275068 COMMENT human c-mos (humos) was aligned with mouse c-mos (mumos) dna, both homologs to v-mos of moloney murine sarcoma virus. extensive similarity was found. however, humos dna fragments were unable to transform mouse nih 3t3 cells. FEATURES Location/Qualifiers source 1..1303 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q11" gene 241..1281 /gene="MOS" CDS 241..1281 /gene="MOS" /note="c-mos transforming protein" /codon_start=1 /db_xref="GDB:G00-119-396" /db_xref="PID:g180641" /translation="MPSPLALRPYLRSEFSPSVDARPCSSPSELPAKLLLGATLPRAP RLPRRLAWCSIDWEQVCLLQRLGAGGFGSVYKATYRGVPVAIKQVNKCTKNRLASRRS FWAELNVARLRHDNIVRVVAASTRTPAGSNSLGTIIMEFGGNVTLHQVIYGAAGHPEG DAGEPHCRTGGQLSLGKCLKYSLDVVNGLLFLHSQSIVHLDLKPANILISEQDVCKIS DFGCSEKLEDLLCFQTPSYPLGGTYTHRAPELLKGEGVTPKADIYSFAITLWQMTTKQ APYSGERQHILYAVVAYDLRPSLSAAVFEDSLPGQRLGDVIQRCWRPSAAQRPSARLL LVDLTSLKAELG" BASE COUNT 265 a 382 c 381 g 275 t ORIGIN 5' to the ecor-i site at 5.4kb on the lambda hm1 fragment. 1 cggaagggaa atgctttcat ctgaaaggga tagctgtgct tcattccggt ttctccctcc 61 atctgataaa aactcttgct gagtgacagc acagatgtag ctcatttgga acaagtgaag 121 gaaaaggaga aaagggatga ggtggagcga aggagtagtc agtcatgttt ccaaagtccc 181 gcggtttccc ctagtctctt cattcactcc agcggccctg gtgtccccct gcaaagtgcg 241 atgccctcgc ccctggccct acgcccctac ctccggagcg agttttcccc atcggtggac 301 gcgcggccct gcagcagtcc ctcagagcta cctgcgaagc tgcttctggg ggccactctt 361 cctcgggccc cgcggctgcc gcgccggctg gcctggtgct ccattgactg ggagcaggtg 421 tgcttgctgc agaggctggg agctggaggg tttggctcgg tgtacaaggc gacttaccgc 481 ggtgttcctg tggccataaa gcaagtgaac aagtgcacca agaaccgact agcatctcgg 541 cggagtttct gggctgagct caacgtagca aggctgcgcc acgataacat cgtgcgcgtg 601 gtggctgcca gcacgcgcac gcccgcaggg tccaatagcc tagggaccat catcatggag 661 ttcggtggca acgtcacttt acaccaagtc atctatggcg ccgccggcca ccctgagggg 721 gacgcagggg agcctcactg ccgcactgga ggacagttaa gtttgggaaa gtgtctcaag 781 tactcactag atgttgtgaa cggcctgctc ttcctccact cgcaaagcat tgtgcacttg 841 gacctgaagc ccgcgaacat cttgatcagt gagcaggatg tctgtaaaat tagtgacttc 901 ggttgctctg agaagttgga agatctgctg tgcttccaga caccctctta ccctctagga 961 ggcacataca cccaccgcgc cccggagctc ctgaaaggag agggcgtgac gcctaaagcc 1021 gacatttatt cctttgccat cactctctgg caaatgacta ccaagcaggc gccgtattcg 1081 ggggagcggc agcacatact gtacgcggtg gtggcctacg acctgcgccc gtccctctcc 1141 gctgccgtct tcgaggactc gctccccggg cagcgccttg gggacgtcat ccagcgctgc 1201 tggagaccca gcgcggcgca gaggccgagc gcgcggctgc ttttggtgga tctcacctct 1261 ttgaaagctg aactcggctg actgaaaact tggtcaagat aag // LOCUS HUMCNGCCA 3408 bp DNA PRI 01-MAY-1995 DEFINITION Homo sapiens clone hRCNC2b retinal rod cyclic nucleotide-gated cation channel gene, complete cds. ACCESSION L15296 NID g291913 KEYWORDS cyclic nucleotide-gated cation channel; retinal protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3408) AUTHORS Chen,T.Y., Peng,Y.W., Dhallan,R.S., Ahamed,B., Reed,R.R. and Yau,K.W. TITLE A new subunit of the cyclic nucleotide-gated cation channel in retinal rods JOURNAL Nature 362 (6422), 764-767 (1993) MEDLINE 93226050 REFERENCE 2 (bases 1 to 3408) AUTHORS Ahamed,B. TITLE Direct Submission JOURNAL Submitted (17-MAY-1993) Basheer Ahamed, Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..3408 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hRCNC2b" /tissue_type="retinal" CDS 105..2834 /codon_start=1 /product="cyclic nucleotide-gated cation channel" /db_xref="PID:g790511" /translation="MPRELSRIEEEKEDEEEEEEEEEEEEEEEVTEVLLDSCVVSQVG VGQSEEDGTRPQSTSDQKLWEEVGEEAKKEAEEKAKEEAEEVAEEEAEKEPQDWAETK EEPEAEAEAASSGVPATKQHPEVQVEDTDADSCPLMAEENPPSTVLPPPSPAKSDTLI VPSSASGTHRKKLPSEDDEAEELKALSPAESPVVAWSDPTTPKDTDGQDRAASTASTN SAIINDRLQELVKLFKERTEKVKEKLIDPDVTSDEESPKPSPAKKAPEPAPDTKPAEA EPVEEEHYCDMLCCKFKHRPWKKYQFPQSIDPLTNLMYVLWLFFVVMAWNWNCWLIPV RWAFPYQTPDNIHHWLLMDYLCDLIYFLDITVFQTRLQFVRGGDIITDKKDMRNNYLK SRRFKMDLLSLLPLDFLYLKVGVNPLLRLPRCLKYMAFFEFNSRLESILSKAYVYRVI RTTAYLLYSLHLNSCLYYWASAYQGLGSTHWVYDGVGNSYIRCYYFAVKTLITIGGLP DPKTLFEIVFQLLNYFTGVFAFSVMIGQMRDVVGAATAGQTYYRSCMDSTVKYMNFYK IPKSVQNRVKTWYEYTWHSQGMLDESELMVQLPDKMRLDLAIDVNYNIVSKVALFQGC DRQMIFDMLKRLRSVVYLPNDYVCKKGEIGREMYIIQAGQVQVLGGPDGKSVLVTLKA GSVFGEISLLAVGGGNRRTANVVAHGFTNLFILDKKDLNEILVHYPESQKLLRKKARR MLRSNNKPKEEKSVLILPPRAGTPKLFNAALAMTGKMGGKGAKGGKLAHLRARLKELA ALEAAAKHEELVEQAKSSQDVKGEEGSAAPDQHTHPKEAATDPPAPRTPPEPPGSPPS SPPPASLGSCEGEEEGPAEPEEHSVRICMSPGPEPGEQILSVKMPEEREEKAE" BASE COUNT 807 a 948 c 1008 g 645 t ORIGIN 1 gagattctgc tccagccaca gaagcagccg cagcccaggc ctagtgtctt ggcctcaagg 61 gcctcattgg gttcatctgt gcaccctgtg tccccagtgc tgccatgccc agagagctgt 121 cccggattga agaggagaaa gaagatgagg aggaggaaga ggaagaggag gaggaggagg 181 aagaggagga ggtgactgag gtgctgctgg atagctgtgt ggtgtcgcag gtgggcgtgg 241 gccagagtga agaagacggg acccggcccc agagcacttc agatcagaag ctgtgggagg 301 aagttgggga ggaggccaag aaggaggctg aagagaaggc caaggaggag gccgaggagg 361 tggctgaaga ggaggctgaa aaggagcccc aggactgggc ggagaccaag gaggagcctg 421 aggctgaggc cgaggctgcc agttcaggag tgcctgccac gaaacagcac ccagaagtgc 481 aggtggaaga tactgatgct gatagctgcc ccctcatggc agaagagaat ccaccctcaa 541 ccgtgttgcc gccaccgtct cctgccaaat cagacaccct tatagtccca agctcagcct 601 cggggacaca caggaagaag ctgccctctg aggatgatga ggctgaagag ctcaaggcgt 661 tgtcaccagc agagtcccca gtggttgcct ggtctgaccc caccaccccg aaggacactg 721 atggccagga ccgtgcggcc tccacggcca gcacaaatag cgccatcatc aacgaccggc 781 tccaggagct ggtgaagctc ttcaaggagc ggacagagaa agtgaaggag aaactcattg 841 accctgacgt cacctctgat gaggagagcc ccaagccctc cccagccaag aaagccccag 901 agccagctcc agacacaaag cccgctgaag ccgagccagt ggaagaggag cactattgcg 961 acatgctctg ctgcaagttc aaacaccgcc cctggaagaa gtaccagttt ccccagagca 1021 ttgacccgct gaccaacctg atgtatgtcc tatggctgtt cttcgtggtg atggcctgga 1081 attggaactg ttggctgatt cccgtgcgct gggccttccc ctaccagacc ccggacaaca 1141 tccaccactg gctgctgatg gattacctat gcgacctcat ctacttcctg gacatcaccg 1201 tgttccagac acgcctgcag tttgtcagag gcggggacat cattacggac aaaaaggaca 1261 tgcgaaataa ctacctgaag tctcgccgct tcaagatgga cctgctcagc ctcctgccct 1321 tggattttct ctatttgaaa gtcggtgtga accccctcct ccgcctgccc cgctgtttaa 1381 agtacatggc cttcttcgag tttaacagcc gcctggaatc catcctcagc aaagcctacg 1441 tgtacagggt catcaggacc acagcctacc ttctctacag cctgcatttg aattcctgtc 1501 tttattactg ggcatcggcc tatcagggcc tcggctccac tcactgggtt tacgatggcg 1561 tgggaaacag ttatattcgc tgttactact ttgctgtgaa gaccctcatc accatcgggg 1621 ggctgcctga ccccaagaca ctctttgaaa ttgtcttcca gctgctgaat tatttcacgg 1681 gcgtctttgc tttctctgtg atgatcggac agatgagaga tgtggtaggg gccgccaccg 1741 cgggacagac ctactaccgc agctgcatgg acagcacggt gaagtacatg aatttctaca 1801 agatccccaa gtccgtgcag aaccgcgtca agacctggta cgagtacacc tggcactcgc 1861 aaggcatgct ggatgagtca gagctgatgg tgcagcttcc agacaagatg cggctggacc 1921 tcgccatcga cgtgaactac aacatcgtta gcaaagtcgc actctttcag ggctgtgacc 1981 ggcagatgat ctttgacatg ctgaagaggc ttcgctctgt tgtctacctg cccaacgact 2041 atgtgtgcaa gaagggggag atcggccgtg agatgtacat catccaggca gggcaagtgc 2101 aggtcttggg cggccctgat gggaaatctg tgctggtgac gctgaaagct ggatctgtgt 2161 ttggagaaat aagcttgctg gctgttgggg gcgggaaccg gcgcacggcc aacgtggtgg 2221 cgcacgggtt taccaacctc ttcatcctgg ataagaagga cctgaatgag attttggtgc 2281 attatcctga gtctcagaag ttactccgga agaaagccag gcgcatgctg agaagcaaca 2341 ataagcccaa ggaggagaag agcgtgctga tccttccacc ccgggcgggc accccaaagc 2401 tcttcaacgc tgccctcgct atgacaggaa agatgggtgg caagggggca aaaggcggca 2461 aacttgctca cctccgggcc cggctcaaag aactggccgc gctggaggcg gctgcaaagc 2521 acgaagagtt ggtggaacag gccaagagct cgcaagacgt caagggagag gaaggctccg 2581 ccgccccaga ccagcacacg cacccaaagg aggccgccac cgacccaccc gcgccccgga 2641 cgccccccga gcccccgggg tctccaccga gctctccacc gcctgcctcc cttgggagct 2701 gcgagggaga ggaggagggg ccggccgagc ccgaagagca ctcggtgagg atctgcatga 2761 gcccgggccc ggagccggga gagcagatcc tgtcggtgaa gatgccggag gaaagggagg 2821 agaaggcgga gtaaggtggg gtgaggcgga tcccgcgcgc agttccagca ggtgtgtccc 2881 cagcgcccgc tgcgcccctc gccccagcgc cccaccttcc cccacggctc aagagaagat 2941 gcttttccgt agtcgtgacc tcagtggctg cagctctgac cgtcccgcca gcacgccagc 3001 cccgactcag ctcctcgcgg ggctgggcct gagctcgaca agttgcatca agtgttcgag 3061 tccctgagct ctcactatca tttgagagcc ctaccttttc ctgactgttc ctctttttaa 3121 gaacaaaatg attttccact tttaaagttc atgggtgagg gataaaatga ggctccaata 3181 catgggaagc tttgctgaaa aagtaaagtg ttattgaatg tgtggggttt cccctcagga 3241 atttgtctaa cacatttcaa ggatagaaaa tacttcactg ccgggcatgg tggctcatgc 3301 ctataatccc agtgctttgg gaagccgaag caggaggatc actggaggcc aagagttgga 3361 gagcagactg ggcaacatag cgagacctca tctcaaaccg gaattcgg // LOCUS HUMCSYNA 2647 bp DNA PRI 30-SEP-1988 DEFINITION Human c-syn protooncogene, complete cds. ACCESSION M14333 NID g181171 KEYWORDS c-myc proto-oncogene. SOURCE Human (placental) DNA, clone lambda-SN-2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2647) AUTHORS Semba,K., Nishizawa,M., Miyajima,N., Yoshida,M.C., Sukegawa,J., Yamanashi,Y., Sasaki,M., Yamamoto,T. and Toyoshima,K. TITLE yes-related protooncogene, syn, belongs to the protein-tyrosine kinase family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 5459-5463 (1986) MEDLINE 86287278 COMMENT syn belongs to the protein-tyrosine kinase family of retroviral oncogenes. FEATURES Location/Qualifiers source 1..2647 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 580..2193 /note="c-syn" /codon_start=1 /db_xref="PID:g181172" /translation="MGCVQCKDKEATKLTEERDGSLNQSSGYRYGTDPTPQHYPSFGV TSIPNYNNFHAAGGQGLTVFGGVNSSSHTGTLRTRGGTGVTLFVALYDYEARTEDDLS FHKGEKFQILNSSEGDWWEARSLTTGETGYIPSNYVAPVDSIQAEEWYFGKLGRKDAE RQLLSFGNPRGTFLIRESETTKGAYSLSIRDWDDMKGDHVKHYKIRKLDNGGYYITTR AQFETLQQLVQHYSERAAGLCCRLVVPCHKGMPRLTDLSVKTKDVWEIPRESLQLIKR LGNGQFGEVWMGTWNGNTKVAIKTLKPGTMSPESFLEEAQIMKKLKHDKLVQLYAVVS EEPIYIVTEYMNKGSLLDFLKDGEGRALKLPNLVDMAAQVAAGMAYIERMNYIHRDLR SANILVGNGLICKIADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVW SFGILLTELVTKGRVPYPGMNNREVLEQVERGYRMPCPQDCPISLHELMIHCWKKDPE ERPTFEYLQSFLEDYFTATEPQYQPGENL" BASE COUNT 683 a 695 c 716 g 553 t ORIGIN 1 gccgcgctgg tggcggcggc gcgtcgttgc agttgcgcca tctgtcagga gcggagccgg 61 cgaggagggg gctgccgcgg gcgaggagga ggggtcgccg cgagccgaag gccttcgaga 121 cccgcccgcc gcccggcggc gagagtagag gcgaggttgt tgtgcgagcg gcgcgtcctc 181 tcccgcccgg gcgcgccgcg cttctcccag cgcaccgagg accgcccggg cgcacacaaa 241 gccgccgccc gcgccgcacc gcccggcggc cgccgcccgc gccagggagg gattcggccg 301 ccgggccggg gacaccccgg cgccgccccc tcggtgctct cggaaggccc accggctccc 361 gggcccgccg gggacccccc ggagccgcct cggccgcgcc ggaggagggc ggggagagga 421 ccatgtgagt gggctccgga gcctcagcgc cgcgcagttt ttttgaagaa gcaggatgct 481 gatctaaacg tggaaaaaga ccagtcctgc ctctgttgta gaagacatgt ggtgtatata 541 aagtttgtga tcgttggcgg aaattttgga atttagataa tgggctgtgt gcaatgtaag 601 gataaagaag caacaaaact gacggaggag agggacggca gcctgaacca gagctctggg 661 taccgctatg gcacagaccc cacccctcag cactacccca gcttcggtgt gacctccatc 721 cccaactaca acaacttcca cgcagccggg ggccaaggac tcaccgtctt tggaggtgtg 781 aactcttcgt ctcatacggg gaccttgcgt acgagaggag gaacaggagt gacactcttt 841 gtggcccttt atgactatga agcacggaca gaagatgacc tgagttttca caaaggagaa 901 aaatttcaaa tattgaacag ctcggaagga gattggtggg aagcccgctc cttgacaact 961 ggagagacag gttacattcc cagcaattat gtggctccag ttgactctat ccaggcagaa 1021 gagtggtact ttggaaaact tggccgaaaa gatgctgagc gacagctatt gtcctttgga 1081 aacccaagag gtacctttct tatccgcgag agtgaaacca ccaaaggtgc ctattcactt 1141 tctatccgtg attgggatga tatgaaagga gaccatgtca aacattataa aattcgcaaa 1201 cttgacaatg gtggatacta cattaccacc cgggcccagt ttgaaacact tcagcagctt 1261 gtacaacatt actcagagag agctgcaggt ctctgctgcc gcctagtagt tccctgtcac 1321 aaagggatgc caaggcttac cgatctgtct gtcaaaacca aagatgtctg ggaaatccct 1381 cgagaatccc tgcagttgat caagagactg ggaaatgggc agtttgggga agtatggatg 1441 ggtacctgga atggaaacac aaaagtagcc ataaagactc ttaaaccagg cacaatgtcc 1501 cccgaatcat tccttgagga agcgcagatc atgaagaagc tgaagcacga caagctggtc 1561 cagctctatg cagtggtgtc tgaggagccc atctacatcg tcaccgagta tatgaacaaa 1621 ggaagtttac tggatttctt aaaagatgga gaaggaagag ctctgaaatt accaaatctt 1681 gtggacatgg cagcacaggt ggctgcagga atggcttaca tcgagcgcat gaattatatc 1741 catagagatc tgcgatcagc aaacattcta gtggggaatg gactcatatg caagattgct 1801 gacttcggat tggcccgatt gatagaagac aatgagtaca cagcaagaca aggtgcaaag 1861 ttccccatca agtggacggc ccccgaggca gccctgtacg ggaggttcac aatcaagtct 1921 gacgtgtggt cttttggaat cttactcaca gagctggtca ccaaaggaag agtgccatac 1981 ccaggcatga acaaccggga ggtgctggag caggtggagc gaggctacag gatgccctgc 2041 ccgcaggact gccccatctc tctgcatgag ctcatgatcc actgctggaa aaaggaccct 2101 gaagaacgcc ccacttttga gtacttgcag agcttcctgg aagactactt taccgcgaca 2161 gagccccagt accaacctgg tgaaaacctg taaggcccgg gtctgcggag agaggccttg 2221 tcccagaggc tgccccaccc ctccccatta gctttcaatt ccgtagccag ctgctcccca 2281 gcagcggaac cgcccaggat cagattgcat gtgactctga agctgacgaa cttccatggc 2341 cctcattaat gacacttgtc cccaaatccg aacctcctct gtgaagcatt cgagacagaa 2401 ccttgttatt tctcagactt tggaaaatgc attgtatcga tgttatgtaa aaggccaaac 2461 ctctgttcag tgtaaatagt tactccagtg ccaacaatcc tagtgctttc cttttttaaa 2521 aatgcaaatc ctatgtgatt ttaactctgt cttcacctga ttcaactaaa aaaaaaaagt 2581 attattttcc aaaagtggcc tctttgtcta aaacaataaa attttttttc atgttttaac 2641 aaaaacc // LOCUS HUMDNAHEL 3888 bp DNA PRI 26-JUL-1995 DEFINITION Human DNA helicase gene, complete cds. ACCESSION L24544 NID g908916 KEYWORDS helicase. SOURCE human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3888) AUTHORS Zhang,Q. and Montalvo,E.A. TITLE A putative DNA helicase binds the EBV BZLF1 promoter JOURNAL Unpublished (1993) REFERENCE 2 (bases 675 to 1850) AUTHORS Montalvo,E.A. TITLE Direct Submission JOURNAL Submitted (01-APR-1994) Eduardo A. Montalvo, Institute of Biotechnology, Center for Molecular Medicine, 15355 Lambda Drive, San Antonio, TX 78245, USA REFERENCE 3 (bases 1 to 3888) AUTHORS Montalvo,E.A. TITLE Direct Submission JOURNAL Submitted (10-MAY-1995) Eduardo A. Montalvo, Institute of Biotechnology, Center for Molecular Medicine, 15355 Lambda Drive, San Antonio, TX 78245, USA FEATURES Location/Qualifiers source 1..3888 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="lambda gt11" CDS 36..3017 /codon_start=1 /product="DNA helicase" /db_xref="PID:g908917" /translation="MASAAVESFVTKQLDLLELERDAEVEERRSWQENISLKELQSRG VCLLKLQVSSQRTGLYGRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLA TGILTRVTQKSVTVAFDESHDFQLSLDRENSYRLLKLANDVTYRRLKKALIALKKYHS GPASSLIEVLFGRSAPSPASEIHPLTFFNTCLDTSQKEAVLFALSQKELAIIHGPPGT GKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKQRILRLGHPARLLESIQ QHSLDAVLARSDSAQIVADIRKDIDQVFVKNKKTQDKREKSNFRNEIKLLRKELKERE EAAMLESLTSANVVLATNTGASADGPLKLLPESYFDVVVIDECAQALEASCWIPLLKA RKCILAGDHKQLPPTTVSHKAALAGLSLSLMERLAEEYGARVVRTLTVQYRMHQAIMR WASDTMYLGQLTAHSSVARHLLRDLPGVAATEETGVPLLLVDTAGCGLFELEEEDEQS KGNPGEVRLVSLHIQALVDAGVPARDIAVVSPYNLQVDLLRQSLVHRHPELEIKSVDG FQGREKEAVILSFVRSNRKGEVGFLAEDRRINVAVTRARRHVAVICDSRTVNNHAFLK TLVEYFTQHGEVRTAFEYLDDIVPENYSHENSQGSSHAATKPQGPATSTRTGSQRQEG GQEAAAPARQGRKKPAGKSLASEAPSQPSLNGGSPEGVESQDGVDHFRAMIVEFMASK KMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFITVSKRAPRPRAALGPPAG TGGPAPLQPVPPTPAQTEQPPREQRGPDQPDLRTLHLERLQRVRSAQGQPASKEQQAS GQQKLPEKKKKKAKGHPATDLPTEEDFEALVSAAVKADNTCGFAKCTAGVTTLGQFCQ LCSRRYCLSHHLPEIHGCGERARAHARQRISREGVLYAGSGTKNGSLDPAKRAQLQRR LDKKLSELSNQRTSRRKERGT" BASE COUNT 857 a 1132 c 1167 g 732 t ORIGIN 1 acgtcggctt ctaggggccc aggccggcgg cggcgatggc ctcggcagct gtggagagct 61 tcgtgaccaa gcaactggac ctgctggagc ttgagagaga cgcggaggtg gaggagcgca 121 ggtcctggca ggagaacatc tctctgaaag agctccagag ccgaggcgtg tgtttgctga 181 agctgcaggt atccagccag cgcactgggc tgtacggacg gctgctggtc acctttgagc 241 ccaggcgata cgggtccgcg gcagctcttc ccagtaacag ctttacttct ggtgatatcg 301 tgggcctgta cgatgctgct aatgagggca gtcagctggc cactgggatc ttgacccggg 361 tcacccagaa gtcggtcacg gtggcctttg atgagtccca cgatttccag ttgagcttgg 421 accgagagaa ttcctacaga ctgttaaaac ttgccaatga tgtcacttac aggcgactga 481 aaaaagccct gattgctcta aagaagtatc attctggccc agcctcctca ctcatagaag 541 tgctctttgg cagatctgct cccagtcctg ccagtgaaat acacccgctg acattcttca 601 acacctgcct ggacacctcc cagaaagaag cggttttatt tgcgctgtct cagaaagaac 661 ttgccatcat ccatggacct cctggcactg ggaaaaccac gactgtggtt gagatcattc 721 ttcaagctgt gaaacaaggc ttaaaggttc tgtgctgcgc cccctccaac atcgccgtgg 781 acaatctggt ggagcgcctg gctctgtgta agcagcggat tctgcgcctg ggacaccctg 841 cccgcctcct ggagtccatt cagcagcact ccctggatgc ggttttagcg cggagcgaca 901 gtgcccagat tgttgcagat atcaggaagg acatcgacca ggtctttgtg aaaaacaaaa 961 agacccagga taagagagag aaaagtaatt ttcgaaatga aattaagctg ttaagaaaag 1021 aactgaagga gagggaagaa gcagctatgc tcgagagcct cacttcggca aacgtggtcc 1081 ttgcaacaaa cacaggtgcg tctgccgatg gccccctgaa gttgctgccc gagagctact 1141 tcgacgtggt ggtcattgac gagtgtgccc aggccctcga ggcgagctgc tggatccccc 1201 tgctgaaggc cagaaagtgc atcctggcgg gcgatcacaa gcagctgccc cccaccacag 1261 tctctcacaa ggctgcgctg gcaggactgt cactcagcct gatggaacgc ctggctgagg 1321 agtacggcgc gagggtggtg cggacactga cggtgcagta ccgcatgcac caggctatca 1381 tgcgctgggc ctcagacacc atgtaccttg ggcagctcac agcccactct tccgtggcaa 1441 ggcacctcct gagggacctc ccaggtgtgg ctgccacaga agagacgggt gtgcccctgc 1501 tcttggtgga caccgccggc tgcgggctgt ttgagctgga ggaggaggac gaacagtcga 1561 aagggaaccc tggcgaagtc cgcctcgtca gtttgcacat ccaggctctg gtggacgctg 1621 gtgttccagc ccgtgacatt gctgtggtct cgccatacaa cctccaggtg gacctgctca 1681 gacagagcct tgtgcacagg caccctgagc ttgaaatcaa gtctgtcgat ggcttccaag 1741 gccgagagaa ggaggccgtg atactgtcct tcgtcagatc caacaggaaa ggtgaagttg 1801 gttttcttgc tgaggaccgg aggatcaacg tggctgtcac ccgtgcccga cgccacgtgg 1861 cggtcatctg tgactcccgt actgtcaaca accatgcatt tttgaagacc ctggtggagt 1921 atttcacaca gcatggggaa gtacgcacgg cctttgagta tcttgacgat attgtcccag 1981 aaaactattc ccatgagaac tcccagggtt ccagccacgc tgccaccaag ccccagggac 2041 ctgctacgtc caccaggacc ggaagccagc ggcaggaggg aggccaggag gctgcagcac 2101 ctgccagaca gggccggaag aagccggctg ggaagtctct ggcctctgaa gctccatctc 2161 agcccagcct caacggaggc agcccagagg gagtggagag ccaagatggc gtggaccact 2221 tccgggccat gatagtggag ttcatggcca gcaagaagat gcagttggag tttcctcctt 2281 ccctcaattc ccacgacagg ctgcgggtcc accaaatagc cgaggagcac gggctgaggc 2341 acgacagttc cggggaaggg aagaggaggt tcatcactgt gagcaagagg gccccgcgac 2401 cccgagcagc cctgggaccc ccagcaggga ccggtggccc agcccctctc cagccagtgc 2461 cccctacccc tgcgcagaca gagcagcctc ccagggagca gcgtggccca gaccagcctg 2521 atctgaggac gctgcacctg gagagactgc agagggtcag gagcgcgcag gggcagcccg 2581 ccagcaagga gcagcaggcc tcagggcagc agaaacttcc agaaaagaaa aagaaaaaag 2641 ccaaaggaca tccggccaca gatctgccca cggaggagga ctttgaggcc ctggtttctg 2701 ccgccgttaa ggctgataac acctgcggct ttgccaagtg cacagccggc gtcacaaccc 2761 tgggccagtt ctgccagctc tgcagccgcc gctactgcct cagccaccac ctgcccgaga 2821 tccatggctg cggtgagagg gctcgcgccc atgcccggca gagaatcagc cgggaagggg 2881 tcctctatgc cggcagcggg accaagaacg gatccctgga cccagccaag agggcccagc 2941 tgcagaggag gctggataag aagctgagtg agctcagcaa ccagaggacc agccggagga 3001 aggagagggg gacgtgaccg gccgcatcct tgcacgcccc gcggagctct ctccatggta 3061 gcccagggcg ctggcagacc atgctccgcc tccaccaggg ccacagagga gcggaggggc 3121 ctatggggga ggagcggagg gccctgttgg ggaaggttgg gtttttggac cccagggata 3181 agcttttccg atgtcacaat gtggaggaaa gcacctgggg gacaacagtg ctcgtgcagg 3241 tggggcttgg gaaatgcacg tcccttcccc tcactccccg ccaaaaccca catcccagcc 3301 tctggatcct ggggaaggtt ccagtccctg gagaataccc agggcctcaa acttgaagtc 3361 actcctccaa tgtctgggac ttgccagctc agcccgttag gatgagggtg ctgagaggaa 3421 acaggaaaca agactgcgaa tggtgctcag gcagggagca gggagtggcg tttggcttgc 3481 acgttcccat gtggccagat gctggggcca ctttccttct gtctgctggt gactgcagtg 3541 ttccccctcc tcctcaccac ggggctcctg tgagtctggg gggcacctct ttctggcctg 3601 tgcacctctc tctggcttat aaaggtgcct ggcctgtgcc agcccctcct tgttgcgcct 3661 caccgtgggg accaggtgag ccggctctcc cacgtggttg tcccgggaaa gctgccccac 3721 agcctcagca tcttcagcac ttaccgatcc agagcctccc ggccttctcc ggtgtcctgt 3781 accaactctt ctatttaaga gaacctcaga tgatgtacct gagcctcagg gttttgtttc 3841 agagggatat aaattattta aaaattaaat gaaaacgttg cacactgc // LOCUS HUMDRD5A 1673 bp DNA PRI 07-NOV-1994 DEFINITION Human D5 dopamine receptor (DRD5) gene, complete cds. ACCESSION M67439 NID g181830 KEYWORDS G-protein coupled receptor; dopamine D5 receptor; transmembrane protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1673) AUTHORS Grandy,D.K., Zhang,Y.A., Bouvier,C., Zhou,Q.Y., Johnson,R.A., Allen,L., Buck,K., Bunzow,J.R., Salon,J. and Civelli,O. TITLE Multiple human D5 dopamine receptor genes: a functional receptor and two pseudogenes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (20), 9175-9179 (1991) MEDLINE 92021013 FEATURES Location/Qualifiers source 1..1673 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 148..1581 /gene="DRD5" CDS 148..1581 /gene="DRD5" /codon_start=1 /db_xref="GDB:G00-127-548" /product="dopamine receptor D5" /db_xref="PID:g181831" /translation="MLPPGSNGTAYPGQFALYQQLAQGNAVGGSAGAPPLGPSQVVTA CLLTLLIIWTLLGNVLVCAAIVRSRHLRANMTNVFIVSLAVSDLFVALLVMPWKAVAE VAGYWPFGAFCDVWVAFDIMCSTASILNLCVISVDRYWAISRPFRYKRKMTQRMALVM VGLAWTLSILISFIPVQLNWHRDQAASWGGLDLPNNLANWTPWEEDFWEPDVNAENCD SSLNRTYAISSSLISFYIPVAIMIVTYTRIYRIAQVQIRRISSLERAAEHAQSCRSSA ACAPDTSLRASIKKETKVLKTLSVIMGVFVCCWLPFFILNCMVPFCSGHPEGPPAGFP CVSETTFDVFVWFGWANSSLNPVIYAFNADFQKVFAQLLGCSHFCSRTPVETVNISNE LISYNQDIVFHKEIAAAYIHMMPNAVTPGNREVDNDEEEGPFDRMFQIYQTSPDGDPV AESVWELDCEGEISLDKITPFTPNGFH" BASE COUNT 311 a 551 c 471 g 340 t ORIGIN 1 cccggcgcag ctcatggtga gcgcctctgg ggctcgaggg tcccttggct gagggggcgc 61 atcctcgggg tgcccgatgg ggctgcctgg gggtcgcagg gctgaagttg ggatcgcgca 121 caaaccgacc ctgcagtcca gcccgaaatg ctgccgccag gcagcaacgg caccgcgtac 181 ccggggcagt tcgctctata ccagcagctg gcgcagggga acgccgtggg gggctcggcg 241 ggggcaccgc cactggggcc ctcacaggtg gtcaccgcct gcctgctgac cctactcatc 301 atctggaccc tgctgggcaa cgtgctggtg tgcgcagcca tcgtgcggag ccgccacctg 361 cgcgccaaca tgaccaacgt cttcatcgtg tctctggccg tgtctgacct tttcgtggcg 421 ctgctggtca tgccctggaa ggcagtcgcc gaggtggccg gttactggcc ctttggagcg 481 ttctgcgacg tctgggtggc cttcgacatc atgtgctcca ctgcctccat cctgaacctg 541 tgcgtcatca gcgtggaccg ctactgggcc atctccaggc ccttccgcta caagcgcaag 601 atgactcagc gcatggcctt ggtcatggtc ggcctggcat ggaccttgtc catcctcatc 661 tccttcattc cggtccagct caactggcac agggaccagg cggcctcttg gggcgggctg 721 gacctgccaa acaacctggc caactggacg ccctgggagg aggacttttg ggagcccgac 781 gtgaatgcag agaactgtga ctccagcctg aatcgaacct acgccatctc ttcctcgctc 841 atcagcttct acatccccgt tgccatcatg atcgtgacct acacgcgcat ctaccgcatc 901 gcccaggtgc agatccgcag gatttcctcc ctggagaggg ccgcagagca cgcgcagagc 961 tgccggagca gcgcagcctg cgcgcccgac accagcctgc gcgcttccat caagaaggag 1021 accaaggttc tcaagaccct gtcggtgatc atgggggtct tcgtgtgttg ctggctgccc 1081 ttcttcatcc ttaactgcat ggtccctttc tgcagtggac accctgaagg ccctccggcc 1141 ggcttcccct gcgtcagtga gaccaccttc gacgtcttcg tctggttcgg ctgggctaac 1201 tcctcactca accccgtcat ctatgccttc aacgccgact ttcagaaggt gtttgcccag 1261 ctgctggggt gcagccactt ctgctcccgc acgccggtgg agacggtgaa catcagcaat 1321 gagctcatct cctacaacca agacatcgtc ttccacaagg aaatcgcagc tgcctacatc 1381 cacatgatgc ccaacgccgt tacccccggc aaccgggagg tggacaacga cgaggaggag 1441 ggtcctttcg atcgcatgtt ccagatctat cagacgtccc cagatggtga ccctgttgct 1501 gagtctgtct gggagctgga ctgcgagggg gagatttctt tagacaaaat aacacctttc 1561 accccgaatg gattccatta aactgcatta agaaaccccc tcatggatct gcataaccgc 1621 acagacactg acaagcacgc acacacacgc aaatacatgc ctttccagta ctg // LOCUS HUMEL4REC 999 bp DNA PRI 04-AUG-1993 DEFINITION Human melanocortin 4 receptor gene, complete cds. ACCESSION L08603 NID g291977 KEYWORDS melanocortin 4 receptor. SOURCE Homo sapiens (library: lambda EMBL3) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 999) AUTHORS Gantz,I., Miwa,H., Konda,Y., Shimoto,Y., Tashiro,T., Waston,S.J. and DelValle,J. TITLE Molecular cloning, expression, and gene localization of a fourth melanocortin receptor JOURNAL J. Biol. Chem. 268, 15174-15179 (1993) MEDLINE 93315499 FEATURES Location/Qualifiers source 1..999 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda EMBL3" CDS 1..999 /codon_start=1 /product="melanocortin 4 receptor" /db_xref="PID:g291978" /translation="MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQL FVSPEVFVTLGVISLLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETI IITLLNSTDTDAQSFTVNIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNI MTVKRVGIIISCIWAACTVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLM ARLHIKRIAVLPGTGAIRQGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPY CVCFMSHFNLYLILIMCNSIIDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY" BASE COUNT 229 a 243 c 213 g 314 t ORIGIN 1 atggtgaact ccacccaccg tgggatgcac acttctctgc acctctggaa ccgcagcagt 61 tacagactgc acagcaatgc cagtgagtcc cttggaaaag gctactctga tggagggtgc 121 tacgagcaac tttttgtctc tcctgaggtg tttgtgactc tgggtgtcat cagcttgttg 181 gagaatatct tagtgattgt ggcaatagcc aagaacaaga atctgcattc acccatgtac 241 tttttcatct gcagcttggc tgtggctgat atgctggtga gcgtttcaaa tggatcagaa 301 accattatca tcaccctatt aaacagtaca gatacggatg cacagagttt cacagtgaat 361 attgataatg tcattgactc ggtgatctgt agctccttgc ttgcatccat ttgcagcctg 421 ctttcaattg cagtggacag gtactttact atcttctatg ctctccagta ccataacatt 481 atgacagtta agcgggttgg gatcatcata agttgtatct gggcagcttg cacggtttca 541 ggcattttgt tcatcattta ctcagatagt agtgctgtca tcatctgcct catcaccatg 601 ttcttcacca tgctggctct catggcttct ctctatgtcc acatgttcct gatggccagg 661 cttcacatta agaggattgc tgtcctcccc ggcactggtg ccatccgcca aggtgccaat 721 atgaagggag cgattacctt gaccatcctg attggcgtct ttgttgtctg ctgggcccca 781 ttcttcctcc acttaatatt ctacatctct tgtcctcaga atccatattg tgtgtgcttc 841 atgtctcact ttaacttgta tctcatactg atcatgtgta attcaatcat cgatcctctg 901 atttatgcac tccggagtca agaactgagg aaaaccttca aagagatcat ctgttgctat 961 cccctgggag gcctttgtga cttgtctagc agatattaa // LOCUS HUMENIGMA 1725 bp DNA PRI 10-NOV-1994 DEFINITION Human enigma gene, complete cds. ACCESSION L35240 NID g561636 KEYWORDS enigma protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1725) AUTHORS Wu,R.Y. and Gill,G.N. TITLE LIM domain recognition of a tyrosine-containing tight turn JOURNAL J. Biol. Chem. 269 (40), 25085-25090 (1994) MEDLINE 95014287 FEATURES Location/Qualifiers source 1..1725 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SKNMC" /cell_type="neuroblastoma" CDS 85..1452 /note="bp 1099-1251: LIM domain, LIM1; bp 1276-1434: LIM domain, LIM2" /codon_start=1 /product="enigma protein" /db_xref="PID:g561637" /translation="MDSFKVVLEGPAPWGFRLQGGKDFNVPLSISRLTPGGKAAQAGV AVGDWVLSIDGENAGSLTHIEAQNKIRACGERLSLGLSRAQPVQSKPQKASAPAADPP RTPLHPASPSTRRPNPLGPPAPDSPPQQNGQPLRPLVPDASKQRLMENTEDWRPRPGQ ASRVPSASLPTSQAPSSCKTPDEEHLKKSSQVPDRSPSPSLIYTPGALAWPYRPQPYQ PPALGCGPCVCRALCPGQNEHSADPHSQPATPTPLQSRTSIVQAAAGGVPGGGSNNGK TPVCHQCHKVIRGRYLVALGHAYHPEEFVCSQCGKVLEEGGFFEEKGAIFCPPCYDVR YAPSCAKCKKKITGEIMHALKMTWHVHCFTCAACKTPIRNRAFYMEEGVPYCERDYEK MFGTKCHGCDFKIDAGDRFLEALGFSWHDTCFVCAICQINLEGKTFYSKKDRPLCKSH AFSHV" BASE COUNT 355 a 593 c 499 g 278 t ORIGIN 1 gaattcccgt tgctgtcgcc caacgaggct ccctggagcc gacgcagagc agcgccctgg 61 ccgggccaag caggagccgg catcatggat tccttcaaag tagtgctgga ggggccagca 121 ccttgggggt tccggctgca agggggcaag gacttcaatg tgcccctctc catttcccgg 181 ctcactcctg ggggcaaagc ggcgcaggcc ggagtggccg tgggtgactg ggtgctgagc 241 atcgatggcg agaatgcggg tagcctcaca cacatcgaag ctcagaacaa gatccgggcc 301 tgcggggagc gcctcagcct gggcctcagc agggcccagc cggttcagag caaaccgcag 361 aaggcctccg cccccgccgc ggaccctccg cgtacacctt tgcacccagc gtctccctca 421 acaagacggc ccaacccttt gggccccccc gcccctgaca gccccccgca gcagaatgga 481 cagccgctcc gaccgctggt cccagatgcc agcaagcagc ggctgatgga gaacacagag 541 gactggcggc cgcggccggg acaggccagt cgcgttcctt ccgcatcctt gcccacctca 601 caggcaccga gttcatgcaa gaccccggat gaggagcacc tgaagaaatc aagccaggtg 661 ccagacagaa gccccagccc cagcctcatc tacaccccag gagccctggc ctggccctac 721 cgcccccagc cctaccagcc gcccgccctg ggctgtggac cctgcgtttg ccgagcgcta 781 tgccccggac aaaacgagca cagtgctgac ccacacagcc agccagccac gcccacgccg 841 ctgcagagcc gcacctccat tgtgcaggca gctgccggag gggtgccagg agggggcagc 901 aacaacggca agactcccgt gtgtcaccag tgccacaagg tcatccgggg ccgctacctg 961 gtggcgctgg gccacgcgta ccacccggag gagtttgtgt gtagccagtg tgggaaggtc 1021 ctggaagagg gtggcttctt tgaggagaag ggcgccatct tctgcccacc atgctatgac 1081 gtgcgctatg cacccagctg tgccaagtgc aagaagaaga ttacaggcga gatcatgcac 1141 gccctgaaga tgacctggca cgtgcactgc tttacctgtg ctgcctgcaa gacgcccatc 1201 cggaacaggg ccttctacat ggaggagggc gtgccctatt gcgagcgaga ctatgagaag 1261 atgtttggca cgaaatgcca tggctgtgac ttcaagatcg acgctgggga ccgcttcctg 1321 gaggccctgg gcttcagctg gcatgacacc tgcttcgtct gtgcgatatg tcagatcaac 1381 ctggaaggaa agaccttcta ctccaagaag gacaggcctc tctgcaagag ccatgccttc 1441 tctcatgtgt gagccccttc tgcccacagc tgccgcggtg gcccctagcc tgaggggcct 1501 ggagtcgtgg ccctgcattt ctgggtaggg ctggcaatgg ttgccttaac cctggctcct 1561 ggcccgagcc tgggctccct ggccctgccc cacccacctt atcctcccac cccactccct 1621 ccaccaccac agcacaccgg tgctggccac accagccccc tttcacctcc agtgccacaa 1681 taaacctgta cccagctgaa aaaaaaaaaa aaaaaaaaac tcgag // LOCUS HUMEP2AA 9318 bp DNA PRI 04-DEC-1997 DEFINITION Homo sapiens HIV-EP2/Schnurri-2 gene, complete cds. ACCESSION M60119 NID g2661140 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1603 to 9318) AUTHORS Nomura,N., Zhao,M.J., Nagase,T., Maekawa,T., Ishizaki,R., Tabata,S. and Ishii,S. TITLE HIV-EP2, a new member of the gene family encoding the human immunodeficiency virus type 1 enhancer-binding protein. Comparison with HIV-EP1/PRDII-BF1/MBP-1 JOURNAL J. Biol. Chem. 266 (13), 8590-8594 (1991) MEDLINE 91217105 REFERENCE 2 (bases 1 to 9318) AUTHORS Goto-Mandeville,R., Tashiro,S., Harada,J., Takagi,T., Sano,Y., Nomura,T.N., Emi,M., Miyazono,K. and Ishii,S. TITLE Human Schnurri-2 antagonizes BMP and TGF-b signalling by anchoring Smads in cytoplasm JOURNAL Unpublished REFERENCE 3 (bases 1 to 9318) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (16-MAY-1991) Lab. of Gene Structure I, Kazusa DNA Research Institute, 1532-3 Yana, Kisarazu, Chiba 292, Japan REFERENCE 4 (bases 1 to 9318) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (04-DEC-1997) Lab. of Gene Structure I, Kazusa DNA Research Institute, 1532-3 Yana, Kisarazu, Chiba 292, Japan REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..9318 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /tissue_type="umbilical vein" gene 2178..7679 /gene="HIV-EP2/Schnurri-2" CDS 2178..7679 /gene="HIV-EP2/Schnurri-2" /codon_start=1 /db_xref="PID:g182120" /translation="MLKGISSSSLKEKKLSPGDRVGYDYDVCRKPYKKWEDSETPKQN YRDISCLSSLKHGGEYFMDPVVPLQGVPSMFGTTCENRKRRKEKSVGDEEDTPMICSS IVSTPVGIMASDYDPKLQMQEGVRSGFAMAGHENLSHGHTERFDPCRPQLQPGSPSLV SEESPSAIDSDKMSDLGGRKPPGNVISVIQHTNSLSRPNSFERSESAELVACTQDKAP SPSETCDSEISEAPVSPEWAPPGDGAESGGKPSPSQQVQQQSYHTQPRLVRQHNIQVP EIRVTEEPDKPEKEKEAQSKEPEKPVEEFQWPQRSETLSQLPAEKLPPKKKRLRLADM EHSSGESSFESTGTGLSRSPSQESNLSHSSSFSMSFEREETSKLSALPKQDEFGKHSE FLTVPAGSYSLSVPGHHHQKEMRRCSSEQMPCPHPAEVPEVRSKSFDYGNLSHAPVSG AAASTVSPSRERKKCFLVRQASFSGSPEISQGEVGMDQSVKQEQLEHLHAGLRSGWHH GPPAVLPPLQQEDPGKQVAGPCPPLSSGPLHLAQPQIMHMDSQESLRNPLIQPTSYMT SKHLPEQPHLFPHQETIPFSPIQNALFQFQYPTVCMVHLPAQQPPWWQAHFPHPFAQH PQKSYGKPSFQTEIHSSYPLEHVAEHTGKKPAEYAHTKEQTYPCYSGASGLHPKNLLP KFPSDQSSKSTETPSEQVLQEDFASANAGSLQSLPGTVVPVRIQTHVPSYGSVMYTSI SQILGQNSPAIVICKVDENMTQRTLVTNAAMQGIGFNIAQVLGQHAGLEKYPIWKAPQ TLPLGLESSIPLCLPSTSDSVATLGGSKRMLSPASSLELFMETKQQKRVKEEKMYGQI VEELSAVELTNSDIKKDLSRPQKPQLVRQGCASEPKDGLQSGSSSFSSLSPSSSQDYP SVSPSSREPFPPSKEMLSGSRAPLPGQKSSGPSESKESSDELDIDETASDMSMSPQSS SLPAGDGQLEEEGKGHKRPVGMLVRMASAPSGNVADSTLLLTDMADFQQILQFPSLRT TTTVSWCFLNYTKPNYVQQATFKSSVYASWCISSCNPNPSGLNTKTTLALLRSKQKIT AEIYTLAAMHRPGTGKLTSSSAWKQFTQMKPDASFLFGSKLERKLVGNILKERGKGDI HGDKDIGSKQTEPIRIKIFEGGYKSNEDYVYVRGRGRGKYICEECGIRCKKPSMLKKH IRTHTDVRPYVCKLCNFAFKTKGNLTKHMKSKAHMKKCLELGVSMTSVDDTETEEAEN LEDLHKAAEKHSMSSISTDHQFSDAEESDGEDGDDNDDDDEDEDDFDDQGDLTPKTRS RSTSPQPPRFSSLPVNVGAVPHGVPSDSSLGHSSLISYLVTLPSIRVTQLMTPSDSCE DTQMTEYQRLFQSKSTDSEPDKDRLDIPSCMDEECMLPSEPSSSPRDFSPSSHHSSPG YDSSPCRDNSPKRYLIPKGDLSPRRHLSPRRDLSPMRHLSPRKEAALRREMSQRDVSP RRHLSPRRPVSPGKDITARRDLSPRRERRYMTTIRAPSPRRALYHNPPLSMGQYLQAE PIVLGPPNLRRGLPQVPYFSLYGDQEGAYEHPGSSLFPEGPNDYVFSHLPLHSQQQVR APIPMVPVGGIQMVHSMPPALSSLHPSPTLPLPMEGFEEKKGASGESFSKDPYVLSKQ HEKRGPHALQSSGPPSTPSSPRLLMKQSTSEDSLNATEREQEENIQTCTKAIASLRIA TEEAALLGPDQPARVQEPHQNPLGSAHVSIRHFSRPEPGQPCTSATHPDLHDGEKDNF GTSQTPLAHSTFYSKSCVDDKQLDFHSSKELSSSTEESKDPSSEKSQLH" variation 7254 /gene="HIV-EP2/Schnurri-2" BASE COUNT 2667 a 2191 c 2161 g 2299 t ORIGIN 1 gctggcaaaa ttcctagact attcatcatc ccaatttcat cctagttgaa aattttcaaa 61 tgccataaga aatctttata gatttgcact tagcttttgg atggacgttt ctacaatgga 121 gagaactgtg ttatagccct ggtccaagga cattactagc taatgcccat cgactgtggt 181 gtgcgtgtgg aaggttccaa agagaaggag caatcagcaa gtttgcagac accctggaac 241 atggaagcaa ccaagcttta agaagcacag ctttggagac actccatgag tctgcactgc 301 tttcagggga actagcactt aagaccttgt gtaacaaaat ggacactggg gacacagctc 361 taggacaaaa agctacctca aggtctggag aaactgataa agcatcaggt agatggagac 421 aggaacaatc agctgttatt aagatgagca cttttggcag tcatgaagga cagcggcaac 481 cacaaataga gcctgagcaa atcggaaaca cagcatcagc acaactgttt ggttctggga 541 aactggcctc ccctagtgaa gtggtgcagc aagtcgcaga gaagcaatat ccaccgcatc 601 gtccgagtcc ttactcatgc caacactcac tctctttccc tcagcactca ttgccacagg 661 gggtcatgca cagcaccaag ccacatcaga gcctcgaagg tcctccgtgg cttttccctg 721 gccctttgcc atccgttgcc tctgaggact tatttccttt tcctatacat ggccacagtg 781 gtggttatcc tagaaaaaag atttcaagtc tgaaccctgc ttatagccaa tactcccaga 841 aaagtattga acaggcagaa gaggctcaca agaaagagca caaacccaaa aagcctggca 901 agtacatttg cccttactgc agcagagcgt gtgccaaacc tagtgtactg aaaaaacaca 961 tcaggtccca tactggggag cggccatatc catgtatacc ttgtggtttc tctttcaaga 1021 caaagagcaa tttgtacaag cacaggaagt cacatgccca tgcaattaag gcaggattag 1081 tacctttcac agagtcagct gtatctaaat tggacctaga ggctggtttt attgatgtag 1141 aagcagaaat acattcagat ggtgaacaga gtacagacac agatgaggag agttctttat 1201 ttgccgaggc ttctgacaaa atgagtcctg gtccacccat cccactggac attgccagca 1261 gaggcggcta tcatgggtca ttggaagaat cattgggagg tccaatgaag gtgccgattt 1321 tgattatccc taaaagtggg attcctctcc ctaatgaaag ctctcagtat attggccctg 1381 atatgctacc aaatccatct ttaaatacta aggctgatga ttcgcacaca gtcaaacaga 1441 aacttgcact aagactgtca gagaaaaaag gacaagattc tgagccatcg ctcaaccttc 1501 tgagcccgca cagtaaagga agcactgatt ctggttactt ttctcgctca gaaagtgctg 1561 agcagcaaat aagccctccc aacacaaatg caaagtctta tgaagaaatc atctttggaa 1621 aatactgtcg gcttagtccg agaaatgcac tcagtgttac aaccacaagt caggagcgtg 1681 ccgcaatggg taggaagggc ataatggaac cattacctca cgttaacacc aggttagatg 1741 tcaagatgtt tgaagatcct gtttcacagc tgatcccaag caagggagat gtcgacccca 1801 gtcaaacgag catgctgaaa tccactaagt tcaacagtga gtccagacaa ccccagatta 1861 ttccatcatc tatcaggaac gaaggaaaac tttatccagc aaacttccaa ggcagcaacc 1921 cggttctctt agaagctcct gtagactctt caccccttat tagaagcaac tcagtgccaa 1981 cttcttcagc aactaatcta actattcctc cttctttgag aggaagtcac tcatttgatg 2041 aaaggatgac tggttccgac gatgtattct atccagggac cgtgggcata ccccctcagc 2101 gcatgctaag aagacaagcg gcatttgagc tgccttcggt acaggagggc cacgtggaag 2161 tcgagcacca tggcaggatg ttgaagggta tcagcagttc atccctgaag gaaaagaaat 2221 tgtctcctgg ggacagggtt gggtatgact atgatgtctg tcggaaaccc tataagaagt 2281 gggaggactc tgaaacacca aagcaaaact acagggacat ttcctgcttg agttctttaa 2341 agcatggtgg agaatatttc atggatcccg tggtgccatt gcagggagta ccaagcatgt 2401 ttggaactac ctgtgaaaac aggaaacgcc ggaaagagaa gagcgtaggg gatgaagagg 2461 acacgcccat gatctgcagc agcattgtaa gcactcctgt gggcatcatg gcttccgatt 2521 atgaccccaa actgcagatg caggaaggag tcaggagtgg atttgccatg gctggacacg 2581 aaaacctttc tcatggtcac acggaacgct ttgacccatg tcggccccaa ctgcagcctg 2641 gaagtccatc tcttgtgtca gaggagtcac cttcagccat tgattcagac aagatgtcag 2701 acctaggggg caggaaacct cctggaaatg tgatttctgt gattcagcac accaactcac 2761 tgagccgacc caattcattt gaaaggtctg agtcagccga acttgtggct tgcacacagg 2821 ataaagcccc ttccccttca gagacttgtg acagtgagat ttcagaagcc ccagtgagtc 2881 ctgagtgggc tccacctggg gatggtgcag aaagtggggg gaaaccctct ccatctcagc 2941 aggtgcagca gcagtcctat cacacacagc ccaggctagt tcggcaacac aacatccagg 3001 ttcctgagat tcgagtgacc gaggagcctg ataaacctga gaaggagaag gaagcccaga 3061 gcaaagagcc agagaagcct gtggaagaat ttcagtggcc ccagagaagt gagacccttt 3121 cccagctccc cgcggagaag ttgccaccca aaaagaagcg tctgcgactt gcagatatgg 3181 agcactcctc aggggagtcc agctttgaat ccacaggcac aggcctctcc cgcagcccca 3241 gccaagaaag caacttgtcc cacagctcca gtttctccat gtcttttgaa agagaagaaa 3301 ccagtaagct ttctgcactt cctaagcagg atgagtttgg gaagcattca gagtttctga 3361 ctgtccctgc tggttcatac tcattgtctg tcccaggcca tcaccaccag aaagagatgc 3421 gacgctgctc atcagagcag atgccttgtc ctcacccagc ggaagtccca gaagttcgga 3481 gcaaatcatt tgattatggg aatctgtccc atgctcctgt gtcgggagca gcagcctcca 3541 cggtatcacc gtccagggag aggaagaaat gctttctggt gcggcaagct tccttcagtg 3601 gctccccaga aatctcccag ggcgaggttg gcatggatca gagcgtgaag caagagcagc 3661 tggagcacct gcatgctggc ctccggtccg ggtggcacca tggcccgcct gctgtgctgc 3721 ctcctcttca gcaagaggac ccagggaagc aggtggcggg tccttgtccc ccgctgagct 3781 cggggccact gcacctggcc cagccacaga tcatgcacat ggacagtcag gaatctttga 3841 gaaatccctt gatccaacca acatcctata tgacaagcaa gcacttacct gaacagccac 3901 acttatttcc acatcaagag acaattccat tttctccaat ccagaatgcc ttgtttcagt 3961 ttcagtatcc tacagtttgt atggttcatt taccagctca gcagcctccc tggtggcagg 4021 cacatttccc acatcccttt gctcagcacc ctcagaagag ctatggcaag ccctcttttc 4081 agacagaaat ccattcgagc tatcccttag agcatgtggc agagcacact ggaaagaaac 4141 ctgctgagta tgcacacacg aaagagcaga cctacccatg ttattcagga gcatcagggc 4201 tacacccaaa gaaccttctt ccaaagtttc catcagacca gagcagtaag tcaactgaaa 4261 cgccctctga gcaggttctt caagaagatt ttgcctcggc aaatgctggg tctttgcagt 4321 ccctcccagg aacagtggtt cctgttcgga tccagacgca cgtaccatcc tatggaagtg 4381 tcatgtacac aagcatttct cagatacttg ggcagaatag ccctgccatt gtcatatgca 4441 aagtcgatga gaatatgacc caaaggacac tggtcaccaa cgcagccatg caagggatag 4501 gattcaacat tgcccaggtg ctggggcagc atgcgggctt ggagaagtac cccatttgga 4561 aagcacctca gactttgccc ctcggcttag aatcctccat ccccttgtgt ttaccttcca 4621 cctctgacag cgtggccacc ctgggaggta gcaagcgaat gctttctcca gccagtagct 4681 tggagctctt catggaaacc aagcagcaga aaagggtcaa agaagaaaag atgtacggac 4741 agattgtgga ggagcttagt gctgtggagc tgaccaactc agacatcaaa aaggacctct 4801 cccgccccca gaaaccccag ctggttcgac aaggatgtgc ttctgagcca aaagatggct 4861 tgcagtcagg gtcatcttcc ttctcctcgc tgtcgccctc ctcatctcaa gactatcctt 4921 ctgttagccc gtcttccagg gagccattcc cgcccagcaa ggagatgctt tccggttccc 4981 gggcaccact tccggggcag aagtccagtg ggccttctga aagcaaagaa tcttcagatg 5041 aattagatat cgatgagacg gcatcggaca tgagcatgag cccacagagt tcttcattac 5101 cagcaggaga tggtcagctg gaagaggaag ggaagggcca caagcggcct gttggcatgc 5161 tggtccgcat ggcctctgcc cccagcggga acgtggcaga ctcaactctt cttctcacgg 5221 acatggcaga tttccagcag attcttcagt tccccagtct gcggacaaca actactgtga 5281 gttggtgctt cttgaattat acaaaaccca attatgtgca acaggccacc ttcaaatcct 5341 cggtttatgc ttcatggtgc attagttcct gtaatccaaa cccatcagga ttgaacacca 5401 agaccacgct ggctcttctg aggtccaagc aaaaaatcac tgcagaaatt tatactctgg 5461 ctgctatgca taggcctgga accggcaagc ttacatcatc aagtgcttgg aagcagttta 5521 ctcagatgaa acctgatgcg tcctttttat ttggcagcaa actagaaagg aaactagtgg 5581 gaaatatctt aaaggaaaga gggaaaggag atattcatgg agataaagat attggatcca 5641 aacaaactga gccaatccga attaaaatat ttgaaggagg gtacaaatcg aatgaagatt 5701 atgtatatgt cagaggacgt ggccggggaa agtacatttg tgaagaatgt gggattcgct 5761 gtaagaagcc aagcatgctc aaaaaacaca tccgtaccca tactgatgtt cggccttatg 5821 tatgcaagtt atgtaacttt gccttcaaaa cgaaaggaaa cctaacgaag catatgaaat 5881 ctaaagcaca catgaaaaaa tgcctggaat tgggagtctc aatgacatcg gtggatgata 5941 cagaaactga ggaagcagaa aatttggaag atttgcacaa agcagcagag aagcatagca 6001 tgtccagcat ttcaactgat catcagttct ccgatgctga ggaatcagat ggtgaggatg 6061 gagatgataa tgatgatgat gatgaagatg aagatgactt tgacgaccag ggagatttaa 6121 caccaaaaac aagatcaaga agcaccagtc ctcagcctcc tagattctcc tccttgcctg 6181 tgaatgttgg cgccgtaccc cacggggttc cttcagatag ttccctggga cattcttcgt 6241 tgatcagcta tttggttact ttgccaagta ttcgagttac tcagcttatg acacccagtg 6301 attcatgtga agatacccag atgacagaat accagaggct attccagagc aaaagtacgg 6361 actcagaacc agacaaagac agattggaca tacctagttg tatggatgag gagtgcatgc 6421 taccttcaga gccaagctcc tctcccaggg acttctcacc ctcaagccac cattcctctc 6481 caggatatga ttcttcaccc tgtcgagata attcaccaaa gaggtatctg atacccaaag 6541 gagatttatc tcccaggaga catttatcac ctaggagaga tctgtcaccc atgagacatc 6601 tttcaccaag aaaggaagct gcattgagaa gagagatgtc ccaaagagat gtttcaccaa 6661 gaaggcattt gtctccaagg aggccagtgt ctcctgggaa agatatcaca gcaagaagag 6721 acctctcccc tagaagagag agaagataca tgaccacaat aagagcgcca tctcccagaa 6781 gggctttata ccataaccca ccattgtcca tgggacagta tttgcaagca gagccaattg 6841 tattggggcc tcctaattta agaagaggat tacctcaggt tccttacttc agtctctatg 6901 gagaccaaga aggtgcttat gaacatccag gctccagcct tttccctgag ggtcctaatg 6961 actatgtctt cagtcatctt ccactccact ctcagcaaca agtgcgagcc cctatcccca 7021 tggtgcccgt tggtgggatc cagatggttc actccatgcc gccagccctt tccagtttac 7081 atccttcacc cacattgccc ctgccaatgg agggctttga ggagaagaaa ggcgcgtcag 7141 gggagtcctt ctccaaggac ccctatgtgc tttctaagca gcatgagaag cgaggtcctc 7201 acgctttgca gtcatctggt ccgcctagca ctccctcctc tcctcggctg ttgatgaaac 7261 agagcacttc ggaagacagc ctaaacgcaa cagagcggga acaggaggaa aatatacaga 7321 cttgtacaaa agccattgcc tctctccgga ttgccacgga agaggcagct ctgctcgggc 7381 cagatcagcc agcgcgggtg caggagcccc accagaaccc cctgggaagt gcacatgtta 7441 gcattagaca ctttagtaga cctgagccag gtcagccctg tacctcagcc acccaccctg 7501 acttgcatga tggtgaaaag gacaattttg gtacatcaca gactccatta gctcactcca 7561 cgttttacag caagagttgt gtggatgaca agcagttgga ctttcacagc agcaaggaat 7621 tatcttcaag cacagaggaa agcaaagatc cttcatcaga aaagagtcag ctacattgat 7681 ctatgatgca tggagacttt catttccaca ttttcccatt tttttgtttt tgtttttcta 7741 gaaatggagg taatccagtt tatagcatgc ctgtcctaag ttacagtagt ttgctattat 7801 atatactttt gttatatcaa aagaattagg taaattaaca agtcatcatg agcctgacca 7861 aaacaaaatt tgaaattaac ctattgggtc tggtactttt aaaattgtac agatgtttgt 7921 gccttttctt tactttgctt atattcttat aagcattttt tagcagtaat ttgtacatat 7981 tttagaattt gtgtatctgc tttgtaataa atgtaatttc tttccttttt tggacacttg 8041 gatctaaatg atgtaaagca aaacagcatc aatatatatg tgaggttgca ctaaaacata 8101 tttttatatg attaaaactg aacagctttt atgtacagct ctgattctgt aatactaata 8161 tttatttact ttgtttcata aattgtacat tttttcttaa tgttgtggat tgcttttcta 8221 tgtgaagcat gggatttact gttgcgtaac tagaacaaaa atgtacattg taaacaagat 8281 atttaaacta gagtatctta ttctgcactt atgcattagt taaaaaaaga taaaggatgt 8341 atcagtcagt tcttaactct tgtatatttt tttgtctctt gtttgctgga ttgactataa 8401 cttaagtgct gattgtgatt ttaaaatgat agtaccgtaa agcattaaag taaacaatgt 8461 gctattgtga gttttttcaa agctttataa atcagttata aataatatta aaagtatttg 8521 gtcttatgtg aacatgttga tctatatact catctaaaaa tatgggaaaa cattccaccc 8581 catgtaaata tgtacaagtg catcactggt acaattttat gtaactcagt tggacactag 8641 gttgccacag acctatgcta ggtgtcttta aaaaattaag gtgacaaagc acatgggact 8701 gtgtagagct tggttatcgg ccggcccggt ggcttggcag gcagtgctgt gcgctgctca 8761 tggagaagac ctgggcttag caatctcctt agttcttgct acacaggatg gtgactggaa 8821 ctaaggctac acagagggtc gcacttggac tctgagggtt gggtgtggaa gggggaaaag 8881 gggatggaga cctgctcccc agctcttcct gtcagccggt ttacatggga acagggttaa 8941 catctgtgtt aggggaggtc accttaccct ttttcatagg ggaagagtgt cacactcctg 9001 gctatctcag ggggaatggg gaaaagaatc tttcaagggc aaagaactcg tgggaggatg 9061 tctgttgtat gtaatactca caatggcttt tggttagtgt tgaaggtggg aagagcattt 9121 gtaggtccag aagagtgaaa gagagggagg ggtgcagcaa catgtgcaca ggcacgcaca 9181 tgtgtgcacg cacacataca atctgggtta tctttgtgct atatagtgga ttataattct 9241 gtgaaaccaa gtttgtatat tgaattacat taaggagtgt tctttaaaaa gagaaataaa 9301 tatacaatta catgcttg // LOCUS HUMEVI22 1798 bp DNA PRI 08-NOV-1994 DEFINITION Human EV12 protein gene, exon 1. ACCESSION M55267 NID g182278 KEYWORDS EVI2 protein. SEGMENT 2 of 2 SOURCE Human lymphoblastoid cell DNA, and cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1798) AUTHORS Cawthon,R.M., O'Connell,P., Buchberg,A.M., Viskochil,D., Weiss,R.B., Culver,M., Stevens,J., Jenkins,N.A., Copeland,N.G. and White,R. TITLE Identification and characterization of transcripts from the neurofibromatosis 1 region: the sequence and genomic structure of EVI2 and mapping of other transcripts JOURNAL Genomics 7 (4), 555-565 (1990) MEDLINE 90353953 FEATURES Location/Qualifiers source 1..1798 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoblast" /map="17q11.2" mRNA join(M55266:187..383,198..1563) /partial /gene="EVI2A" /note="G00-125-191" /product="EVI2 protein" gene join(M55266:187..604,1..1563) /gene="EVI2A" intron order(M55266:384..604,1..197) /gene="EVI2A" /note="G00-125-191" CDS 220..918 /gene="EVI2A" /codon_start=1 /number=1 /db_xref="GDB:G00-125-191" /product="EVI2 protein" /db_xref="PID:g182280" /translation="MEHTGHYLHLAFLMTTVFSLSPGTKANYTRLWANSTSSWDSVIQ NKTGRNQNENINTNPITPEVDYKGNSTNMPETSHIVALTSKSEQELYIPSVVSNSPST VQSIENTSKSHGEIFKKDVCAENNNNMAMLICLIIIAVLFLICTFLFLSTVVLANKVS SLRRSKQVGKRQPRSNGDFLASGLWPAESDTWKRTKQLTGPNLVMQSTGVLTATRERK DEEGTEKLTNKQIG" sig_peptide 220..297 /partial /gene="EVI2A" /note="G00-125-191" mat_peptide 298..915 /partial /gene="EVI2A" /note="G00-125-191" /product="EVI2 protein" BASE COUNT 658 a 310 c 303 g 527 t ORIGIN 1 taatagaaat taaaatgctt cttcatacat agctgaatag aaaagaattt gttgagaagg 61 aattcagggt agcgaatatt aggcataagc ttgtagttta cttgtaacat ctcaacacta 121 tcttttaact acaattacca aaaactagga tccattattc tttcacaaac taacaaatta 181 tattgctatc ccaacagatt gccaagtatg cccacggaca tggaacacac aggacattac 241 ctacatcttg cctttctgat gacaacagtt ttttctttgt ctcctggaac aaaagcaaac 301 tatacccgtc tgtgggctaa cagtacttct tcctgggatt cagttattca aaacaagaca 361 ggcagaaacc aaaatgaaaa cattaacaca aaccctataa ctcctgaagt agattataaa 421 ggtaattcta caaacatgcc tgaaacatct cacatcgtag ctttaacttc taaatctgaa 481 caggagcttt atataccttc tgtcgtcagc aacagtcctt caacagtaca gagcattgaa 541 aacacaagca aaagtcatgg tgaaattttc aaaaaggatg tctgtgcgga aaacaacaac 601 aacatggcta tgctaatttg cttaattata attgcagtgc tttttcttat ctgtaccttt 661 ctatttctat caactgtggt tttggcaaac aaagtctctt ctctcagacg atcaaaacaa 721 gtaggcaagc gtcagcctag aagcaatggc gattttctgg caagcggtct atggcccgct 781 gaatcagaca cttggaaaag aacaaaacag ctcacaggac ccaacctagt gatgcaatct 841 actggagtgc tcacagctac aagggaaaga aaagatgaag aaggaactga aaaacttact 901 aacaaacaga taggttagtg aagaaaaatg caaagtagca atgagaaggc ttatggagta 961 aaaatgaagt cagttggtat ttaatcccaa agtgttgttc tgattatcta aaatttgaca 1021 tggtagacct tgcaatttag aatcaagcag gtgagacagg gagaagtatg cctgcttaat 1081 tatttaaact gtgtactttt gttttgacac tgaatatttt aaaaagcaaa taataaaata 1141 actaagcatt tgaggaaaat tttaaggata aattgaggaa actgattaat agagatagca 1201 agggataatt aaataaatat tccctatgta gcaacagtgg ttagatgatc tttgtctgaa 1261 tgtaataaaa ctttgaatag ttttagtgtg tccttaaagc caagtatatg ctttaacatc 1321 aaatggaagt caaattccta atgcatagat agagagagct aaactgtgta atttaatggt 1381 atcttccttg ctggatgtgg cagaatccac accagcttat caaccaacac agctaatttt 1441 agaataggtc ctttatcttt ccatatggca cacgtaagaa agtgtttttc tactattaat 1501 attaaattaa aacctttact tttgtataat aaattaaaac tcagaataaa cctgtgacca 1561 cgtatatttg cattcacttt attactttag agaacacatt gtaaagatca ataagaaata 1621 gagcacaact aaaataaata agatttatag ccacaccaat aggctagtgt aaacgaaagt 1681 atgtttcact gtttatgatt aataatattc atcttttcta taaatactac ttactggaac 1741 attaacaaca agtccaaagg ttgattaatt ttgactcagg agcagagcta tgattata // LOCUS HUMEVI2B3P 2158 bp DNA PRI 22-MAR-1991 DEFINITION Human EVI2B3P gene, exon and complete cds. ACCESSION M60830 NID g182282 KEYWORDS . SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2158) AUTHORS Cawthon,R.M., Andersen,L.B., Buchberg,A.M., Xu,G.F., O'Connell,P. and Viskochil,D. TITLE cDNA sequence and genomic structure of EV12B, a gene lying within an intron of the neurofibromatosis type 1 gene JOURNAL Genomics 9, 446-460 (1991) MEDLINE 91236164 FEATURES Location/Qualifiers source 1..2158 /organism="Homo sapiens" /db_xref="taxon:9606" exon 81..2158 /note="in EVI2B cDNAs this sequence is immediately followed by a polyA tail; putative" CDS 102..1448 /note="open reading frame" /codon_start=1 /db_xref="PID:g182283" /translation="MDPKYFILILFCGHLNNTFFSKTETITTEKQSQPTLYTSSMSQV LANSQNTTGNPLGQPTQFSDTFSGQSISPAKVTAGQPTPAVYTSSEKPEAHTSAGQPL AYNTKQPTPIANTSSQQAVFTSARQLPSARTSTTQPPKSFVYTFTQQSSSVQIPSRKQ ITVHNPSTQPTSTVKNSPRSTPGFILDTTSNKQTPQKNNYNSIAAILIGVLLTSMLVA IIIIVLWKCLRKPVLNDQNWAGRSPFADGETPDICMDNIRENEISTKRTSIISLTPWK PSKSTLLADDLEIKLFESSENIEDSNNPKTEKIKDQVNGTSEDSADGSTVGTAVSSSD DADLPPPPPLLDLEGQESNQSDKPTMTIVSPLPNDSTSLPPSLDCLNQDCGDHKSEII QSFPPLDSLNLPLPPVDFMKNQEDSNLEIQCQEFSIPPNSDQDLNESLPPPPAELL" BASE COUNT 755 a 466 c 307 g 630 t ORIGIN 1 aatataatga aaagtcaaag ttttaactag acaccaatga cgcctaactg tctttctctt 61 tcattataaa cccgctatag ataacgagga aatattctga aatggatccc aaatatttca 121 tcttaatttt gttttgtgga cacctgaaca atacattttt ttcaaagaca gagacaatta 181 caacagagaa gcagtcacag cctaccttat acacatcatc aatgtcacag gtattggcta 241 attctcaaaa cacaacaggg aatcctttgg gtcaaccaac acaattcagc gacacttttt 301 ctggacaatc aatatcacct gccaaagtca ctgctggaca accaacacca gctgtctata 361 cctcttctga aaaaccagaa gcacatactt ctgctggaca accacttgcc tacaacacca 421 aacaaccaac accaatagcc aacacctcct cccagcaagc cgtgttcacc tctgccagac 481 aactaccatc tgcccgtact tctaccacac aaccaccaaa gtcatttgtc tatactttta 541 ctcaacaatc atcatctgtc cagatccctt ctagaaaaca aataactgtt cataatccat 601 ccacacaacc aacatcaact gtcaaaaatt cacctaggag tacaccagga tttatcttag 661 atactaccag taacaaacaa accccacaaa aaaacaatta taattcaata gctgccatac 721 taattggtgt acttctgact tctatgttgg tagctataat catcattgta ctttggaaat 781 gcttaaggaa accagtttta aatgatcaaa attgggcagg tagatctcca tttgctgatg 841 gagaaacccc tgacatttgt atggataaca tcagagaaaa tgaaatatcc acaaaacgta 901 catcaatcat ttcacttaca ccctggaaac caagcaaaag cacactttta gcagatgact 961 tagaaattaa gttgtttgaa tcaagtgaaa acattgaaga ctccaacaac cccaaaacag 1021 agaaaataaa agatcaagta aatggtacat cagaagatag tgctgatggt tcaacagttg 1081 gaactgctgt ttcttcttca gatgatgcag atctgcctcc accacctccc cttctggatt 1141 tggaaggaca ggaaagtaac caatctgaca aacccacaat gacaattgta tctcctcttc 1201 caaatgattc tactagtctc cctccatctc tggactgtct caatcaagac tgtggagatc 1261 ataaatctga gataatacaa tcatttccac cgcttgactc acttaacttg cccctgccac 1321 cagtagattt tatgaaaaac caagaagatt ccaaccttga gatccagtgt caggagttct 1381 ctattcctcc caactctgat caagatctta atgaatccct gccacctcca cctgcagaac 1441 tgttataaat attacaactt gctttttagc tgatcttcca tcctcaaatg actctttttt 1501 ctttatatgt taacatatat aaaatggcaa ctgatagtca attttgattt ttattcagga 1561 actatctgaa atctgctcag agcctatgtg catagatgaa actttttttt aaaaaaagtt 1621 atttaacagt aatctattta ctaattatag tacctatctt taaagtatag tacattttac 1681 atatgtaaat ggtatgtttc aataatttaa gaactctgaa acaatctaca tatacttatt 1741 acccagtaca gttttttttc ccctgaaaag ctgtgtataa aattatggtg aataaacttt 1801 tatgtttcca tttcaaagac cagggtggag aggaataaga gactaagtat atgcttcaag 1861 ttttaaatta atacctcaag tattaaataa atattccaag tttgtgggaa tgggagatta 1921 aaatgcatgt ttgagaatag agaaattttc ttcttggttt cattgcaaag agtaaaacaa 1981 acatgttaaa acatcaactg aagggttggg ttaggaacat ttaccctgaa aaaaatatga 2041 ggatgcatca taaaatgtaa atattttcct accatgttgg gggggcacaa attttaaaac 2101 tggcatcttt acaagtttct tctttataaa cacccaaaca aaatcaagtt ttataaag // LOCUS HUMFCSTRN 1310 bp DNA PRI 16-APR-1996 DEFINITION Human alpha-(1,3) fucosyltransferase gene, complete cds. ACCESSION M81485 NID g182490 KEYWORDS alpha-(1,3) fucosyltransferase; blood group lewis alpha-4-fucosyltransferase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1310) AUTHORS Weston,B.W., Nair,R.P., Larsen,R.D. and Lowe,J.B. TITLE Isolation of a novel human alpha (1,3)fucosyltransferase gene and molecular comparison to the human Lewis blood group alpha (1,3/1,4)fucosyltransferase gene. Syntenic, homologous, nonallelic genes encoding enzymes with distinct acceptor substrate specificities JOURNAL J. Biol. Chem. 267 (6), 4152-4160 (1992) MEDLINE 92156161 FEATURES Location/Qualifiers source 1..1310 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 121..1245 /codon_start=1 /product="alpha(1,3)-fucosyltransferase" /db_xref="PID:g1280209" /translation="MDPLGPAKPQWLWRRCLAGLLFQLLVAVCFFSYLRVSRDDATGS PRPGLMAVEPVTGAPNGSRCQDSMATPAHPTLLILLWTWPFNTPVALPRCSEMVPGAA DCNITADSSVYPQADAVIVHHWDIMYNPSANLPPPTRPQGQRWIWFSMESPSNCRHLE ALDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWKPDSA RVRYYQSLQAHLKVDVYGRSHKPLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRN ALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFR WRETLRPRSFSWALAFCKACWKLQQESRYQTVRSIAAWFT" BASE COUNT 234 a 455 c 371 g 250 t ORIGIN 1 tttatgacaa gctgtgtcat aaattataac agcttctctc aggacactgt ggccaggaag 61 tgggtgatct tccttaatga ccctcactcc tctctcctct cttcccagct actctgaccc 121 atggatcccc tgggcccagc caagccacag tggctgtggc gccgctgtct ggccgggctg 181 ctgtttcagc tgctggtggc tgtgtgtttc ttctcctacc tgcgtgtgtc ccgagacgat 241 gccactggat cccctaggcc agggcttatg gcagtggaac ctgtcaccgg ggctcccaat 301 gggtcccgct gccaggacag catggcgacc cctgcccacc ccaccctact gatcctgctg 361 tggacgtggc cttttaacac acccgtggct ctgccccgct gctcagagat ggtgcccggc 421 gcggccgact gcaacatcac tgccgactcc agtgtgtacc cacaggcaga cgcggtcatc 481 gtgcaccact gggatatcat gtacaacccc agtgccaacc tcccgccccc caccaggccg 541 caggggcagc gctggatctg gttcagcatg gagtccccca gcaactgccg gcacctggaa 601 gccctggacg gatacttcaa tctcaccatg tcctaccgca gcgactccga catcttcacg 661 ccctacggct ggctggagcc gtggtccggc cagcctgccc acccaccgct caacctctcg 721 gccaagaccg agctggtggc ctgggcggtg tccaactgga agccggactc ggccagggtg 781 cgctactacc agagcctgca ggctcatctc aaggtggacg tgtacggacg ctcccacaag 841 cccctgccca aggggaccat gatggagacg ctgtcccggt acaagttcta tctggccttc 901 gagaactcct tgcaccccga ctacatcacc gagaagctgt ggaggaacgc cctggaggcc 961 tgggccgtgc ccgtggtgct gggccccagc agaagcaact acgagaggtt cctgccgccc 1021 gacgccttca tccacgtgga tgacttccag agccccaagg acctggcccg gtacctgcag 1081 gagctggaca aggaccacgc ccgctacctg agctactttc gctggcggga gacgctgcgg 1141 cctcgctcct tcagctgggc actggctttc tgcaaggcct gctggaagct gcagcaggaa 1201 tccaggtacc agacggtgcg cagcatagcg gcttggttca cctgagaggc cggcatgggg 1261 cctgggctgc cagggacctc actttcccag ggcctcacct acctagggtc // LOCUS HUMFKBP12D 436 bp DNA PRI 31-DEC-1994 DEFINITION Human FKBP-12 protein (FKBP-12), 5' flank and complete cds. ACCESSION M80199 J05340 NID g182632 KEYWORDS T-cell binding protein; rotamase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 436) AUTHORS DiLella,A.G. and Craig,R.J. TITLE Exon organization of the human FKBP-12 gene: correlation with structural and functional protein domains JOURNAL Biochemistry 30 (35), 8512-8517 (1991) MEDLINE 91363332 FEATURES Location/Qualifiers source 1..436 /organism="Homo sapiens" /db_xref="taxon:9606" gene 110..436 /gene="FKBP12" CDS 110..436 /gene="FKBP12" /codon_start=1 /product="FKBP-12 protein" /db_xref="PID:g182633" /translation="MGVQVETISPGDGRTFPKRGQTCVVHYTGMLEDGKKFDSSRDRN KPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYGATGHPGIIPPHATLVFDV ELLKLE" BASE COUNT 96 a 132 c 131 g 77 t ORIGIN 1 gtgcgcagcg acgcgccgag gtactagcag agccgtggaa ccgccgccag gtcgctgttg 61 gtccacgccg cccgtcgcgc cgcccgcccg ctcagcgtcc gccgccgcca tgggagtgca 121 ggtggaaacc atctccccag gagacgggcg caccttcccc aagcgcggcc agacctgcgt 181 ggtgcactac accgggatgc ttgaagatgg aaagaaattt gattcctccc gggacagaaa 241 caagcccttt aagtttatgc taggcaagca ggaggtgatc cgaggctggg aagaaggggt 301 tgcccagatg agtgtgggtc agagagccaa actgactata tctccagatt atgcctatgg 361 tgccactggg cacccaggca tcatcccacc acatgccact ctcgtcttcg atgtggagct 421 tctaaaactg gaatga // LOCUS HUMFRPL2 1198 bp DNA PRI 08-NOV-1994 DEFINITION Human N-formyl receptor-like 2 protein (FPRL2) gene, complete cds. ACCESSION L14061 NID g292034 KEYWORDS N-formyl peptide receptor-like 2 protein; transmembrane protein. SOURCE Homo sapiens (tissue library: lambda FIX; Stratagene) adult DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1198) AUTHORS Durstin,M., Gao,J.L., McDermott,D. and Murphy,P.M. TITLE Structural and functional analysis of the human N-formyl peptide receptor gene cluster JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..1198 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_lib="lambda FIX; Stratagene" /map="Unassigned" gene 80..1141 /gene="FPRL2" CDS 80..1141 /gene="FPRL2" /codon_start=1 /db_xref="GDB:G00-128-855" /product="N-formyl peptide receptor-like 2 protein" /db_xref="PID:g292035" /translation="METNFSIPLNETEEVLPEPAGHTVLWIFSLLVHGVTFVFGVLGN GLVIWVAGFRMTRTVNTICYLNLALADFSFSAILPFRMVSVAMREKWPFGSFLCKLVH VMIDINLFVSVYLITIIALDRCICVLHPAWAQNHRTMSLAKRVMTGLWIFTIVLTLPN FIFWTTISTTNGDTYCIFNFAFWGDTAVERLNVFITMAKVFLILHFIIGFSVPMSIIT VCYGIIAAKIHRNHMIKSSRPLRVFAAVVASFFICWFPYELIGILMAVWLKEMLLNGK YKIILVLINPTSSLAFFNSCLNPILYVFMGRNFQERLIRSLPTSLERALTEVPDSAQT SNTDTTSASPPEETELQAM" BASE COUNT 280 a 301 c 264 g 353 t ORIGIN 1 aagttaatga atagttgtat tgtaagatgg tgtcacagct gagaaatggc cattgctgaa 61 atgtttcagg tgtgggaaga tggaaaccaa cttctccatt cctctgaatg aaactgagga 121 ggtgctccct gagcctgctg gccacaccgt tctgtggatc ttctcattgc tagtccacgg 181 agtcaccttt gtcttcgggg tcctgggcaa tgggcttgtg atctgggtgg ctggattccg 241 gatgacacgc acagtcaaca ccatctgtta cctgaacctg gccctagctg acttctcttt 301 cagtgccatc ctaccattcc gaatggtctc agtcgccatg agagaaaaat ggccttttgg 361 ctcattccta tgtaagttag ttcatgttat gatagacatc aacctgtttg tcagtgtcta 421 cctgatcacc atcattgctc tggaccgctg tatttgtgtc ctgcatccag cctgggccca 481 gaaccatcgc accatgagtc tggccaagag ggtgatgacg ggactctgga ttttcaccat 541 agtccttacc ttaccaaatt tcatcttctg gactacaata agtactacga atggggacac 601 atactgtatt ttcaactttg cattctgggg tgacactgct gtagagaggt tgaacgtgtt 661 cattaccatg gccaaggtct ttctgatcct ccacttcatt attggcttca gcgtgcctat 721 gtccatcatc acagtctgct atgggatcat cgctgccaaa attcacagaa accacatgat 781 taaatccagc cgtcccttac gtgtcttcgc tgctgtggtg gcttctttct tcatctgttg 841 gttcccttat gaactaattg gcattctaat ggcagtctgg ctcaaagaga tgttgttaaa 901 tggcaaatac aaaatcattc ttgtcctgat taacccaaca agctccttgg ccttttttaa 961 cagctgcctc aacccaattc tctacgtctt tatgggtcgt aacttccaag aaagactgat 1021 tcgctctttg cccactagtt tggagagggc cctgactgag gtccctgact cagcccagac 1081 cagcaacaca gacaccactt ctgcttcacc tcctgaggag acggagttac aagcaatgtg 1141 aggtcgggga tatttttggg ctctgtctct ttctaccctg cgttcagcga aaaaaaaa // LOCUS HUMFSHD 3303 bp DNA PRI 01-AUG-1996 DEFINITION Human facioscapulohumeral muscular dystrophy (FSHD) gene region, D4Z4 tandem repeat unit. ACCESSION D38024 NID g871846 KEYWORDS facioscapulohumeral muscular dystrophy; FSHD; D4Z4 repeat family; microsatellite; homeodomain; LSau-like sequence. SOURCE Homo sapiens DNA, clone:c51. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3303) AUTHORS Lee,J.H., Goto,K., Matsuda,C. and Arahata,K. TITLE Characterization of a tandemly repeated 3.3-kb KpnI unit in the facioscapulohumeral muscular dystrophy (FSHD) gene region on chromosome 4q35 JOURNAL Muscle Nerve 2, S6-S13 (1995) MEDLINE 95258038 REFERENCE 2 (bases 1 to 3303) AUTHORS Lee,J. TITLE Direct Submission JOURNAL Submitted (22-AUG-1994) to the DDBJ/EMBL/GenBank databases. Je Hyeon Lee, National Institute of Neuroscience, NCNP, Dept. of neuromusclar Research; 4-1-1, Ogawa-Higashi, Kodaira, Tokyo 187, Japan (Tel:0423-46-1712, Fax:0423-46-1742) FEATURES Location/Qualifiers source 1..3303 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4q35-qter" /clone="c51" repeat_unit 1..3303 /note="3303bps KpnI fragment in FSHD gene region contains other repeat and sequence motive" /function="unknown" /rpt_family="D4Z4" /rpt_type=tandem repeat_region 1..300 /note="region with similarity to Lsau(GenBank X59423), Part of tandem repeat D4Z4" /function="unknown" /rpt_type=other misc_feature 393..578 /note="extremly G-rich region in 186bps. Part of tandem repeat locus D4Z4" /function="unknown" satellite 398..578 /note="microsatellite of GGAGG, Part of tantem repeat locus D4Z4" /rpt_type=direct CDS 417..2978 /note="ORF" /codon_start=1 /db_xref="PID:d1007805" /db_xref="PID:g1435038" /translation="MERGTGETRGAEGTLGGRQGGREAGRNGGRDRATQGLGAGPREP GTDGGRKAGRKSGPRPPGVAGPPASGKTVSVRRGLRAGPTAAAPAGGAPPIRPGSGAQ GVGGFLRDKRPGLGLPSGLHPRGSQTAHPQAEPCNAARGPQTRPRRSHTQDDGGVILV SEWLCPPEGGLLLTSLRPPKGWPCRLFAPGALRHPETCREGCKPGMVPSLSLPGSKPA TLQTPPRCRTRESIVRPSRRGGISSLGSRSGLLRGNEREPHACVCETVPATATPTGIA SFTERGPGTLKTPTEVQFHTPLHPPRLVSPCCRRVGAQRAASRSRGIPGEVRRAGPRN APPSPLPPLPLPLRLSGPTTTTATTPPPPPPPPTTTTTTTPPAGPRPRRPGSLPGWGG LSQGGSPPFMKGWSLPACGPLQGRLAGWLAVRAGLLAAPAAVHSPAEVHGSPPASLCP RPSVKFRPGLTAMALPTPSDSTLPAEARGRGRPRRLVWTPSQSEALRACFERNPYPGI ATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRGPPEGRRKRTAVTGS QTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHPGQGGRAPAQAGG LCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQAARAAPALQP SQAAPAEGVSQPAPARGDFAYAAPAPPEPGRSPTLRLLGGLRTRAKAGRTGTRSATAC RAPARWHSLGPLKRGRRPRGACATHVPGESVVGLGPGSPGRRGGVGTPSRGSSTSPAR APGTPPPPRGRGRCKASRRPPRRSRSRRPGLHSPAACCWMSSWRARSFCSRRNLS" misc_feature 1405..1554 /note="Extremly C-rich region(69%) in 186bps. Part of tandem repeat locus 4DZ4." /function="unknown" satellite 1470..1536 /note="microsatellite of CCA, Part of tandem repeat locus D4Z4" /rpt_type=direct satellite 1590..1703 /note="microsatellite of GGCT, Part of tandem repeat locus D4Z4" /rpt_type=direct misc_feature 1863..2037 /note="paired type homeodomain seq I, translation;PRRLVWTPSQSEALRACFERNPYPGIATRERLAQAIGIPEPRVQIW FQNERSRQLR" /function="unknown" misc_feature 2082..2264 /note="paired type homeodomain seq II, translationGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQI WFQNRRARHPGQG" /function="unknown" BASE COUNT 486 a 1257 c 1142 g 418 t ORIGIN Chromosome 4q35-qter. 1 ggtaccagca ggtgggccgc ctactgcgca cgcgcgggtt tgcgggcagc cgcctgggct 61 gtgggagcag cccgggccag agctctcctg cctctccacc agcccacccc gccgcctgac 121 cgccccctcc ccacccccca ccccccaccc ccggaaaacg cgtcgtcccc tgggctgggt 181 ggagaccccc gtcccgcgaa acaccgggcc ccgcgcagcg tccgggcctg acaccgctcc 241 gccggctcgc ctcctcctgt cgcccccggg ccaccgtcgc ccgcccgccc gggcccctgc 301 gggcccctgc agccgcccag ctgccagcac gggcggctgg cggcggaacg cagaccccag 361 gcccggcgca caccggggac gctgagcgtt ccaggcggga gggaaggcgg gcagagatgg 421 agagaggaac gggagagact agaggggcgg aagggacgtt aggagggagg cagggaggca 481 gggaggcagg gaggaacgga gggagagaca gagcgacgca gggactgggg gcggggccga 541 gggagccggg gacggacggg gggaggaagg cagggaggaa aagcggtcct cggcctccgg 601 gagtagcggg accgcccgcc tccgggaaaa cggtcagcgt ccggcgcggg ctgagggctg 661 ggcccacagc cgccgcgccg gccggcgggg caccacccat tcgccccggt tccggggccc 721 agggagtggg cggtttcctc cgggacaaaa gaccgggact cgggttgccg tcgggtcttc 781 acccgcgcgg ttcacagacc gcacatcccc aggctgagcc ctgcaacgcg gcgcgaggcc 841 cacagacccg gccacggagg agccacacgc aggacgacgg aggcgtgatt ttggtttccg 901 agtggctttg ccctcccgaa ggcggcctgt tgctcacgtc tctccggccc ccgaaaggct 961 ggccatgccg actgtttgct cccggagctc tgcggcaccc ggaaacatgc agggaagggt 1021 gcaagcccgg catggtgcct tcgctctcct tgccaggttc caaacccgcc acactgcaga 1081 ctcccccacg ttgccgcacg cgggaatcca tcgtcaggcc atcacgccgg ggaggcatct 1141 cctctctggg gtctcgctct ggtcttctac gtggaaatga acgagagcca cacgcctgcg 1201 tgtgcgagac cgtcccggca acggcgacgc ccacaggcat tgcctccttc acggagagag 1261 ggcctggcac actcaagact cccacggagg ttcagttcca cactcccctc caccctccca 1321 ggctggtttc tccctgctgc cgacgcgtgg gagcccagag agcggcttcc cgttcccgcg 1381 ggatccctgg agaggtccgg agagccggcc cccgaaacgc gcccccctcc cccctccccc 1441 ctctccccct tcctcttcgt ctctccggcc ccaccaccac caccgccacc acgcctcccc 1501 caccaccccc cccccccacc accaccacca ccaccacccc gccggccggc cccaggcctc 1561 gacgccctgg gtcccttccg gggtggggcg ggctgtccca ggggggctca ccgccattca 1621 tgaaggggtg gagcctgcct gcctgtgggc ctttacaagg gcggctggct ggctggctgg 1681 ctgtccgggc aggcctcctg gctgcacctg ccgcagtgca cagtccggct gaggtgcacg 1741 ggagcccgcc ggcctctctc tgcccgcgtc cgtccgtgaa attccggccg gggctcaccg 1801 cgatggccct cccgacaccc tcggacagca ccctccccgc ggaagcccgg ggacgaggac 1861 ggccacggag actcgtttgg accccgagcc aaagcgaggc cctgcgagcc tgctttgagc 1921 ggaacccgta cccgggcatc gccaccagag aacggctggc ccaggccatc ggcattccgg 1981 agcccagggt ccagatttgg tttcagaatg agaggtcacg ccagctgagg cagcaccggc 2041 gggaatctcg gccctggccc gggagacgcg gcccgccaga aggccggcga aagcggaccg 2101 ccgtcaccgg atcccagacc gccctgctcc tccgagcctt tgagaaggat cgctttccag 2161 gcatcgccgc ccgggaggag ctggccagag agacgggcct cccggagtcc aggattcaga 2221 tctggtttca gaatcgaagg gccaggcacc cgggacaggg tggcagggcg cccgcgcagg 2281 caggcggcct gtgcagcgcg gcccccggcg ggggtcaccc tgctccctcg tgggtcgcct 2341 tcgcccacac cggcgcgtgg ggaacggggc ttcccgcacc ccacgtgccc tgcgcgcctg 2401 gggctctccc acagggggct ttcgtgagcc aggcagcgag ggccgccccc gcgctgcagc 2461 ccagccaggc cgcgccggca gagggggtct cccaacctgc cccggcgcgc ggggatttcg 2521 cctacgccgc cccggctcct ccggagccgg ggcgctctcc caccctcagg ctcctcggtg 2581 gcctccgcac ccgggcaaaa gccgggagga ccgggacccg cagcgcgacg gcctgccggg 2641 cccctgcgcg gtggcacagc ctgggcccgc tcaagcgggg ccgcaggcca aggggtgctt 2701 gcgccaccca cgtcccaggg gagtccgtgg tggggctggg gccggggtcc ccaggtcgcc 2761 ggggcggcgt gggaacccca agccggggca gctccacctc cccagcccgc gcccccggga 2821 cgcctccgcc tccgcgcggc aggggcagat gcaaggcatc ccggcgccct cccaggcgct 2881 ccaggagccg gcgccctggt ctgcactccc ctgcggcctg ctgctggatg agctcctggc 2941 gagcccggag tttctgcagc aggcgcaacc tctcctagaa acggaggccc cgggggagct 3001 ggaggcctcg gaagaggcgc ctcgctggaa gcacccctca gcgaggaaga ataccgggct 3061 ctgctggagg agctttagga cgcggggttg ggacggggtc gggtggttcg gggcagggcg 3121 gtggcctctc tttcgcgggg aacacctggc tggctacgga ggggcgtgtc tccgccccgc 3181 cccctccacc gggctgaccg gcctgggatt cctgccttct aggtccaggc ccggtgagag 3241 actccacacc gcggagaact gccattcttt cctgggcatc ccggggatcc cagagccggc 3301 cca // LOCUS HUMFUCTRA 1256 bp DNA PRI 26-MAR-1996 DEFINITION Homo sapiens alpha-(1,3) fucosyltransferase gene, complete cds. ACCESSION M65030 NID g182791 KEYWORDS alpha-(1,3) fucosyltransferase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1256) AUTHORS Lowe,J.B., Kukowska-Latallo,J.F., Nair,R.P., Larsen,R.D., Marks,R.M., Macher,B.A., Kelly,R.J. and Ernst,L.K. TITLE Molecular cloning of a human fucosyltransferase gene that determines expression of the Lewis x and VIM-2 epitopes but not ELAM-1-dependent cell adhesion JOURNAL J. Biol. Chem. 266 (26), 17467-17477 (1991) MEDLINE 91373370 FEATURES Location/Qualifiers source 1..1256 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 39..1256 /codon_start=1 /product="alpha(1,3)-fucosyltransferase" /db_xref="PID:g1236720" /translation="MGAPWGSPTAAAGGRRGWRRGRGLPWTVCVLAAAGLTCTALITY ACWGQLPPLPWASPTPSRPVGVLLWWEPFGGRDSAPRPPPDCPLRFNISGCRLLTDRA SYGEAQAVLFHHRDLVKGPPDWPPPWGIQAHTAEEVDLRVLDYEEAAAAAEALATSSP RPPGQRWVWMNFESPSHSPGLRSLASNLFNWTLSYRADSDVFVPYGYLYPRSHPGDPP SGLAPPLSRKQGLVAWVVSHWDERQARVRYYHQLSQHVTVDVFGRGGPGQPVPEIGLL HTVARYKFYLAFENSQHLDYITEKLWRNALLAGAVPVVLGPDRANYERFVPRGAFIHV DDFPSASSLASYLLFLDRNPAVYRRYFHWRRSYAVHITSFWDEPWCRVCQAVQRAGDR PKSIRNLASWFER" BASE COUNT 174 a 443 c 431 g 208 t ORIGIN 1 acgcgtggcg agcggaggca gcgctgcctg ttcgcgccat gggggcaccg tggggctcgc 61 cgacggcggc ggcgggcggg cggcgcgggt ggcgccgagg ccgggggctg ccatggaccg 121 tctgtgtgct ggcggccgcc ggcttgacgt gtacggcgct gatcacctac gcttgctggg 181 ggcagctgcc gccgctgccc tgggcgtcgc caaccccgtc gcgaccggtg ggcgtgctgc 241 tgtggtggga gcccttcggg gggcgcgata gcgccccgag gccgccccct gactgcccgc 301 tgcgcttcaa catcagcggc tgccgcctgc tcaccgaccg cgcgtcctac ggagaggctc 361 aggccgtgct tttccaccac cgcgacctcg tgaaggggcc ccccgactgg cccccgccct 421 ggggcatcca ggcgcacact gccgaggagg tggatctgcg cgtgttggac tacgaggagg 481 cagcggcggc ggcagaagcc ctggcgacct ccagccccag gcccccgggc cagcgctggg 541 tttggatgaa cttcgagtcg ccctcgcact ccccggggct gcgaagcctg gcaagtaacc 601 tcttcaactg gacgctctcc taccgggcgg actcggacgt ctttgtgcct tatggctacc 661 tctaccccag aagccacccc ggcgacccgc cctcaggcct ggccccgcca ctgtccagga 721 aacaggggct ggtggcatgg gtggtgagcc actgggacga gcgccaggcc cgggtccgct 781 actaccacca actgagccaa catgtgaccg tggacgtgtt cggccggggc gggccggggc 841 agccggtgcc cgaaattggg ctcctgcaca cagtggcccg ctacaagttc tacctggctt 901 tcgagaactc gcagcacctg gattatatca ccgagaagct ctggcgcaac gcgttgctcg 961 ctggggcggt gccggtggtg ctgggcccag accgtgccaa ctacgagcgc tttgtgcccc 1021 gcggcgcctt catccacgtg gacgacttcc caagtgcctc ctccctggcc tcgtacctgc 1081 ttttcctcga ccgcaacccc gcggtctatc gccgctactt ccactggcgc cggagctacg 1141 ctgtccacat cacctccttc tgggacgagc cttggtgccg ggtgtgccag gctgtacaga 1201 gggctgggga ccggcccaag agcatacgga acttggccag ctggttcgag cggtag // LOCUS HUMG0S2PE 4466 bp DNA PRI 05-JAN-1995 DEFINITION Human GOS2 gene, 5' flank and cds. ACCESSION M72885 NID g182852 KEYWORDS . SOURCE Homo sapiens blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4466) AUTHORS Russell,L. and Forsdyke,D.R. TITLE A human putative lymphocyte G0/G1 switch gene containing a CpG-rich island encodes a small basic protein with the potential to be phosphorylated JOURNAL DNA Cell Biol. 10 (8), 581-591 (1991) MEDLINE 92029620 FEATURES Location/Qualifiers source 1..4466 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /tissue_type="blood" enhancer 150..156 /note="c-mos enhancer homology; putative" misc_feature 270..286 /note="homeobox homology; putative" misc_feature 292..320 /note="dyad-symmetry element" protein_bind 375..384 /note="AP2 site homology; putative" /bound_moiety="AP2" protein_bind 405..414 /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" misc_feature 499..507 /note="T cell element homology; putative" repeat_region 602..666 /note="TCAGTTT-containing repeats" enhancer 733..742 /note="c-mos enhancer homology; putative" protein_bind 1049..1059 /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" protein_bind 1208..1216 /note="AP3 site homology; putative" /bound_moiety="AP3" protein_bind 1291..1299 /note="AP1 site homology; putative" /bound_moiety="AP1" misc_feature 1607..1627 /note="region of dyad symmetry" protein_bind 1732..1740 /note="AP1 site homology; putative" /bound_moiety="AP1" misc_binding 1810..1818 /bound_moiety="c_myc" protein_bind 1810..1818 /note="AP1 site homology; putative" /bound_moiety="AP1" protein_bind 1828..1841 /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" misc_feature 1829..1858 /note="GC-rich mini-island" repeat_region 1961..2002 /note="CCAAT-containing repeat element" CAAT_signal 1963..1974 repeat_region 2003..2044 /note="CCAAT-containing repeat element" CAAT_signal 2005..2016 misc_feature 2248..2257 /note="TGF-beta consensus; putative" enhancer 2390..2397 /note="Adenovirus E1A enhancer homology; putative" repeat_region 2475..2544 /note="CT/CA repeat element" enhancer 2674..2680 /note="Adenovirus E4F1 enhancer homology; putative" misc_feature 2796..2805 /note="T cell element homology; putative" misc_feature 2863..3965 /note="CpG rich island" misc_feature 2895..2905 /note="CpG rc-fos dyad symmetry SRE arm" misc_feature 3076..3085 /note="T cell element homology; putative" protein_bind 3113..3122 /note="AP2 site homology; putative" /bound_moiety="AP2" misc_feature 3149..3168 /note="dyad symmetry element" TATA_signal 3190..3194 mRNA join(3228..3355,3459..4149) /gene="G0S2" gene join(3228..3355,3459..4149) /gene="G0S2" exon 3228..3355 /gene="G0S2" /number=1 intron 3356..3458 /number=1 misc_feature 3361..3368 /note="thyroid hormone response element" exon 3459..4149 /gene="G0S2" /number=1 protein_bind 3462..3472 /gene="G0S2" /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" CDS 3491..3802 /gene="G0S2" /note="ORF 103 amino acids" /codon_start=1 /db_xref="PID:g182853" /translation="METVQELIPLAKEMMAQKRKGKMVKLYVLGSVLALFGVVLGLME TVCSPFTAARRLRDQEAAVAELQAALERQALQKQALQEKGKQQDTVLGGRALSNRQHA S" protein_bind 3513..3522 /gene="G0S2" /note="AP2 site homology; putative" /bound_moiety="AP2" protein_bind 3674..3683 /gene="G0S2" /note="AP2 site homology; putative" /bound_moiety="AP2" repeat_region 4129..4147 /note="ATTA repeats" polyA_signal 4173..4178 misc_feature 4277..4283 /note="CK-2 cytokine motif homology; putative" BASE COUNT 1174 a 1200 c 980 g 1112 t ORIGIN 1 tctagatctc tagtctataa gaccagagga gacagtggct acacatataa atttcagtgt 61 cttccactga tatccgagtg ataagcaact tcctttctaa aattatgaaa gtaactaagg 121 gtctaaaaaa aatttctagt gcggtagtct taaaactaca aatagttttg tcatatctcc 181 tatgatgact ccctcccttc agctgcctgg agcccagggg tgggcagtga ctttgtgtag 241 gcagcaagca gccggtaaca aaataataat tattattgtt attattatat tataataatt 301 gtaacaataa caattattat tgaagctcat ttacaactaa ccatccaaaa gacctctttc 361 ccctgtgtct tcaatcccca aggcagaggg gtagggacag ttctatccct cctctgcact 421 taacccttga aacacatgca cccctttgtg actttaccct ctgcagatgg ctctgaatgt 481 cttaatgtct gagagaaagg gatttagaaa gcaaaatata aaaattttaa actagtcctt 541 cctaccttcc tagaagtggc aagagttaaa tgttgagata gactcaaggg taggatgact 601 atttcagttt gcctggaagt gtttcagttt gcctggaagt atctcagttt tagtgctcag 661 ttttagcagg agtcctgcat ttcagagaac ccctcaaccc tgagcaaaat aagatggttg 721 gtcaccctat ggcttggttt gaccatcatc gctataacct gtatcttacc tgcatgcatg 781 acacaacaca gtgctttact tccctaaaaa tgacatcagc cccactaaaa tatatgttta 841 gtttccaagc cctacctagt caccttctac cctaggagcc aggttctctt ttcccaccca 901 gacagaggag ctgcactcag aaattcctag acatgagtta acactggatt ccttagcctt 961 ctactcccat catctcctgc tcagccccag ctaccaccta aactaggaag atcaagtcta 1021 ccagtgacct ccgtccatgg cacttgctgc ccctcctctg tccagctctt accaatatag 1081 ctgctggaac ctggaggtca aagtcaaatt atcaaaaaaa ggaactgagc tggtgatgtg 1141 cactaacaca gcaaatcaca ggaaagggga acccaggtaa attacagcct tctgacctag 1201 gaagacgtgt ggtttgcgtc tctgagttac agaaacacag gaaatgctta ctggaccagt 1261 caatttcaga attttgggtc ccaagctagg ctgactcacc ttcagaatgg aaaccacgtg 1321 acagccctta tatcagggca cacatcacat gctcttccag aagtcaatgg gtttggaacc 1381 ctcacagata ttgggaaagc tcactaatca tttctgccag ttatcagagg ttgctctgaa 1441 actataagga agattcaaag aaaatgccaa gactgatatt aaacttggca ggaacccttg 1501 ttacagaatt ttcctgcctg acaaggttaa aagaacaata agcaggaaac acagtcctcc 1561 aggaataatc aattctattt ggcccctggt caccttcact cagactaaat tctaaaacat 1621 agaatttcaa ataagctatt tagataacct tgaccattct ccacacacaa gcccttgcct 1681 gaactattaa tagtcaaggc aaagggtagt tgttattgct gcctttttaa actgaatcat 1741 ctgagaaatt gcttcagacc cccaaagaaa gattactgtt aacaattcaa aaactaaaat 1801 atttgatccc tgagacagcc ttttcccccg acccgccctt cagggctcag tccgaccgac 1861 tgcaaaggct gttgcaagat tgcatcactg acctttgcaa ttttctggcc agtttgattc 1921 cccttctttc ccctgccccc tccctttctc tgcttaaagg cctttggcca atttgcctct 1981 ccttttcccc aagtttgcta accctttggc caatttgcct ctctttttcc ccaagtttgc 2041 taactctagc atatccataa ccaaagccaa actagaacgc tccctcagcc ccaggtgatt 2101 acagctaacc ctggtcaaaa tcaatcctac atcttcacac gtccaagagt actcacactc 2161 tggattctta cctaagctgt ctactacacg cccttctgcc cacaaactgc ttcaaagctg 2221 aagttgagct ggagcagtga agttgtaccc ccaaacccag gagggtggca gagaattatt 2281 gaggagagca tgaaatactt ccattctaaa atggcaagat gaacttctac caacagcccc 2341 ttccatactt gaccccctac ccccaagctc ccaactccac ttctcaagtg gaagtgagaa 2401 caatttgaat ttgaaggctc ttccctgata cacggaagta cataaggaaa cagctcgcag 2461 gcaaagagac taatctctct ctctctctct ctctctctct ctctctctct ctcacacaca 2521 cacacacaca cacacacaca cacacctctg tgcataaaag caccatcaat gaatagtttt 2581 ctatcaactg actctagtta tacatgcatg tacctctaaa taaaaccaac caggcaggaa 2641 agaaacaata ttagcacata ttgctttatc caagcgtaac ctgttctgtc ctgttaccca 2701 gatccttccc ccttgccttc tcctctctga tccattgcca cacacgtggg aaggtgacaa 2761 cccttccgaa taaaaatgaa agctttcttc tttagatgga acccccaaat tccctcatta 2821 tttataatgt caggctgtcc tggacaaggg aagctgtgca cccgctgaca ccagtaagaa 2881 ggttgccgcc atgtcagaga tgtccgcgga cacctccctg ggctccgggt cctcccctgc 2941 gctcgcctgg agtgggacct tcgcgtgcac actggccttc ccacgcgccc cgctgcgatg 3001 gcacccgcgc cgggccccct agctcacaca gtcggagcgt gctcagcgcg tggccacctc 3061 ttgccaggtc ccagccgggt tccaccccct ccttttcccc tcctcttctt cctccccctc 3121 cgagttcccc tggctctgac cgcgctggcc tgggcccgag agcccaggag gcgtgtctca 3181 gagaaaagat ataagcggcc cccggacgct aaagcggtgc cagcggcgga gtctccaact 3241 gggagagctg cagctgccga gaggaggaga acgctgaggt cggtcggacc aacggacgcg 3301 ctgaccgctg ccaactgcag ctcgcgctgc ctcctgctcg cgccgtgcca ctaaggtagt 3361 ccgcctttct atgagccctc cccaagatta gctgggtgcg gggtggtggg agccgttctt 3421 tggtggctga agcccctctc ctgctgctcc tcctgcaggt cattcccgcc tccgagagcc 3481 cagagccgag atggaaacgg tccaggagct gatccccctg gccaaggaga tgatggccca 3541 gaagcgcaag gggaagatgg tgaagctgta cgtgctgggc agcgtgctgg ccctcttcgg 3601 cgtggtgctc ggcctgatgg agactgtgtg cagccccttc acggccgcca gacgtctgcg 3661 ggaccaggag gcagccgtgg cggagctgca ggccgccctg gagcgacagg ctctccagaa 3721 gcaagccctg caggagaaag gcaagcagca ggacacggtc ctcggcggcc gggccctgtc 3781 caaccggcag cacgcctcct aggaactgtg ggagaccagc ggagtgggag ggagacgcag 3841 tagacagaga cagaccgaga gaggaaggga gagacagagg gggcgcgcgc acaggagcct 3901 gactccgctg ggagagtgca ggagcacgtg ctgtttttta tttggactta acttcagaga 3961 aaccgctgac atctagaact gacctaccac aagcagccac caaaggagtt tgggattgag 4021 ttttgctgct gtgcagcact gcattgtcat gacatttcca acactgtgtg aattatctaa 4081 atgcgtctac cattttgcac tagggaggaa ggataaatgc tttttatgtt attattatta 4141 attattacaa tgaccaccat tttgcatttt gaaataaaaa actttttata ccatatctca 4201 tgtaattcct gagaggtgtg gtgtcctggg gtgggaagca gggagggtga gcaggtgggc 4261 gatggtgatg ggttcttacc tgagcactgc agagggagca gcttcctgag ggtcagacac 4321 ttgcttcaca cctaggaact gtgtaataag ttactacatg catataagtc tgttgaggac 4381 ttgtttttcc ttcttgttag gggtgggaag agagaaaatt ttataacttc cgtgagattt 4441 agcattttaa catcaaaagg tagatc // LOCUS HUMGA733A 2259 bp DNA PRI 08-NOV-1994 DEFINITION Human gastrointestinal tumor-associated antigen GA733-1 protein gene, complete cds, clone 05516. ACCESSION J04152 NID g182893 KEYWORDS GA733-1 gene; integral membrane glycoprotein. SOURCE Homo sapiens (tissue library: of Maniatis) foetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2259) AUTHORS Linnenbach,A.J., Wojcierowski,J., Wu,S.A., Pyrc,J.J., Ross,A.H., Dietzschold,B., Speicher,D. and Koprowski,H. TITLE Sequence investigation of the major gastrointestinal tumor-associated antigen gene family, GA733 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (1), 27-31 (1989) MEDLINE 89098896 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.J.Linnenbach, 03-FEB-1989. FEATURES Location/Qualifiers source 1..2259 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="liver" /tissue_lib="of Maniatis" /map="Unassigned" protein_bind 109..118 /note="putative" /bound_moiety="Sp-1" repeat_region 227..234 /note="direct repeat 1 copy A" mRNA 237..2029 /note="GA733-1 protein mRNA" gene 307..1278 /gene="M1S1" CDS 307..1278 /gene="M1S1" /note="GA733-1 protein precursor" /codon_start=1 /db_xref="GDB:G00-120-161" /db_xref="PID:g182894" /translation="MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVC SPDGPGGRCQCRALGSGMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDG LYDPDCDPEGRFKARQCNQTSVCWCVNSVGVRRTDKGDLSLRCDDLVRTHHILIDLRH RPTAGAFNHSDLDAELRRLFRERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGEVD IGDAAYYFERDIKGESLFQGRGGLDLRVRGEPLQVERTLIYYLDEIPPKFSMKRLTAG LIAVIVVVVVALVAGMAVLVITNRRKSGKYKKVEIKELGELRKEPSL" sig_peptide 307..384 /gene="M1S1" /note="GA733-1 protein signal peptide (put.); putative" mat_peptide 385..1275 /gene="M1S1" /note="GA733-1 protein" misc_feature 403..411 /gene="M1S1" /note="N-linked glycosylation site" misc_feature 664..672 /gene="M1S1" /note="N-linked glycosylation site" misc_feature 808..816 /gene="M1S1" /note="N-linked glycosylation site" misc_feature 928..936 /gene="M1S1" /note="N-linked glycosylation site" repeat_region 1327..1334 /note="direct repeat 1 copy B" BASE COUNT 460 a 645 c 642 g 512 t ORIGIN 221 bp upstream of PstI site. 1 gttctcccct tcccggcttt cggtccggag gaggcgggag cagcttccct gttctgatcc 61 tatcgcgggc ggcgcagggc cggcttggcc ttccgtggga cggggagggg ggcgggatgt 121 gtcacccaaa taccagtggg gacggtcggt ggtggaacca gccgggcagg tcgggtagag 181 tataagagcc ggagggagcg gccgggcggc agacgcctgc agaccatccc agacgccgga 241 gcccgagccc cgccgagtcc ccgcgcctca tccgcccgcg tccggtccgc gttcctccgc 301 cccaccatgg ctcggggccc cggcctcgcg ccgccaccgc tgcggctgcc gctgctgctg 361 ctggtgctgg cggcggtgac cggccacacg gccgcgcagg acaactgcac gtgtcccacc 421 aacaagatga ccgtgtgcag ccccgacggc cccggcggcc gctgccagtg ccgcgcgctg 481 ggctcgggca tggcggtcga ctgctccacg ctgacctcca agtgtctgct gctcaaggcg 541 cgcatgagcg cccccaagaa cgcccgcacg ctggtgcggc cgagtgagca cgcgctcgtg 601 gacaacgatg gcctctacga ccccgactgc gaccccgagg gccgcttcaa ggcgcgccag 661 tgcaaccaga cgtcggtgtg ctggtgcgtg aactcggtgg gcgtgcgccg cacggacaag 721 ggcgacctga gcctacgctg cgatgacctg gtgcgcaccc accacatcct cattgacctg 781 cgccaccgcc ccaccgccgg cgccttcaac cactcagacc tggacgccga gctgaggcgg 841 ctcttccgcg agcgctatcg gctgcacccc aagttcgtgg cggccgtgca ctacgagcag 901 cccaccatcc agatcgagct gcggcagaac acgtctcaga aggccgccgg tgaagtggat 961 atcggcgatg ccgcctacta cttcgagagg gacatcaagg gcgagtctct attccagggc 1021 cgcggcggcc tggacttgcg cgtgcgcgga gaacccctgc aggtggagcg cacgctcatc 1081 tattacctgg acgagattcc cccgaagttc tccatgaagc gcctcaccgc cggcctcatc 1141 gccgtcatcg tggtggtcgt ggtggccctc gtcgccggca tggccgtcct ggtgatcacc 1201 aaccggagaa agtcggggaa gtacaagaag gtggagatca aggaactggg ggagttgaga 1261 aaggaaccga gcttgtaggt acccggcggg gcaggggatg gggtggggta ccggatttcg 1321 gtatcgtccc agacccaagt gagtcacgct tcctgattcc tcggcgcaaa ggagacgttt 1381 atcctttcaa attcctgcct tccccctccc ttttgcgcac acaccaggtt taatagatcc 1441 tggcctcagg gtctcctttc tttctcactt ctgtcttgaa ggaagcattt ctaaaatgta 1501 tcccctttcg gtccaacaac aggaaacctg actggggcag tgaaggaagg gatggcacag 1561 cgttatgtgt aaaaaacaag tatctgtatg acaacccggg atcgtttgca agtaactgaa 1621 tccattgcga cattgtgaag gcttaaatga gtttagatgg gaaatagcgt tgttatcgcc 1681 ttgggtttaa attatttgat gagttccact tgtatcatgg cctacccgag gagaagagga 1741 gtttgttaac tgggcctatg tagtagcctc atttaccatc gtttgtatta ctgaccacat 1801 atgcttgtca ctgggaaaga agcctgtttc agctgcctga acgcagtttg gatgtctttg 1861 aggacagaca ttgcccggaa actcagtcta tttattcttc agcttgccct tactaccact 1921 gatattggta atgttctttt ttgtaaaatg tttgtacata tgttgtcttt gataatgttg 1981 ctgtaatttt ttaaaataaa acacgaattt aataaaatat gggaaaggca caaaccagaa 2041 gttggcattt gtgaaaagtc cctccagatt tctatcactt tggtctctaa tttcccaaga 2101 cttgtatttt ttttttattt caaattataa cacttttttt tcccccagaa gtgggtgttt 2161 catgttgcta ctctggtgtg tcccaagata tcctaactgg ccagtgtaaa tgctattctt 2221 tctaaataag attatttgga aacttccttc aaactgcag // LOCUS HUMGLUDECA 1782 bp DNA PRI 11-MAR-1992 DEFINITION Human glutamate decarboxylase, complete cds. ACCESSION M86522 NID g183271 KEYWORDS glutamate decarboxylase. SOURCE Homo sapiens adult pancreatic islets and brain DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1782) AUTHORS Giorda,R., Peakman,M., Vergani,D. and Trucco,M. TITLE Sequence of glutamic acid decarboxylase transcripts in human pancreatic islets and brain JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1782 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="beta cells" /dev_stage="adult" /tissue_type="pancreatic islets and brain" CDS 1..1782 /codon_start=1 /product="glutamate decarboxylase" /db_xref="PID:g183272" /translation="MASSTPSSATSSNAGQDPNTTNLRPTTYDTWCGVAHGCTRKLGL KICGFLQRTNSLEEKSRLVSAFKERQSSKNLLSCENSDRDARFRRTETDFSNLFARDL LPAKNGEEQTVQFLLEVVDILLNYVRKTFDRSAKVLEFRHPHQLLEGMEGFNLELSDH PESLEQILVDCRDTLKYGVRTGHPRFFNQLSTGLDIIGLAGEWLTSTANTNMFTYEIA PVFVLMEQITLKKMREIVGWSSKDGDGIFSPGGAISNMYSIMAARYKYFPEVKTKGMA AVPKLVLFTSEQSHYSIKKAGAALGFGTDNVILIKCNERGKIIPADFEAKILEAKQKG YVPFYVNATAGTTVYGAFDPIQEIADICEKYNLWLHVDAAWGGGLLMSRKHRHKLNGI ERANSVTWNPHKMMGVLLQCSAILVKEKGILQGCNQMCAGYLFQPDKQYDVSYDTGDK AIQCGRHVDIFKFWLMWKAKGTVGFENQINKCLELAEYLYAKIKNREEFEMVFNGEPE HTNVCFWYIPQSLRGVPDSPQRREKLHKVAPKIKALMMESGTTMVGYQPQGDKANFFR MVISNPAATQSDIDFLIEEIERLGQDL" BASE COUNT 488 a 438 c 453 g 403 t ORIGIN 1 atggcgtctt cgaccccttc gtccgcaacc tcctcgaacg cgggacagga ccccaatacc 61 actaacctgc gccccacaac gtacgatacc tggtgcggcg tggcccacgg atgcaccaga 121 aaactggggc tcaagatctg cggcttcttg caaaggacca acagcctgga agagaagagt 181 cgccttgtga gtgccttcaa ggagaggcaa tcctccaaga acctgctttc ctgtgaaaac 241 agcgaccggg atgcccgctt ccggcgcaca gagactgact tctctaatct gtttgctaga 301 gatctgcttc cggctaagaa cggtgaggag caaaccgtgc aattcctcct ggaagtggtg 361 gacatactcc tcaactatgt ccgcaagaca tttgatcgct ccgccaaggt gctggaattt 421 cgtcacccac accagttgct ggaaggcatg gagggcttca acttggagct ctctgaccac 481 cccgagtccc tggagcagat cctggttgac tgcagagaca ccttgaagta tggggttcgc 541 acaggtcatc ctcgattttt caaccagctc tccactggat tggatattat tggcctagct 601 ggagaatggc tgacatcaac tgccaatacc aacatgttta catatgaaat tgcaccagtg 661 tttgtcctca tggaacaaat aacacttaag aagatgagag agatagttgg atggtcaagt 721 aaagatggtg atgggatatt ttctcctggg ggcgccatat ccaacatgta cagcatcatg 781 gctgctcgct acaagtactt cccggaagtt aagacaaagg gcatggcggc tgtgcctaaa 841 ctggtcctct tcacctcaga acagagtcac tattccataa agaaagctgg ggctgcactt 901 ggctttggaa ctgacaatgt gatcttgata aagtgcaatg aaagggggaa aataattcca 961 gctgattttg aggcaaaaat tcttgaagcc aaacagaagg gatatgttcc cttttatgtc 1021 aatgcaactg ctggcacgac tgtttatgga gcttttgatc cgatacaaga gattgcagat 1081 atatgtgaga aatataacct ttggttgcat gtcgatgctg cctggggagg tgggctgctc 1141 atgtccagga agcaccgcca taaactcaac ggcatagaaa gggccaactc agtcacctgg 1201 aaccctcaca agatgatggg cgtgctgttg cagtgctctg ccattctcgt caaggaaaag 1261 ggtatactcc aaggatgcaa ccagatgtgt gcaggatacc tcttccagcc agacaagcag 1321 tatgatgtct cctacgacac cggggacaag gcaattcagt gtggccgcca cgtggatatc 1381 ttcaagttct ggctgatgtg gaaagcaaag ggcacagtgg gatttgaaaa ccagatcaac 1441 aaatgcctgg aactggctga atacctctat gccaagatta aaaacagaga agaatttgag 1501 atggttttca atggcgagcc tgagcacaca aacgtctgtt tttggtatat tccacaaagc 1561 ctcaggggtg tgccagacag ccctcaacga cgggaaaagc tacacaaggt ggctccaaaa 1621 atcaaagccc tgatgatgga gtcaggtacg accatggttg gctaccagcc tcaaggggac 1681 aaggccaact tcttccggat ggtcatctcc aacccagccg ctacccagtc tgacattgac 1741 ttcctcattg aggagataga aagactgggc caggatctgt aa // LOCUS HUMGPCRD 1262 bp DNA PRI 31-JUL-1995 DEFINITION Homo sapiens G protein-coupled receptor (GPR3) gene, complete cds. ACCESSION L32831 NID g602311 KEYWORDS G protein-coupled receptor; G protein-coupled receptor GPR3. SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1262) AUTHORS Iismaa,T.P., Kiefer,J., Liu,M.L., Baker,E., Sutherland,G.R. and Shine,J. TITLE Isolation and chromosomal localization of a novel human G-protein-coupled receptor (GPR3) expressed predominantly in the central nervous system JOURNAL Genomics 24 (2), 391-394 (1994) MEDLINE 95213036 FEATURES Location/Qualifiers source 1..1262 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Kelly" /cell_type="neuroblastoma" intron <1..187 exon 188..1262 CDS 193..1185 /codon_start=1 /product="G protein-coupled receptor GPR3" /db_xref="PID:g602312" /translation="MMWGAGSPLAWLSAGSGNVNVSSVGPAEGPTGPAAPLPSPKAWD VVLCISGTLVSCENALVVAIIVGTPAFRAPMFLLVGSLAVADLLAGLGLVLHFAAVFC IGSAEMSLVLVGVLAMAFTASIGSLLAITVDRYLSLYNALTYYSETTVTRTYVMLALV WGGALGLGLLPVLAWNCLDGLTTCGVVYPLSKNHLVVLAIAFFMVFGIMLQLYAQICR IVCRHAQQIALQRHLLPASHYVATRKGIATLAVVLGAFAACWLPFTVYCLLGDAHSPP LYTYLTLLPATYNSMINPIIYAFRNQDVQKVLWAVCCCCSSSKIPFRSRSPSDV" BASE COUNT 213 a 415 c 332 g 302 t ORIGIN 1 attggagggg acagcggtat cctgggaaga gccccagggc atgaatgtgg ggataaggca 61 ttgggaccct atcaggtatc ctgaggagag actcccacca cgtatcctga gaagcacctc 121 accccctcca gaccccaact cccatcaccc agcttggtca gcttctcaca aggcctttct 181 cctgcaggta ccatgatgtg gggtgcaggc agccctctgg cctggctctc agctggctca 241 ggcaacgtga atgtaagcag cgtgggccca gcagaggggc ccacaggtcc agccgcacca 301 ctgccctcgc ctaaggcctg ggatgtggtg ctctgcatct caggcaccct ggtgtcctgc 361 gagaatgcgc tagtggtggc catcatcgtg ggcactcctg ccttccgtgc ccccatgttc 421 ctgctggtgg gcagcctggc cgtggcagac ctgctggcag gcctgggcct ggtcctgcac 481 tttgctgctg tcttctgcat cggctcagcg gagatgagcc tggtgctggt tggcgtgctg 541 gcaatggcct ttaccgccag catcggcagt ctactggcca tcactgtcga ccgctacctt 601 tctctgtaca atgccctcac ctactattca gagacaacag tgacacggac ctatgtgatg 661 ctggccttag tgtggggagg tgccctgggc ctggggctgc tgcctgtgct ggcctggaac 721 tgcctggatg gcctgaccac atgtggcgtg gtttatccac tctccaagaa ccatctggta 781 gttctggcca ttgccttctt catggtgttt ggcatcatgc tgcagctcta cgcccaaatc 841 tgccgcatcg tctgccgcca tgcccagcag attgcccttc agcggcacct gctgcctgcc 901 tcccactatg tggccacccg caagggcatt gccacactgg ccgtggtgct tggagccttt 961 gccgcctgct ggttgccctt cactgtctac tgcctgctgg gtgatgccca ctctccacct 1021 ctctacacct atcttacctt gctccctgcc acctacaact ccatgatcaa ccctatcatc 1081 tacgccttcc gcaaccagga tgtgcagaaa gtgctgtggg ctgtctgctg ctgctgttcc 1141 tcttccaaga tccccttccg atcccgctcc cccagtgatg tctagctgag tcttcatgac 1201 ccttcaaccc tgattactac agaattccag aatgttaggc tctccagggc ttctttccaa 1261 ac // LOCUS HUMGPIBAA 6062 bp DNA PRI 08-NOV-1994 DEFINITION Human blood platelet membrane glycoprotein Ib-alpha (GPIB) gene, complete cds, clone N10. ACCESSION M22403 NID g183501 KEYWORDS adhesion receptor; blood platelet membrane glycoprotein; cell surface glycoprotein; cell surface receptor; thrombin receptor; transmembrane protein; von Willebrand factor receptor. SOURCE Human platelet DNA, clone N10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2532) AUTHORS Clemetson,K.J. JOURNAL Unpublished (1988) REFERENCE 2 (bases 2533 to 6062) AUTHORS Wenger,R.H., Kieffer,N., Wicki,A.N. and Clemetson,K.J. TITLE Structure of the human blood platelet membrane glycoprotein Ib alpha gene JOURNAL Biochem. Biophys. Res. Commun. 156 (1), 389-395 (1988) MEDLINE 89025874 COMMENT Draft entry and computer-readable sequence for [1,2] kindly submitted by K.J.Clemetson, 03-FEB-1989. FEATURES Location/Qualifiers source 1..6062 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="platelet" /map="17pter-p12" repeat_region 231..518 /note="Alu repeat 1" repeat_region 674..967 /note="Alu repeat 2" repeat_region 861..1158 /note="Alu repeat 3" repeat_region 1622..1820 /note="Alu repeat 4" repeat_region 2231..2520 /note="Alu repeat 5" misc_feature 2642..2659 /note="Sp1 binding site" gene 2795..5416 /gene="GP1BA" exon 2795..2829 /gene="GP1BA" /note="G00-118-806" /number=1 intron 2830..3062 /gene="GP1BA" /note="G00-118-806" /number=1 exon 3063..5416 /gene="GP1BA" /note="G00-118-806" /number=2 CDS 3069..4949 /gene="GP1BA" /codon_start=1 /db_xref="GDB:G00-118-806" /product="platelet membrane glycoprotein Ib-alpha" /db_xref="PID:g386752" /translation="MPLLLLLLLLPSPLHPHPICEVSKVASHLEVNCDKRNLTALPPD LPKDTTILHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSH NQLQSLPLLGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLL TPTPKLEKLSLANNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPFAFL HGNPWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYP GKGCPTLGDEGDTDLYDYYPEEDTEGDKVRATRTVVKFPTKAHTTPWGLFYSWSTASL DSQMPSSLHPTQESTKEQTTFPPRWTPNFTLHMESITFSKTPKSTTEPTPSPTTSEPV PEPAPNMTTLEPTPSPTTPEPTSEPAPSPTTPEPTPIPTIATSPTILVSATSLITPKS TFLTTTKPVSLLESTKKTIPELDQPPKLRGVLQGHLESSRNDPFLHPDFCCLLPLGFY VLGLFWLLFASVVLILLLSWVGHVKPQALDSGQGAALTTATQTTHLELQRGRQVTVPR AWLLFLRGSLPTFRSSLFLWVRPNGRVGPLVAGRRPSALSQGRGQDLLSTVSIRYSGH SL" polyA_signal 5392..5397 /gene="GP1BA" /note="G00-118-806" repeat_region 5410..5702 /note="Alu repeat 6" repeat_region 5725..6014 /note="Alu repeat 7" BASE COUNT 1560 a 1491 c 1723 g 1288 t ORIGIN Chromosome 17p12-pter. 1 gaattctgag ctggaacttg ccaaataagt aaaagttagg cagaagggga taagggttga 61 agaaaaagtt caggcacagg agagaatgat caagagtaag ggcagagtgg gaacagatgg 121 ggctggacag gcaggaaagg ggagaccttg caggaccctg caggccatgt tgggaacttt 181 agactttctc ctgagatcta tgaggagcca ttggtgaatt tatttattta tttagagaca 241 gaatttcgct ctttcgccca ggctggagtg agggggtgtg atctcagctc actgcaacct 301 atgccccctg gggctcaagc tgttctcctg cctcagcctc cctagtagct gggattacag 361 gtgctggcca ccacgtctgg ctaatttttt gtatatttag tagagatggg ttttcaccat 421 gttggccagg ctggtctcaa acccctgacc tcaggtgatc cacccgcctc ggcctcccaa 481 actgctagga ttacaggtgt gagccactgt gcctggccca ttggtggatt tcaagcagtg 541 gaaggatatg agcaaatgtg cagttctgaa agaccactcc agtggagaat aaagagaaga 601 aagggaattg ggaagttggg agaccagcaa ggaggctgtt gcagggaagc agtagcgaga 661 agatggtgcc ctgggcaggg catggtgtca cgcctgtact cctagcactt tgggaggccg 721 aggtgggagg atcgcttgag cccaggagtt tgagaccagc ctaggcaaca aacaaatgtt 781 tttctgtctc tagaaaaaaa tttttaaatc agccaggtgt ggtggtgcat gcctgtagtc 841 ccaactcctt gggaggctga agcaggagga tcccttgggt ccaggagttt gtccttgagt 901 gaggcatgat tgcgccactg cactccagcc tgggagacag agcaaaggag accagcctgg 961 acaacatggt aaaaccccgt ctctactaaa gatatgaaaa attagccggg catgatggcg 1021 ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg aacccaggag 1081 gcagaggtgg cagtgagccg agatggagcc attgcactcc agcctgggag acagggcaag 1141 actccatctc aaaaaaaaaa aaaaaaagaa gaagaagaag aagaaaggag gaagaggagg 1201 aagaagaaga ggaggaggag gatataacaa gaaacaacaa caaccttgga ctaggaacgc 1261 ggcaggatgg tgtggatgga gagtagattt gagagagacg ggagataaaa tcaccaggac 1321 ttggtgatgg ggatgaaaga cgggagacgt tagagatgac tattagggat tttgtttttg 1381 ttgactgaat ggttaggtaa atgagtggat gatggtgcca ctcaccaggc tgagcaacac 1441 aggaggagaa gccatttgga gagagatggt gagttcagct ttgaacatat tgttttggtg 1501 tctgtgggac atccaagtag agatgtccag aaggcagctg gacatatgtg cttgtgtctc 1561 aggagaaggg tctggactag agacatacat ttgagagtca ttaggttgta ggtggtggtt 1621 gaaggcaagg gaatggatga ggataatctc aggtggttgc atagcgtaag aaaagaagag 1681 aatgccgggc gcggtggctc atgcctctaa tcccagcact tcgggaagcc aatgtgggtg 1741 gatcacttga gatgaggagt tcgagaccag gctgaccaac atggtgaaac cccgtctcta 1801 ctaaaaatat aaaaattagc tgggaatgaa gacagcctaa gcacaaactc tgggggcagt 1861 ttaattgttc aagaagggga atctatgaag gagactgaga aggcccaacc aaacaggtaa 1921 gaagaaaacc aagaaaatta cagggtcatg tgtcacagag gccaaaagga gaatcccaag 1981 ggcgagtggc catgggtatg caggctgccg aaataaagta agatcacagt ggagaagttt 2041 cccatggatg gtagaggagg gagagcatta aagaccttgg taagaaccga gtcaatggag 2101 tagcagggga aagaaatcag catagactgg ggtgagaagg cagagagaaa ggtgggagag 2161 aattaaaagt aggcaactct cagaatgcag ctgtgaagag gaagagaagc gggcatctgg 2221 agaggttttt ttgtttttaa gatggagttt tgctcttgtc acccaggctg gagtacaatg 2281 gcacgatctc ggctcaccgc aacctctttc cacctcccgg tttcaagcga ttctcctgcc 2341 tcagcctccc gagtagctgg gattacaggc atgcgccacc acgcctggct aatgtatttt 2401 tagtagagac agggtttccc cgtgttggtc aggctggtct cgaactcctg acttcaggtg 2461 atccgcccgc ctcagcctcc caaagttctg ggattacagg catgagccac gcgcccggcc 2521 ctggagaggt ttttaaaaga tggcagaagg ctgtttggag gagtccaccc ccatctcccc 2581 tgtgtaaaag gaaagcggaa gagagaacca caaagagggc ctgggggaaa gccgtggagt 2641 gaggcgataa gggcttgtgt ccaggggatt cccggtcact ggaatcccta tcaggcctgc 2701 atttcctcct cacccccatc cccttccttg ccactggctt agtcctccat ggggctagaa 2761 gagagaagga cggagtcgag tggcacccta gaagacgctc tgtgccttcg gaggtctttc 2821 tgcctgcctg taagccgggg ttggtgctgg ggcaggagag gggtctgagg gaggggaaag 2881 agccaaggac ctggagctag tagttttaag ttctgcaggc aagggtggga gatgggagta 2941 gggaggacag gaggtgtgga tgctgtttct ggaagcgaag ctgcaggggg aagggggctg 3001 gggcctgggg ggatgcttcc aggggatgca gggggatcca ctcaaggctc ccttgcccac 3061 aggtcctcat gcctctcctc ctcttgctgc tcctgctgcc aagcccctta cacccccacc 3121 ccatctgtga ggtctccaaa gtggccagcc acctagaagt gaactgtgac aagaggaatc 3181 tgacagcgct gcctccagac ctgccgaaag acacaaccat cctccacctg agtgagaacc 3241 tcctgtacac cttctccctg gcaaccctga tgccttacac tcgcctcact cagctgaacc 3301 tagataggtg cgagctcacc aagctccagg tcgatgggac gctgccagtg ctggggaccc 3361 tggatctatc ccacaatcag ctgcaaagcc tgcccttgct agggcagaca ctgcctgctc 3421 tcaccgtcct ggacgtctcc ttcaaccggc tgacctcgct gcctcttggt gccctgcgtg 3481 gtcttggcga actccaagag ctctacctga aaggcaatga gctgaagacc ctgcccccag 3541 ggctcctgac gcccacaccc aagctggaga agctcagtct ggctaacaac aacttgactg 3601 agctccccgc tgggctcctg aatgggctgg agaatctcga cacccttctc ctccaagaga 3661 actcgctgta tacaatacca aagggctttt ttgggtccca cctcctgcct tttgcttttc 3721 tccacgggaa cccctggtta tgcaactgtg agatcctcta ttttcgtcgc tggctgcagg 3781 acaatgctga aaatgtctac gtatggaagc aaggtgtgga cgtcaaggcc atgacctcta 3841 acgtggccag tgtgcagtgt gacaattcag acaagtttcc cgtctacaaa tacccaggaa 3901 aggggtgccc cacccttggt gatgaaggtg acacagacct atatgattac tacccagaag 3961 aggacactga gggcgataag gtgcgtgcca caaggactgt ggtcaagttc cccaccaaag 4021 cccatacaac cccctggggt ctattctact catggtccac tgcttctcta gacagccaaa 4081 tgccctcctc cttgcatcca acacaagaat ccactaagga gcagaccaca ttcccaccta 4141 gatggacccc aaatttcaca cttcacatgg aatccatcac attctccaaa actccaaaat 4201 ccactactga accaacccca agcccgacca cctcagagcc cgtcccggag cccgccccaa 4261 acatgaccac cctggagccc actccaagcc cgaccacccc agagcccacc tcagagcccg 4321 cccccagccc gaccaccccg gagcccaccc caatcccgac catcgccaca agcccgacca 4381 tcctggtgtc tgccacaagc ctgatcactc caaaaagcac atttttaact accacaaaac 4441 ccgtatcact cttagaatcc accaaaaaaa ccatccctga acttgatcag ccaccaaagc 4501 tccgtggggt gctccaaggg catttggaga gctccagaaa tgaccctttt ctccaccccg 4561 acttttgctg cctcctcccc ctgggcttct atgtcttggg tctcttctgg ctgctctttg 4621 cctctgtggt cctcatcctg ctgctgagct gggttgggca tgtgaaacca caggccctgg 4681 actctggcca aggtgctgct ctgaccacag ccacacaaac cacacacctg gagctgcaga 4741 ggggacggca agtgacagtg ccccgggcct ggctgctctt ccttcgaggt tcgcttccca 4801 ctttccgctc cagcctcttc ctgtgggtac ggcctaatgg ccgtgtgggg cctctagtgg 4861 caggaaggag gccctcagct ctgagtcagg gtcgtggtca ggacctgctg agcacagtga 4921 gcattaggta ctctggccac agcctctgag ggtgggaggt ttggggacct tgagagaaga 4981 gcctgtgggc tctcctattg gaatctagtt gggggttgga ggggtaagga acacagggtg 5041 ataggggagg ggtcttagtt cctttttctg tatcagaagc cctgtcttca caacacaggc 5101 acacaatttc agtcccagcc aaagcagaag gggtaatgac atggacttgg cggggggaca 5161 agacaaagct cccgatgctg catggggcgc tgccagatct cacggtgaac cattttggca 5221 gaatacagca tggttcccac atgcatctat gcacagaaga aaatctggaa agtgatttat 5281 caggatgtga gcactcgttg tgtctggatg ttacaaatat gggtggtttt attttctttt 5341 tccctgttta gcattttcta gttttccact attattgtat attatctgta taataaaaaa 5401 taattttagg gttgggagtg atggctcatg cctgtaatcc tagcactttg ggaggccgag 5461 gcgggtggaa tcaccagagg tagggagttc aagaccagcc tggcaaacat ggtgaaaccc 5521 tggtctctac taaaaataca aaaattaggc caggcgtggt ggtgcacacc tataacccca 5581 gctactcggg agggtggggc aggagaatcg cttgaacctg ggaggcggaa gttgccgtga 5641 gccaagatcg taccactgaa ctccagcctg ggtaacagag tgagactccg tctaaaaaaa 5701 aaaaaaaaaa aaaaaaaaac ttctggccgg gtgcaggggc tcatgcctgt aattccagca 5761 ctctggaagg ctgaggcggg tgggttgctt gaacccagga gtttggccca ggcttggcaa 5821 catggcaaaa cccgacctct acaaaaaata caaaacatta gccaggtgtg gtggcatgca 5881 cctgtggtcc caggtacccg ggtggctgag gagggaggat cacctgagcc tgggagatgg 5941 aggctgcagt gagccctgaa ggtgccactg tactccagcc tgggtgacag agtgagagcc 6001 tgtctcaaaa caacttggct tcttttggtg aagagtggct ggggcacctg tcatgagaat 6061 tc // LOCUS HUMGPR5A 1265 bp DNA PRI 28-FEB-1995 DEFINITION Homo sapiens G protein-coupled receptor (GPR5) gene, complete cds. ACCESSION L36149 NID g598154 KEYWORDS G protein-coupled receptor; G-protein coupled receptor. SOURCE Homo sapiens (clone library: lambda EMBL3 SP6/T7) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1265) AUTHORS Heiber,M., Docherty,J.M., Shah,G., Nguyen,T., Cheng,R., Heng,H.H.Q., Marchese,A., Tsui,L.-C., Shi,X., George,S.R. and O'Dowd,B.F. TITLE Isolation of three novel human genes encoding G protein-coupled receptors JOURNAL DNA Cell Biol. 14 (1), 25-35 (1995) MEDLINE 95134353 FEATURES Location/Qualifiers source 1..1265 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda EMBL3 SP6/T7" CDS 139..1140 /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g598155" /translation="MESSGNPESTTFFYYDLQSQPCENQAWVFATLATTVLYCLVFLL SLVGNSLVLWVLVKYESLESLTNIFILNLCLSDLVFACLLPVWISPYHWGWVLGDFLC KLLNMIFSISLYSSIFFLTIMTIHRYLSVVSPLSTLRVPTLRCRVLVTMAVWVASILS SILDTIFHKVLSSGCDYSELTWYLTSVYQHNLFFLLSLGIILFCYVEILRTLFRSRSK RRHRTVKLIFAIVVAYFLSWGPYNFTLFLQTLFRTQIIRSCEAKQQLEYALLICRNLA FSHCCFNPVLYVFVGVKFRTHLKHVLRQFWFCRLQAPSPASIPHSPGAFAYEGASFY" BASE COUNT 220 a 427 c 307 g 311 t ORIGIN 1 caaggcaatc ctctcccttt ggcctttcca aagtgctagg attacaggtg ttagccactg 61 tacccagcca ccacatggct ttaaactcca tgtctctatc atttcagatg ctctaaacgt 121 ccctgccatc tggtccagat ggagtcctca ggcaacccag agagcaccac ctttttttac 181 tatgaccttc agagccagcc gtgtgagaac caggcctggg tctttgctac cctcgccacc 241 actgtcctgt actgcctggt gtttctcctc agcctagtgg gcaacagcct ggtcctgtgg 301 gtcctggtga agtatgagag cctggagtcc ctcaccaaca tcttcatcct caacctgtgc 361 ctctcagacc tggtgttcgc ctgcttgttg cctgtgtgga tctccccata ccactggggc 421 tgggtgctgg gagacttcct ctgcaaactc ctcaatatga tcttctccat cagcctctac 481 agcagcatct tcttcctgac catcatgacc atccaccgct acctgtcggt agtgagcccc 541 ctctccaccc tgcgcgtccc caccctccgc tgccgggtgc tggtgaccat ggctgtgtgg 601 gtagccagca tcctgtcctc catcctcgac accatcttcc acaaggtgct ttcttcgggc 661 tgtgattatt ccgaactcac gtggtacctc acctccgtct accagcacaa cctcttcttc 721 ctgctgtccc tggggattat cctgttctgc tacgtggaga tcctcaggac cctgttccgc 781 tcacgctcca agcggcgcca ccgcacggtc aagctcatct tcgccatcgt ggtggcctac 841 ttcctcagct ggggtcccta caacttcacc ctgtttctgc agacgctgtt tcggacccag 901 atcatccgga gctgcgaggc caaacagcag ctagaatacg ccctgctcat ctgccgcaac 961 ctcgccttct cccactgctg ctttaacccg gtgctctatg tcttcgtggg ggtcaagttc 1021 cgcacacacc tgaaacatgt tctccggcag ttctggttct gccggctgca ggcacccagc 1081 ccagcctcga tcccccactc ccctggtgcc ttcgcctatg agggcgcctc cttctactga 1141 ggggcctgtg gcggtgcagg cgcaggtgca ggtggacagg gactggaatg ggggtcatgg 1201 agaagcgggc ctggaaggag cattgcagaa cacagcaggg tggagacgtc tcctcctgct 1261 gcagg // LOCUS HUMH1T 1759 bp DNA PRI 18-OCT-1991 DEFINITION Human testicular H1 histone (H1) gene, complete cds. ACCESSION M60094 NID g183750 KEYWORDS histone H1. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1759) AUTHORS Drabent,B., Kardalinou,E. and Doenecke,D. TITLE Structure and expression of the human gene encoding testicular H1 histone (Hlt) JOURNAL Gene 103, 263-268 (1991) MEDLINE 91365256 FEATURES Location/Qualifiers source 1..1759 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" mRNA 487..1211 /evidence=experimental gene 530..1153 /gene="H1t" CDS 530..1153 /gene="H1t" /codon_start=1 /product="testicular H1 histone" /db_xref="PID:g183751" /translation="MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSV SKLITEALSVSQERVGMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRG TGASGSFKLSKKVIPKSTRSKAKKSVSAKTKKLVLSRDSKSPKTAKTNKRAKKPRATT PKTVRSGRKAKGAKGKQQQKSPVKARASKSKLTQHHEVNVRKATSKK" BASE COUNT 509 a 391 c 418 g 441 t ORIGIN 1 gtcactccgc aattagacag ctaagagatc tgtgttactt ccctcacata tataaataat 61 tttaaataaa aatcatggcg tgaataattt ctttcctcta ccgatttgaa gctatccatt 121 tggaagacca ctctgaagag atgaaataag tcttctgcca aagattactt attaatttac 181 aaggaaaagg ggaagttttg ttcctctccg tgaatttgat tgaaaatcga gggctttctc 241 gaatagtttt ggcatccagg gtcatttttc attaaaaaga gaaaagtcat gtcaaatatg 301 aatttccgca gattattcag cactagaccc tgggagattc tgtaaagagg ggttttgtta 361 tactcaactt ttccgggtaa aacaaacaca aatactcctc ctccaagggg cgggggcggt 421 gcctaggtga tgcaccaatc acagcgcgcc ctaccctata taagccccga ggccgcccgg 481 gtgtttcatg cttttcgctg gttattacat cttgcgtttc tctgttgtta tgtctgaaac 541 cgtgcctgca gcttctgcca gtgctggtct agccgctatg gagaaacttc caaccaagaa 601 gcgagggagg aagccggctg gcttgataag tgcaagtcgc aaagtgccga acctctctgt 661 gtccaagttg atcaccgagg ccctttcagt gtcacaggaa cgagtaggta tgtctttggt 721 tgcgctcaag aaggcattgg ccgctgctgg ctacgacgta gagaagaata acagccgcat 781 caaactgtcc ctcaagagct tagtgaacaa gggaatcctg gtgcaaacca ggggtactgg 841 tgcttccggt tcctttaagc ttagtaagaa ggtgattcct aaatctacac gaagcaaggc 901 taaaaagtca gtttctgcca agaccaagaa gctggtttta tccagggact ccaagtcacc 961 aaagactgct aaaaccaata agagagccaa gaagccgaga gcgacaactc ctaaaactgt 1021 taggagcggg agaaaggcta aaggagccaa gggtaagcaa cagcagaaga gcccagtgaa 1081 ggcaagggct tcgaagtcaa aattgaccca acatcatgaa gttaatgtta gaaaggccac 1141 atctaagaag taaagagctt tccgggaggc caatttggaa agaacccaaa ggctctttta 1201 agagccaccc acattatttt aagatggcgt aacactggaa acaagtttct gtgacagtta 1261 tctataggtt taagttgtga tgcagctgag ttgaaaaggc ttgagattgg agaattaatt 1321 caggccaggc ttcaagacca tcctgggcaa catagccaga ctaccatcta taccaggggt 1381 cctcattccc ccggccaccg accggtaacc ggtccctgtc catggcacgt tatgaattga 1441 gccgcacagc tgaggggtga gcgaacatta accaactgag ctccaccgcc tgtcaggtta 1501 gctgcagcat tagatagatt ctcataagct caaactgtat tgtgaatggc acatgcaagg 1561 gatctaggtt tcaggctcct tgtgacaatc taatgcctga tgatctgagg ttggagcagt 1621 tttagtccgg aaatcattgc tcccagcccc tgcaccccct ggtccgtggt ataattgtct 1681 tacacaaacg gtctcttgtg tcaaaaaggt tggagactac tggtttttac aaaaaagtaa 1741 attagtcaag catggttgg // LOCUS HUMHEN1A 600 bp DNA PRI 31-DEC-1994 DEFINITION Homo sapiens helix-loop-helix protein (HEN1) gene, complete cds. ACCESSION M97507 NID g183946 KEYWORDS helix-loop-helix protein. SOURCE Homo sapiens (tissue library: RPMI-8402/2001) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 600) AUTHORS Brown,L., Espinosa,R. III., Le Beau,M.M., Siciliano,M.J. and Baer,R. TITLE HEN1 and HEN2: a subgroup of basic helix-loop-helix genes that are coexpressed in a human neuroblastoma JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (18), 8492-8496 (1992) MEDLINE 92409542 FEATURES Location/Qualifiers source 1..600 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RPMI-8402" /tissue_lib="RPMI-8402/2001" intron <1..24 /gene="HEN1" gene 1..600 /gene="HEN1" CDS 199..600 /gene="HEN1" /codon_start=1 /product="helix-loop-helix protein" /db_xref="PID:g183947" /translation="MMLNSDTMELDLPPTHSETESGFSDCGGGAGPDGAGPGGPGGGQ ARGPEPGEPGRKDLQHLSREERRRRRRATAKYRTAHATRERIRVEAFNLAFAELRKLL PTLPPDKKLSKIEILRLAICYISYLNHVLDV" BASE COUNT 109 a 209 c 180 g 102 t ORIGIN 1 tcacttctct cttctctttt tcaggcttca gactggcacc ctgaccatgg aaccctgaag 61 tggcagtgac ttctagagct cagtggcaga ccccacgacc cttcctcccc cttcctcccc 121 ctcccaccac cagctttcaa gtcccagagg gaggggtggg gaggggatcc tgatctcaca 181 gggcaggggg cttccatcat gatgctcaac tcagacacca tggagctgga cctgccgccc 241 acccactcag agactgagtc gggcttcagt gactgtgggg gcggggcggg ccctgatggt 301 gccgggcctg ggggtccggg agggggccag gcccgaggcc cagagccggg agagcctggc 361 cggaaagacc tgcagcatct gagccgcgag gagcgccggc gccggcgccg cgccacagcc 421 aagtaccgca cggcccacgc cacgcgagaa cgcatccgcg tggaagcctt caacctggcc 481 ttcgccgagc tgcgcaagct gctgcctacg ctgccccccg acaagaagct ctccaagatt 541 gagattctgc gcctggccat ctgctatatc tcctacctga accacgtgct ggacgtctga // LOCUS HUMHISAC 1978 bp DNA PRI 07-MAR-1995 DEFINITION Human histone H1 (H1F4) gene, complete cds. ACCESSION M60748 NID g184073 KEYWORDS histone H1. SOURCE Human blood DNA, clone C3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1978) AUTHORS Albig,W., Kardalinou,E., Drabent,B., Zimmer,A. and Doenecke,D. TITLE Isolation and characterization of two human H1 histone genes within clusters of core histone genes JOURNAL Genomics 10 (4), 940-948 (1991) MEDLINE 92009931 FEATURES Location/Qualifiers source 1..1978 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C3" /tissue_type="blood" /map="12q11-q21" gene 730..1389 /gene="H1F4" CDS 730..1389 /gene="H1F4" /note="putative" /codon_start=1 /db_xref="GDB:G00-120-030" /product="histone H1" /db_xref="PID:g184074" /translation="MSETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELI TKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGAS GSFKLNKKAASGEAKPKAKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKK PAAAAGAKKAKSPKKAKAAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKK K" BASE COUNT 532 a 494 c 544 g 408 t ORIGIN 1 aagggaaaga attatccaag aattgtttaa aaactcagat gtagcggaca gatgtaaaac 61 catggctgta tagattgatg tcccaggggt ccaaaactta atctcaaatg ggcaataatt 121 tgtttggcat taaactaaac cagtttgatg aactcaaatg ccctcggctc aataggcagg 181 actctccgag gagcctgtgt tacttccctc acttaagtgc agatttgtaa taaaaatctt 241 aatgccagtg gcatgctttt tggatatata agaagctaac cacttggagt atcatatttg 301 agaggtcaga aaagtccaca gttaaagatc ggtttataat ttacgaagaa atagaaagtt 361 ttgtttcctc ctgagttgaa atttgccaag cacggaggaa atattgcaag tttttggcac 421 aaggctttct gcttcccctt ataatttgag atctgcgtga agcctgaggg ttcggggatc 481 attatctgag aaaaaccggg cagttcggtg tagacaattt ttatattttt ggcttttttt 541 gaggtgtaac aaacacaact cgggatccga gaggacactc tgcggctgcc agcgaggcgg 601 gctggacagc gcaccaatca cggcgcacgt ccgccctata taaacgggcg ggcgcagcgc 661 cgcggctcga gtcccggcca gtgcctctgc ttccggctcg aattgctctc gctcacgctt 721 gccttcaaca tgtccgagac tgcgcctgcc gcgcccgctg ctccggcccc tgccgagaag 781 actcccgtga agaagaaggc ccgcaagtct gcaggtgcgg ccaagcgcaa agcgtctggg 841 cccccggtgt ccgagctcat tactaaagct gttgccgcct ccaaggagcg cagcggcgta 901 tctttggccg ctctcaagaa agcgctggca gccgctggct atgacgtgga gaaaaacaac 961 agccgcatca agctgggtct caagagcctg gtgagcaagg gcaccctggt gcagaccaag 1021 ggcaccggcg cgtcgggttc cttcaaactc aacaagaagg cggcctctgg ggaagccaag 1081 cctaaggcta aaaaggcagg cgcggccaag gccaagaagc cagcaggagc ggcgaagaag 1141 cccaagaagg cgacgggggc ggccaccccc aagaagagcg ccaagaagac cccaaagaag 1201 gcgaagaagc cggctgcagc tgctggagcc aaaaaagcga aaagcccgaa aaaggcgaaa 1261 gcagccaagc caaaaaaggc gcccaagagc ccagcgaagg ccaaagcagt taaacccaag 1321 gcggctaaac caaagaccgc caagcccaag gcagccaagc caaagaaggc ggcagccaag 1381 aaaaagtaga aagttccttt ggccaactgc ttagaagccc aacacaaccc aaaggctctt 1441 ttcagagcca cccaccgctc tcagtaaaag agctgttgca ctattagggg gcgtggctcg 1501 ggaaaacgct gctaagcagg ggcgggtctc ccgggaacaa agtcggggag aggagtggga 1561 ttttgtgtgt ctccggagct atttttgact atggcgtcgc gtcgcccaag ccggagtgca 1621 gtggcgtcat ctcgattttg cgttctcgag tgtcggagtt gaacccattt gggcctccct 1681 tgtgctttgc cttttagcag gccctggctc cagatagcat gggaaaaaaa atgttgggat 1741 tttccccggg tttctaagct gggtttttcc gagttccaaa cacggcacag tgtatcagtt 1801 tctgtgctgg ttacaagcct actggttatc cctatcgagt atggcaggca gtgagggact 1861 tcagaggagt acgtcttagg acaagtggca tagtactgac attatttccg aagggctaca 1921 tttcaagtgc ttggggagac tactgccaca taactgaaat tagaaaccaa cactgcag // LOCUS HUMHISAG 871 bp DNA PRI 07-MAR-1995 DEFINITION Human histone H2A.1 (H2A) gene, complete cds. ACCESSION M60752 NID g184081 KEYWORDS histone H2A.1. SOURCE Human blood DNA, clone C5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 871) AUTHORS Albig,W., Kardalinou,E., Drabent,B., Zimmer,A. and Doenecke,D. TITLE Isolation and characterization of two human H1 histone genes within clusters of core histone genes JOURNAL Genomics 10 (4), 940-948 (1991) MEDLINE 92009931 FEATURES Location/Qualifiers source 1..871 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C5" /tissue_type="blood" gene 332..724 /gene="H2A" CDS 332..724 /gene="H2A" /note="putative" /codon_start=1 /product="histone H2A.1" /db_xref="PID:g184082" /translation="MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYSERV GAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVT IAQGGVLPNIQAVLLPKKTESHHKAKGK" BASE COUNT 234 a 228 c 192 g 217 t ORIGIN 1 gctgtcagaa aacaataaca gcagtgagaa tgaacgcact taaataaaag ctcgtgtcta 61 gagtctctcc ttttataggc ctttcatgca aataaagaat tcaaaatatc cagctctgat 121 tgggcaatgt gttagtgacg catacatgta aaatagcctt caccttattt cctttctaat 181 tggttggctc gtcaaagaac aattttaacc aatcaaattg cgcctttcac aattctaccg 241 atgactataa ctagcttctt attcctccat cgagcccatt ctttttcttt attcagtgga 301 ttgttagttc ttctgctgtt aggaagccac tatgtctgga cgtggaaagc aaggcggcaa 361 agctcgggca aaagctaaaa cgcgttcttc cagggccggt cttcagtttc cagttggccg 421 tgtgcaccgc ctcctccgca aaggcaacta ctccgaacga gtcggggccg gcgctccagt 481 gtacctggca gcggtgctgg aatatctgac ggccgagatc ttagagctag ctggcaacgc 541 ggctcgcgac aataagaaga cccgcatcat cccgcgccac ctgcagctag ccatccgcaa 601 cgacgaggag ctaaataagc ttctaggtcg cgtgaccatc gcgcagggcg gtgtcctgcc 661 caacatccag gccgtattgc tgcctaagaa gacggagagc caccataagg ccaagggcaa 721 gtgaaatgat tactagtcaa atccgtcagt gatcccgagt cccagaaacc aaaggctctt 781 ttcagagcca cccacctttt ctgtaaagtg ctggaataca catacgatgc ctgaaatctc 841 aatgttcact gtcctaattt ttaacgaact t // LOCUS HUMHISH2R 1191 bp DNA PRI 31-DEC-1994 DEFINITION Human histamine H2 receptor gene, complete cds. ACCESSION M64799 NID g184087 KEYWORDS histamine H2 receptor. SOURCE Homo sapiens (tissue library: Clontech) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1191) AUTHORS Gantz,I., Munzert,G., Tashiro,T., Schaffer,M., Wang,L., DelValle,J. and Yamada,T. TITLE Molecular cloning of the human histamine H2 receptor JOURNAL Biochem. Biophys. Res. Commun. 178 (3), 1386-1392 (1991) MEDLINE 91337087 FEATURES Location/Qualifiers source 1..1191 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="Clontech" gene 1..1080 /gene="histamine H2 receptor" CDS 1..1080 /gene="histamine H2 receptor" /codon_start=1 /product="histamine H2 receptor" /db_xref="PID:g184088" /translation="MAPNGTASSFCLDSTACKITITVVLAVLILITVAGNVVVCLAVG LNRRLRNLTNCFIVSLAITDLLLGLLVLPFSAIYQLSCKWSFGKVFCNIYTSLDVMLC TASILNLFMISLDRYCAVMDPLRYPVLVTPVRVAISLVLIWVISITLSFLSIHLGWNS RNETSKGNHTTSKCKVQVNEVYGLVDGLVTFYLPLLIMCITYYRIFKVARDQAKRINH ISSWKAATIREHKATVTLAAVMGAFIICWFPYFTAFVYRGLRGDDAINEVLEAIVLWL GYANSALNPILYAALNRDFRTGYQQLFCCRLANRNSHKTSLRSNASQLSRTQSREPRQ QEEKPLKLQVWSGTEVTAPQGATDR" BASE COUNT 250 a 377 c 302 g 262 t ORIGIN 1 atggcaccca atggcacagc ctcttccttt tgcctggact ctaccgcatg caagatcacc 61 atcaccgtgg tccttgcggt cctcatcctc atcaccgttg ctggcaatgt ggtcgtctgt 121 ctggccgtgg gcttgaaccg ccggctccgc aacctgacca attgtttcat cgtgtccttg 181 gctatcactg acctgctcct cggcctcctg gtgctgccct tctctgccat ctaccagctg 241 tcctgcaagt ggagctttgg caaggtcttc tgcaatatct acaccagcct ggatgtgatg 301 ctctgcacag cctccattct taacctcttc atgatcagcc tcgaccggta ctgcgctgtc 361 atggacccac tgcggtaccc tgtgctggtc accccagttc gggtcgccat ctctctggtc 421 ttaatttggg tcatctccat taccctgtcc tttctgtcta tccacctggg gtggaacagc 481 aggaacgaga ccagcaaggg caatcatacc acctctaagt gcaaagtcca ggtcaatgaa 541 gtgtacgggc tggtggatgg gctggtcacc ttctacctcc cgctactgat catgtgcatc 601 acctactacc gcatcttcaa ggtcgcccgg gatcaggcca agaggatcaa tcacattagc 661 tcctggaagg cagccaccat cagggagcac aaagccacag tgacactggc cgccgtcatg 721 ggggccttca tcatctgctg gtttccctac ttcaccgcgt ttgtgtaccg tgggctgaga 781 ggggatgatg ccatcaatga ggtgttagaa gccatcgttc tgtggctggg ctatgccaac 841 tcagccctga accccatcct gtatgctgcg ctgaacagag acttccgcac cgggtaccaa 901 cagctcttct gctgcaggct ggccaaccgc aactcccaca aaacttctct gaggtccaac 961 gcctctcagc tgtccaggac ccaaagccga gaacccaggc aacaggaaga gaaacccctg 1021 aagctccagg tgtggagtgg gacagaagtc acggcccccc agggagccac agacaggtaa 1081 aagctccagg tgtggagtgg gacagaagtc acggcccccc agggagccac agacaggtaa 1141 gcgctgaaca gagacttccg caccgggtac caacagctct tctgctgcag g // LOCUS HUMHLGS 2373 bp DNA PRI 24-MAY-1996 DEFINITION Human gene for liver glycogen synthase, complete cds. ACCESSION D29685 NID g517111 KEYWORDS liver glycogen synthase. SOURCE Homo sapiens (strain caucasian) female liver DNA, clone HLGS. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2373) AUTHORS Nakabayashi,H. and Nakayama,T. TITLE Human liver glycogen synthase cDNA JOURNAL Unpublished (1994) REFERENCE 2 (bases 1 to 2373) AUTHORS Nakabayashi,H. TITLE Direct Submission JOURNAL Submitted (28-MAR-1994) to the DDBJ/EMBL/GenBank databases. Hiroki Nakabayashi, Nihon University School of Medicine, Medical Reserch Institute; Oyaguchikami-machi, Itabashi-ku, Tokyo 173, Japan (Tel:03-3972-8111(ex.2330), Fax:03-3972-8830) COMMENT Submitted (28-Mar-1994) to DDBJ by: Hiroki Nakabayashi Medical Research Institute Nihon Unviersity School of Medicine Oyaguchikami-Machi, Itabashi-ku Tokyo 173 Japan Phone: 03-3972-8111 x23301 Fax: 03-3972-8830. FEATURES Location/Qualifiers source 1..2373 /organism="Homo sapiens" /strain="caucasian" /db_xref="taxon:9606" /sex="female" /tissue_type="liver" gene 255..2369 /gene="human liver glycogen synthase gene" CDS 255..2369 /gene="human liver glycogen synthase gene" /standard_name="HLGS" /EC_number="2.4.1.11" /codon_start=1 /product="human liver glycogen synthase" /db_xref="PID:d1006716" /db_xref="PID:g517112" /translation="MLRGRSLSVTSLGGLPQWEVEELPVEELLLFEVAWEVTNKVGGI YTVIQTKAKTTADEWGENYFLIGPYFEHNMKTQVEQCEPVNDAVRRAVDAMNMHGCQV HFGRWLIEGSPYVVLFDIGYSAWNLDRWKGDLWEACSVGIPYHDREANDMLIFGSLTA WFLKEVTDHADGKYVVARFHEWQAGVGLILSRARKLPIATIFTTHATLLGRYLCAANI DFYNHLDKFNIDKEAGERQIYHRYCMERASVHCAHVFTTVSEITAIEAEHMLKRKPDV VTPNGLNVKKFSAVHEFQNLHAMYKARIQDFVRGHFYGHLDFDLEKTLFLFIAGRYEF FKTKGADIFLDSLSRLNFLLRMHKSDITVVVFFIMPAKTNNFNVETLKGQAVRKQLWD VAHSVKEKFGKKLYDALLRGEIPDLNDILDRDDLTIMKRAIFSTQRQSLAPVTTHNMI DDSTDPILSTIRRIGLFNNRTDRVKVILHPEFLSSTSPLLPMDYEEFVRGCHLGVFPS YYEPWGYTPAECTVMGIPSVTTNLSGFGCFMQEHVADPTAYGIYIVDRRFRSPDDSCN QLTKFLYGFCNMSRRQRFIQRNRTERLSDLLDWRYLGRYYQHARHLTLSRAFPDKFHV ELTSPPTTEGFKYPRPSSVPPSPSGSQASSPQSSDVEDEVEDERYDEEEEAERDRLNI KSPFSLSHVPHGKKKLHGEYKN" BASE COUNT 688 a 502 c 552 g 631 t ORIGIN Chromosome 12. 1 agatactgac agggcagata ccgtcctcac aatacctgcc cagaaagacg agaaagagga 61 ggaagaattc ctccttccac caggaattct gtgggaagca cataagattt catgctacta 121 gtttattccc aagagaagct accaaagcct ggtaactcta ccaactctaa cttttgtgcc 181 tgtaagttct cttctcctgg gattacaact aattgaaaca ggaatcaaag gagtctcggt 241 ggactgtaag aagaatgctt cgaggccgat ccctctctgt aacatccctg ggtgggcttc 301 cccagtggga agtcgaagaa cttcctgtgg aggagttact gctctttgaa gttgcttggg 361 aagtgaccaa taaagttgga ggcatctata ctgtgattca gacaaaggcc aaaacaacag 421 cagatgaatg gggagagaac tattttctga taggtccata ttttgagcat aatatgaaga 481 ctcaggtgga acagtgtgaa cctgtaaatg atgctgtcag aagagcagtg gacgcaatga 541 atatgcatgg ctgccaggtg cattttggaa gatggctgat agaaggaagt ccttatgtgg 601 tactttttga cataggctat tcagcttgga atctggacag gtggaagggt gacctctggg 661 aagcatgcag tgtcggcatt ccttatcatg accgagaagc caatgatatg ctgatatttg 721 gatctttaac tgcctggttc ttaaaagaag tgacagatca tgcagatggt aaatatgtcg 781 ttgcccggtt ccatgaatgg caggctggag ttggactgat cctttctcga gccaggaaac 841 ttcctattgc cacaatattt acaacccacg ctacactact tgggaggtat ctctgtgcag 901 caaatattga tttctacaac catcttgata agtttaacat tgacaaagag gctggggaaa 961 ggcagattta ccaccggtac tgcatggagc gagcttccgt tcattgcgct cacgtgttca 1021 ccacggtttc tgaaataaca gcaatagaag ctgaacatat gctgaagaga aagcctgatg 1081 tagttactcc aaacggcttg aatgttaaga aattttcagc agtgcatgag tttcaaaatc 1141 tacatgccat gtacaaggcc agaatccaag attttgttcg aggtcatttc tatggtcatc 1201 tcgactttga tcttgaaaag actttgttcc ttttcattgc tgggaggtat gagtttttca 1261 aaacaaaagg agctgacatc ttcctagatt ccttatccag gctaaatttc ctgctgagga 1321 tgcataaaag tgacatcaca gtggtggtgt ttttcattat gcctgccaag acaaataatt 1381 tcaacgtgga aaccctgaaa ggacaagcag tgcgaaaaca gctgtgggat gttgcacatt 1441 ctgtgaagga aaagtttgga aaaaaactct atgatgcatt attaagagga gaaattcctg 1501 acctgaacga tattttagat cgagatgatc taacaattat gaaaagagcc atcttttcaa 1561 ctcagcgaca gtcattagcc ccagtgacca cgcacaacat gattgatgac tccaccgacc 1621 ccatcctcag caccattaga cggattggac ttttcaacaa ccgcacagat agagtcaagg 1681 tgattttgca cccagagttt ctatcctcca ccagtccctt actacccatg gactatgaag 1741 agtttgttag aggttgtcat cttggagtat ttccatcata ctatgaaccc tggggttata 1801 ctccagctga atgcactgtg atgggtatcc ccagtgtgac cacgaatctc tccgggtttg 1861 gctgtttcat gcaggagcac gtggctgatc ctactgctta cggtatttac atcgttgaca 1921 ggcggttccg ttctccagat gattcttgca atcagctgac taagtttctc tatggatttt 1981 gcaacatgtc acgccgccaa aggtttatcc agaggaacag aactgagagg ctctcagatc 2041 ttctggattg gagatactta ggcagatatt accagcatgc cagacacctg acattaagca 2101 gagcttttcc agataaattc catgtggaac taacatcacc accaacgaca gaaggattta 2161 aatatcccag gccttcctca gtaccacctt ctccttcagg gtctcaggcc tccagtcctc 2221 agagcagtga tgtggaagat gaagtggagg atgagagata cgatgaggaa gaggaggctg 2281 aaagggatcg gttaaatatc aagtcaccat tttcactgag ccacgttcct catgggaaga 2341 aaaagctgca tggtgaatat aagaactgaa ttc // LOCUS HUMHSPA2A 3457 bp DNA PRI 08-NOV-1994 DEFINITION Human heat shock protein HSPA2 gene, complete cds. ACCESSION L26336 NID g476704 KEYWORDS heat shock protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3457) AUTHORS Yu,C.-E., Bonnycastle,L.L.C., Hunt,C.R., Trask,B.J., Clancy,K.P., Weber,J.L., Patterson,D. and Schellenberg,G.D. TITLE Cloning, sequencing, and mapping of the human chromosome 14 heat shock protein gene (HSPA2) JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..3457 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14" gene 1087..3006 /gene="HSPA2" CDS 1087..3006 /gene="HSPA2" /codon_start=1 /db_xref="GDB:G00-120-059" /product="heat shock protein" /db_xref="PID:g476705" /translation="MSARGPAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVA FTDTERLIGDAAKNQVAMNPTNTIFDAKRLIGRKFEDATVQSDMKHWPFRVVSEGGKP KVQVEYKGETKTFFPEEISSMVLTKMKEIAEAYLGGKVHSAVITVPAYFNDSQRQATK DAGTITGLNVLRIINEPTAAAIAYGLDKKGCAGGEKNVLIFDLGGGTFDVSILTIEDG IFEVKSTAGDTHLGGEDFDNRMVSHLAEEFKRKHKKDIGPNKRAVRRLRTACERAKRT LSSSTQASIEIDSLYEGVDFYTSITRARFEELNADLFRGTLEPVEKALRDAKLDKGQI QEIVLVGGSTRIPKIQKLLQDFFNGKELNKSINPDEAVAYGAAVQAAILIGDKSENVQ DLLLLDVTPLSLGIETAGGVMTPLIKRNTTIPTKQTQTFTTYSDNQSSVLVQVYEGER AMTKDNNLLGKFDLTGIPPAPRGVPQIEVTFDIDANGILNVTAADKSTGKENKITITN DKGRLSKDDIDRMVQEAERYKSEDEANRDRVAAKNALESYTYNIKQTVEDEKLRGKIS EQDKNKILDKCQEVINWLDRNQMAEKDEYEHKQKELERVCNPIISKLYQGGPGGGSGG GGSGASGGPTIEEVD" BASE COUNT 811 a 1002 c 964 g 680 t ORIGIN 1 cctccacctc ccgggttcaa gcgattctcc tgcctcagcc tcccgagtag ctgagactac 61 aggcacgcgc caccacgccc agctaatttt tgtatcttta gtagagacgg gctttcacca 121 tgttggccag gatggtctcg atgtcttaac gtcgtgatcc ggccgcctcg gcctcccaag 181 tgctgggatt acaggcgtta gccactgcgc ccggccccag ccaggcagtt ttaatcgagc 241 gctcacaacc actgagacgc agcgaagcac ccaccataat atcccaggag gccgaccgcc 301 ggttcagact ttttcttttc tttaatcccc gtccaaggga tccgccctca ccccccaccc 361 cagccacccc aattccctat tccctcccct tggacggcgc cggggaaaac aagctgctcg 421 agctttattt cttcggtgca accaactcag aatgaattcc tccgcccctg cgtgctcagt 481 gagtcggcac cctagcagtg aactgcattt aaaacctcag gaattgagcg aactctccca 541 gtggctctcc tcaccgggat ccccttccac gcctcctccc cgtgccgcgc ctcagtccgc 601 actgctcatt ggccgcgtgc ctgccaatcc gatgcacgtc ggctagggca aagaccgcga 661 aaaagcgcgt acacctggct ctgggagcgc gcgcctaacg ccagccagca gcaggaggcg 721 cgcgaggcac cacggcctgg cggccgagag tcagggagga acctcattta cataacggcc 781 gcccctctgt ctcctggcgg gggccggagt cccgcccctc gtccaacttg aaatctgttg 841 ggtcacgggc cagtcactcc gacctaggca agcctgtggt ggagctggaa gagtttgtga 901 gggcggtccc gggagcggat tgggtctggg agttcccaga ggcggctata agaaccggga 961 actgggcgcg gggagctgag ttgctggtag tgcccgtggt gcttggttcg aggtggccgt 1021 tagttgactc cgcggagttc atctccctgg ttttcccgtc ctaacgtcgc tcgcctttca 1081 gtcaggatgt ctgcccgtgg cccggctatc ggcatcgacc tgggcaccac ctattcgtgc 1141 gtcggggtct tccaacatgg caaggtggag atcatcgcca acgaccaggg caatcgcacc 1201 acccccagct acgtggcctt cacggacacc gagcgcctca tcggcgacgc cgccaagaac 1261 caggtggcca tgaaccccac caacaccatc ttcgacgcca agaggctgat tggacggaaa 1321 ttcgaggatg ccacagtgca gtcggatatg aaacactggc cgttccgggt ggtgagcgag 1381 ggaggcaagc ccaaagtgca agtagagtac aagggggaga ccaagacctt cttcccagag 1441 gagatatcct ccatggtcct cacgaagatg aaggagatcg cggaagccta cctggggggc 1501 aaggtgcaca gcgcggtcat aacggtcccg gcctatttca acgactcgca gcgccaggcc 1561 accaaggacg caggcaccat cacggggctc aatgtgctgc gcatcatcaa cgagcccacg 1621 gcggcggcca tcgcctacgg cctggacaag aagggctgcg cgggcggcga gaagaacgtg 1681 ctcatctttg acctgggcgg tggcactttc gacgtgtcca tcctgaccat cgaggatggc 1741 atcttcgagg tgaagtccac ggccggcgat acccacctgg gcggtgagga cttcgacaac 1801 cgcatggtga gccacctggc ggaggagttc aagcgcaagc acaagaagga cattgggccc 1861 aacaagcgcg ccgtgaggcg gctgcgcacc gcttgcgagc gcgccaagcg caccctgagc 1921 tcgtccacgc aggcgagcat cgagatcgac tcgctctacg agggcgtgga cttctatacg 1981 tccatcacgc gcgcccgctt cgaggagctc aatgccgacc tctttcgcgg gaccctggag 2041 ccggtggaga aggcgctgcg cgacgccaag ctggacaagg gccagatcca ggagatcgtg 2101 ctggtgggcg gctccactcg tatccccaag atccagaagc tgctgcagga tttcttcaac 2161 ggcaaggagc tgaacaagag catcaacccc gacgaggcgg tggcctatgg cgccgcggtg 2221 caggcggcca tcctcatcgg cgacaaatca gagaatgtgc aggacctgct gctactcgac 2281 gtgaccccgt tgtcgctggg catcgagaca gctggcggtg tcatgacccc actcatcaag 2341 aggaacacca cgatccccac caagcagacg cagaccttca ccacctactc ggacaaccag 2401 agcagcgtac tggtgcaggt atacgagggc gaacgggcca tgaccaagga caataacctg 2461 ctgggcaagt tcgacctgac cgggattccc cctgcgcctc gcggggtccc ccaaatcgag 2521 gttaccttcg acattgacgc caatggcatc cttaacgtta ccgccgccga caagagcacc 2581 ggtaaggaaa acaaaatcac catcaccaat gacaaaggtc gtctgagcaa ggacgacatt 2641 gaccggatgg tgcaggaggc ggagcggtac aaatcggaag atgaggcgaa tcgcgaccga 2701 gtcgcggcca aaaacgccct ggagtcctat acctacaaca tcaagcagac ggtggaagac 2761 gagaaactga ggggcaagat tagcgagcag gacaaaaaca agatcctcga caagtgtcag 2821 gaggtgatca actggctcga ccgaaaccag atggcagaga aagatgagta tgaacacaag 2881 cagaaagagc tcgaaagagt ttgcaacccc atcatcagca aactttacca aggtggtcct 2941 ggcggcggca gcggcggcgg cggttcagga gcctccgggg gacccaccat cgaagaagtg 3001 gactaagctt gcactcaagt cagcgtaaac ctctttgcct ttctctctct ctcttttttt 3061 tttgtttgtt tctttgaaat gtccttgtgc caagtacgag atctattgtt ggaagtcttt 3121 ggtatatgca aatgaaagga gaggtgcaac aacttagttt aattataaaa gttccaaagt 3181 ttgtttttta aaaacattat tcgaggtttc tctttaatgc attttgcgtg tttgctgact 3241 tgagcatttt tgattagttc gtgcatggag atttgtttga gatgagaaac cttaagtttg 3301 cacacctgtt ctgtagaagc ttggaaacag taaaatatat aggagcttaa attgtttatt 3361 tttatgtact actttaaaac taaactgaac attgcagtaa tgttaaggac aggtatactt 3421 tttgcaaaca aatgcataaa tgcaaatgta aagtaaa // LOCUS HUMHTR1DB 1959 bp DNA PRI 31-DEC-1994 DEFINITION Human serotonin 1Db receptor (HTR1D) gene, complete cds. ACCESSION M75128 NID g184459 KEYWORDS serotonin 1Db receptor. SOURCE Homo sapiens (tissue library: EMBL3 SP6/T7 (Clontech)) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1959) AUTHORS Demchyshyn,L., Sunahara,R.K., Miller,K., Teitler,M., Hoffman,B.J., Kennedy,J.L., Seeman,P., Van Tol,H.H. and Niznik,H.B. TITLE A human serotonin 1D receptor variant (5HT1D beta) encoded by an intronless gene on chromosome 6 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (12), 5522-5526 (1992) MEDLINE 92302275 FEATURES Location/Qualifiers source 1..1959 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="EMBL3 SP6/T7 (Clontech)" gene 619..1791 /gene="HTR1D" CDS 619..1791 /gene="HTR1D" /codon_start=1 /product="serotonin 1Db receptor" /db_xref="PID:g184460" /translation="MEEPGAQCAPPPPAGSETWVPQANLSSAPSQNCSAKDYIYQDSI SLPWKVLLVMLLALITLATTLSNAFVIATVYRTRKLHTPANYLIASLAVTDLLVSILV MPISTMYTVTGRWTLGQVVCDFWLSSDITCCTASILHLCVIALDRYWAITDAVEYSAK RTPKRAAVMIALVWVFSISISLPPFFWRQAKAEEEVSECVVNTDHILYTVYSTVGAFY FPTLLLIALYGRIYVEARSRILKQTPNRTGKRLTRAQLITDSPGSTSSVTSINSRVPD VPSESGSPVYVNQVKVRVSDALLEKKKLMAARERKATKTLGIILGAFIVCWLPFFIIS LVMPICKDACWFHLAIFDFFTWLGYLNSLINPIIYTMSNEDFKQAFHKLIRFKCTS" BASE COUNT 393 a 627 c 516 g 423 t ORIGIN 1 gagctccggc gcgaggcgcg gcgcagcgct gctcctagac ttcaccccac ccagctctgg 61 cggccgctgc agccccccaa aagtgcccca gcttggggcg aggggtggga atgcaagatc 121 tcgggacctc tcgctggcct gcaagctttg gtctctacac ctaggaaact cctgtgggca 181 aagtctgcag atccaaaagc gtccaggtta ggagacgctc agcctcaagc aactggggta 241 agagatccca tttggtcaaa gccttctcct caagcagtac ttcaccctcc tgcactagac 301 gcctccaggg agctggagcg gagcagggct cggtgggcca gctcttagca acccaggtct 361 aagacccggt gtggagagga acaaccacag acgcggcggc ttagctaggc gctctggaag 421 tgcaggggag gcgccgcctg ccttggctgc cgcacccatg acctctagtt tcagctgtga 481 acctgggcgg aggaataatt gaggaactca cggaactatc aactggggac aaacctgcga 541 tcgccacggt ccttccgccc tctccttcgt ccgctccatg cccaagagct gcgctccgga 601 gctggggcga ggagagccat ggaggaaccg ggtgctcagt gcgctccacc gccgcccgcg 661 ggctccgaga cctgggttcc tcaagccaac ttatcctctg ctccctccca aaactgcagc 721 gccaaggact acatttacca ggactccatc tccctaccct ggaaagtact gctggttatg 781 ctattggcgc tcatcacctt ggccaccacg ctctccaatg cctttgtgat tgccacagtg 841 taccggaccc ggaaactgca caccccggct aactacctga tcgcctctct ggcggtcacc 901 gacctgcttg tgtccatcct ggtgatgccc atcagcacca tgtacactgt caccggccgc 961 tggacactgg gccaggtggt ctgtgacttc tggctgtcgt cggacatcac ttgttgcact 1021 gcctccatcc tgcacctctg tgtcatcgcc ctggaccgct actgggccat cacggacgcc 1081 gtggagtact cagctaaaag gactcccaag agggcggcgg tcatgatcgc gctggtgtgg 1141 gtcttctcca tctctatctc gctgccgccc ttcttctggc gtcaggctaa ggccgaagag 1201 gaggtgtcgg aatgcgtggt gaacaccgac cacatcctct acacggtcta ctccacggtg 1261 ggtgctttct acttccccac cctgctcctc atcgccctct atggccgcat ctacgtagaa 1321 gcccgctccc ggattttgaa acagacgccc aacaggaccg gcaagcgctt gacccgagcc 1381 cagctgataa ccgactcccc cgggtccacg tcctcggtca cctctattaa ctcgcgggtt 1441 cccgacgtgc ccagcgaatc cggatctcct gtgtatgtga accaagtcaa agtgcgagtc 1501 tccgacgccc tgctggaaaa gaagaaactc atggccgcta gggagcgcaa agccaccaag 1561 accctaggga tcattttggg agcctttatt gtgtgttggc tacccttctt catcatctcc 1621 ctagtgatgc ctatctgcaa agatgcctgc tggttccacc tagccatctt tgacttcttc 1681 acatggctgg gctatctcaa ctccctcatc aaccccataa tctataccat gtccaatgag 1741 gactttaaac aagcattcca taaactgata cgttttaagt gcacaagttg acttgccgtt 1801 tgcagtgggg tcgcctaagc gacctttggg gaccaagttg tgtctggttc cacaggtagg 1861 tcgaatcttc tttcgcggtt tctgggtccc agcgaggctc tctctcctgg gcaagggcaa 1921 tggatcctga gaagccagaa tagtcctgag agagagctc // LOCUS HUMIFNAII 1257 bp DNA PRI 08-NOV-1994 DEFINITION Human interferon-alpha class II (IFNA-II-1) gene, complete cds. ACCESSION M11003 NID g184610 KEYWORDS interferon alpha-II-1. SOURCE Homo sapiens foetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1257) AUTHORS Capon,D.J., Shepard,H.M. and Goeddel,D.V. TITLE Two distinct families of human and bovine interferon-alpha genes are coordinately expressed and encode functional polypeptides JOURNAL Mol. Cell. Biol. 5 (4), 768-779 (1985) MEDLINE 85187974 FEATURES Location/Qualifiers source 1..1257 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="liver" /map="9p22" mRNA 140..>795 /gene="IFNA" /note="G00-119-328" gene 140..795 /gene="IFNA" CDS 208..795 /gene="IFNA" /note="class II" /codon_start=1 /db_xref="GDB:G00-119-328" /product="interferon-alpha" /db_xref="PID:g386800" /translation="MALLFPLLAALVMTSYSPVGSLGCDLPQNHGLLSRNTLVLLHQM RRISPFLCLKDRRDFRFPQEMVKGSQLQKAHVMSVLHEMLQQIFSLFHTERSSAAWNM TLLDQLHTELHQQLQHLETCLLQVVGEGESAGAISSPALTLRRYFQGIRVYLKEKKYS DCAWEVVRMEIMKSLFLSTNMQERLRSKDRDLGSS" BASE COUNT 386 a 272 c 235 g 364 t ORIGIN Chromosome 9p22-p13. 1 tagattgttg tcatcctctt aagtcatagg gagaacacac aaatgaaaac agtaaaagaa 61 actgaaagta cagagaaatg ttcagaaaat gaaaaccatg tgtttcctat taaaagccat 121 gcatacaagc aatgtcttca gaaaacctag ggtccaaggt taagccatat cccagctcag 181 taaagccagg agcatcctca tttcccaatg gccctcctgt tccctctact ggcagcccta 241 gtgatgacca gctatagccc tgttggatct ctgggctgtg atctgcctca gaaccatggc 301 ctacttagca ggaacacctt ggtgcttctg caccaaatga ggagaatctc ccctttcttg 361 tgtctcaagg acagaagaga cttcaggttc ccccaggaga tggtaaaagg gagccagttg 421 cagaaggccc atgtcatgtc tgtcctccat gagatgctgc agcagatctt cagcctcttc 481 cacacagagc gctcctctgc tgcctggaac atgaccctcc tagaccaact ccacactgaa 541 cttcatcagc aactgcaaca cctggagacc tgcttgctgc aggtagtggg agaaggagaa 601 tctgctgggg caattagcag ccctgcactg accttgagga ggtacttcca gggaatccgt 661 gtctacctga aagagaagaa atacagcgac tgtgcctggg aagttgtcag aatggaaatc 721 atgaaatcct tgttcttatc aacaaacatg caagaaagac tgagaagtaa agatagagac 781 ctgggctcat cttgaaatga ttctcattga ttaatttgcc ataataacac ttgcacatgt 841 gactctggtc aattcaaaag actcttattt cggctttaat cacagaatga ctgaattagt 901 tctgcaaata ctttgtcggt atattaagcc agtatatgtt aaaaagactt aggttcaggg 961 gcatcagtcc ctaagatgtt atttattttt actcatttat ttattcttac attttatcat 1021 atttatacta tttatattct tatataacaa atgtttgcct ttacattgta ttaagataac 1081 aaaacatgtt cagctttcca tttggttaaa tattgtattt tgttatttat taaattattt 1141 tcaaacaaaa cttcttgaag ttatttattc gaaaaccaaa atccaaacac tagttttctg 1201 aaccaaatca aggaatggac ggtaatatac acttacctat tcattcattc catttac // LOCUS HUMIFP 1602 bp DNA PRI 30-SEP-1988 DEFINITION Human 40-kDa keratin intermediate filament precursor gene. ACCESSION J03607 NID g184658 KEYWORDS . SOURCE Human, DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1602) AUTHORS Eckert,R.L. TITLE Sequence of the human 40-kDa keratin reveals an unusual structure with very high sequence identity to the corresponding bovine keratin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 1114-1118 (1988) MEDLINE 88124986 COMMENT Draft entry and clean copy of sequence [1] kindly provided by R.L.Eckert (01/04/88). FEATURES Location/Qualifiers source 1..1602 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 265..1467 /partial /note="40-kDa keratin protein" /codon_start=1 /db_xref="PID:g386803" /translation="MTSYSYRQSSATSSFGGLGGGSVRFGPGVAFRAPSIHGGSGGRG VSVSSARFVSSSSSGGYGGGYGGVLTASDGLLAGNEKLTMQNLNDRLASYLDKVRALE AANGELEVKIRDWYQKQGPGPSRDYSHYYTTIQDLRDKILGATIENSRIVLQIDNARL AADDFRTKFETEQALRMSVEADINGLRRVLDELTLARTDLEMQIEGLKEELAYLKKNH EEEISTLRGQVGGQVSVEVDSAPGTDLAKILSDMRSQYEVMAEQNRKDAEAWFTSRTE ELNREVAGHTEQLQMSRSEVTDLRRTLQGLEIELQSQLSMKAALEDTLAETEARFGAQ LAHIQALISGIEAQLGDVRADSERQNQEYQRLMDIKSRLEQEIATYRSLLEGQEDHYN NLSASKVL" BASE COUNT 329 a 473 c 519 g 281 t ORIGIN 1 cgcggaccgg ggcggggcac ctctggaggg caggggcctc tggtctctgg gaggggaggg 61 aattgaccaa tggggagaga gcccatattt gctctcagga gcctgcaaat tcctcagggc 121 tcagatatcc gcccctgaca ccattcctcc cttcccccct ccaccggccg cgggcataaa 181 aggcgccagg tgagggcctc gccgctcctc ccgcgaatcg cagcttctga gaccagggtt 241 gctccgtccg tgctccgcct cgccatgact tcctacagct atcgccagtc gtcggccacg 301 tcgtccttcg gaggcctggg cggcggctcc gtgcgttttg ggccgggggt cgcttttcgc 361 gcgcccagca ttcacggggg ctccggcggc cgcggcgtat ccgtgtcctc cgcccgcttt 421 gtgtcctcgt cctcctcggg gggctacggc ggcggctacg gcggcgtcct gaccgcgtcc 481 gacgggctgc tggcgggcaa cgagaagcta accatgcaga acctcaacga ccgcctggcc 541 tcctacctgg acaaggtgcg cgccctggag gcggccaacg gcgagctaga ggtgaagatc 601 cgcgactggt accagaagca ggggcctggg ccctcccgcg actacagcca ctactacacg 661 accatccagg acctgcggga caagattctt ggtgccacca ttgagaactc caggattgtc 721 ctgcagatcg acaatgcccg tctggctgca gatgacttcc gaaccaagtt tgagacggaa 781 caggctctgc gcatgagcgt ggaggccgac atcaacggcc tgcgcagggt gctggatgag 841 ctgaccctgg ccaggaccga cctggagatg cagatcgaag gcctgaagga agagctggcc 901 tacctgaaga agaaccatga ggaggaaatc agtacgctga ggggccaagt gggaggccag 961 gtcagtgtgg aggtggattc cgctccgggc accgatctcg ccaagatcct gagtgacatg 1021 cgaagccaat atgaggtcat ggccgagcag aaccggaagg atgctgaagc ctggttcacc 1081 agccggactg aagaattgaa ccgggaggtc gctggccaca cggagcagct ccagatgagc 1141 aggtccgagg ttactgacct gcggcgcacc cttcagggtc ttgagattga gctgcagtca 1201 cagctgagca tgaaagctgc cttggaagac acactggcag aaacggaggc gcgctttgga 1261 gcccagctgg cgcatatcca ggcgctgatc agcggtattg aagcccagct gggcgatgtg 1321 cgagctgata gtgagcggca gaatcaggag taccagcggc tcatggacat caagtcgcgg 1381 ctggagcagg agattgccac ctaccgcagc ctgctcgagg gacaggaaga tcactacaac 1441 aatttgtctg cctccaaggt cctctgaggc agcaggctct ggggcttctg ctgtcctttg 1501 gagggtgtct tctgggtaga gggatgggaa ggaagggacc cttacccccg gctcttctcc 1561 tgacctgcca ataaaaattt atggtccaag ggaaaaaaaa aa // LOCUS HUMIL2AB 442 bp DNA PRI 06-JAN-1995 DEFINITION Human interleukin 2 gene, clone pATtacIL-2C/2TT, complete cds, clone pATtacIL-2C/2TT. ACCESSION M22005 NID g186300 KEYWORDS interleukin 2. SOURCE Human T-lymphocyte DNA, clone pATtacIL-2C/2TT. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 442) AUTHORS Weir,M.P., Chaplin,M.A., Wallace,D.M., Dykes,C.W. and Hobden,A.N. TITLE Structure-activity relationships of recombinant human interleukin 2 JOURNAL Biochemistry 27 (18), 6883-6892 (1988) MEDLINE 89062420 FEATURES Location/Qualifiers source 1..442 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-lymphocyte" /map="4q26-q27" gene 22..426 /gene="IL2" CDS 22..426 /gene="IL2" /note="precursor" /codon_start=1 /db_xref="GDB:G00-119-344" /product="interleukin 2" /db_xref="PID:g386818" /translation="MAPTSSSTKKTQLQLEHLLLDLQMILNGINNYKNPKLTRMLTFK FYMPKKATELKHLQCLEEELKPLEEVLNLAQSKNFHLRPRDLISNINVIVLELKGSET TFMCEYADETATIVEFLNRWITFCQSIISTLT" mat_peptide 25..423 /gene="IL2" /note="G00-119-344" /product="interleukin 2" BASE COUNT 124 a 132 c 93 g 93 t ORIGIN Chromosome 4q26-q27. 1 gatcctagga ggtttggtac catggctccg acgagcagct ccaccaagaa aacccagctc 61 cagctcgaac acctgctgct ggacctgcag atgatcctga acggtatcaa caactacaag 121 aacccgaaac tgactcgtat gctgaccttc aagttctaca tgccgaagaa agctaccgaa 181 ctgaaacacc tgcaatgcct cgaggaggag ctcaaaccgc tggaagaggt tctgaacctg 241 gctcagtcca agaacttcca cctgcgtccg cgcgacctga tctccaacat caacgttatc 301 gttctggaac tgaaaggcag tgagactacc ttcatgtgcg aatacgctga cgaaaccgct 361 actatcgttg aattcctgaa ccgttggatc accttctgtc agtccatcat ctccaccctg 421 acctaataac taactaagtc ga // LOCUS HUMINV2 2108 bp DNA PRI 06-JAN-1995 DEFINITION Human involucrin gene, exon 2. ACCESSION M13903 NID g186519 KEYWORDS involucrin; keratinocyte protein. SEGMENT 2 of 2 SOURCE Human keratinocyte, cDNA to mRNA; and DNA, clone lambda-1-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2108) AUTHORS Eckert,R.L. and Green,H. TITLE Structure and evolution of the human involucrin gene JOURNAL Cell 46 (4), 583-589 (1986) MEDLINE 86272107 FEATURES Location/Qualifiers source 1..2108 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21-q22" gene join(M13902:785..834,1..1787) /gene="IVL" intron <1..10 /gene="IVL" /note="G00-119-355" /number=1 CDS 30..1787 /gene="IVL" /codon_start=1 /db_xref="GDB:G00-119-355" /product="involucrin" /db_xref="PID:g386834" /translation="MSQQHTLPVTLSPALSQELLKTVPPPVNTHQEQMKQPTPLPPPC QKVPVELPVEVPSKQEEKHMTAVKGLPEQECEQQQKEPQEQELQQQHWEQHEEYQKAE NPEQQLKQEKTQRDQQLNKQLEEEKKLLDQQLDQELVKRDEQLGMKKEQLLELPEQQE GHLKHLEQQEGQLKHPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQ QEGQLELPQQQEGQLELSEQQEGQLELSEQQEGQLELSEQQEGQLKHLEHQEGQLEVP EEQMGQLKYLEQQEGQLKHLDQQEKQPELPEQQMGQLKHLEQQEGQPKHLEQQEGQLE QLEEQEGQLKHLEQQEGQLEHLEHQEGQLGLPEQQVLQLKQLEKQQGQPKHLEEEEGQ LKHLVQQEGQLKHLVQQEGQLEQQERQVEHLEQQVGQLKHLEEQEGQLKHLEQQQGQL EVPEQQVGQPKNLEQEEKQLELPEQQEGQVKHLEKQEAQLELPEQQVGQPKHLEQQEK HLEHPEQQDGQLKHLEQQEGQLKDLEQQKGQLEQPVFAPAPGQVQDIQPALPTKGEVL LPVEHQQQKQEVQWPPKHK" BASE COUNT 602 a 526 c 711 g 269 t ORIGIN About 1188 bp after segment 1. 1 tgtctttcag gttgacagta gcttctaaga tgtcccagca acacacactg ccagtgaccc 61 tctcccctgc cctcagtcag gagctcctca agactgttcc tcctccagtc aatacccatc 121 aggagcaaat gaaacagcca actccactgc ctcccccatg ccagaaggtg cctgtcgagc 181 tcccagtgga ggtcccatca aagcaagagg aaaagcacat gactgctgta aagggactgc 241 ctgagcaaga atgtgagcaa cagcagaagg agccacagga gcaggagctg cagcaacagc 301 actgggaaca gcatgaggaa tatcagaaag cagaaaaccc agagcagcag cttaagcagg 361 agaaaacaca aagggatcag cagctaaaca aacagctgga agaagagaag aagctcttag 421 accagcaact ggatcaagag ctagtcaaga gagatgagca actgggaatg aagaaagagc 481 aactgttgga gctcccagag cagcaggagg ggcacctgaa gcacctagag cagcaggagg 541 gacagctgaa gcacccggag cagcaggagg ggcagctgga gctcccagag cagcaggagg 601 ggcagctgga gctcccagag cagcaggagg ggcagctgga gctcccagag cagcaggagg 661 ggcagctgga gctcccagag cagcaggagg ggcagctgga gctcccacag cagcaggagg 721 ggcagctgga gctctctgag cagcaggagg ggcagctgga gctctctgag cagcaggagg 781 ggcagctgga gctctctgag cagcaggagg gacagctgaa gcacctggag caccaggagg 841 ggcagctgga ggtcccagag gagcagatgg ggcagctgaa gtacctggaa cagcaggagg 901 ggcagctgaa gcacctggat cagcaggaga agcagccaga gctcccagag cagcagatgg 961 ggcagctgaa gcacctggag cagcaggagg ggcagcctaa gcatctggag cagcaggagg 1021 ggcaactgga gcagctggag gagcaggagg ggcagctgaa gcacctggag cagcaggagg 1081 ggcagctgga gcacctggag caccaggaag ggcagctggg gctcccagag cagcaggtgc 1141 tgcagctgaa gcagctagag aagcagcagg ggcagccaaa gcacctggag gaggaggagg 1201 ggcagctgaa gcacctggtg cagcaggagg ggcagctgaa gcatctggtg cagcaggagg 1261 ggcagctgga gcagcaggag aggcaggtgg agcacctgga gcagcaggtg gggcagctga 1321 agcacctaga ggagcaggag ggacagctga agcatctgga gcagcagcag gggcagttgg 1381 aggtcccaga gcagcaggtg gggcagccaa agaacctgga gcaggaggag aagcaactgg 1441 agctcccaga gcagcaagag ggccaggtga agcacctgga gaagcaggag gcacagctgg 1501 agctcccaga gcagcaggta ggacagccaa agcacctgga acagcaggaa aagcacctag 1561 agcacccaga gcagcaggac ggacaactaa aacatctgga gcagcaggag gggcagctga 1621 aggacctgga gcagcagaag gggcagctgg agcagcctgt gtttgcccca gctccaggcc 1681 aggtccaaga cattcaacca gccctgccca caaagggaga agtattgctt cctgtagagc 1741 accagcagca gaagcaggag gtgcagtggc cacccaaaca taaataacca cccgcagtgt 1801 ccagaggccc tcagatcgtc tcatacaagg gaagagagag ccactggctc cacttatttc 1861 gggtccgcta ggtggcccgt ctcatctgtg aacttgactc tgtccctcta catgtctctt 1921 taatggggtg agggtggggg agagagggaa ttattgtcca gtgccaaccc caatgacccc 1981 aatcccaacc tcaggtgagc ggagcctcta cttgagggac tattgttact ataggaatcc 2041 ttacttcccc agtattgaag ctgaatcagt gagtgtgtac aatgatacat aataaatctt 2101 ggaagtct // LOCUS HUMIRKCB 1635 bp DNA PRI 27-JAN-1997 DEFINITION Human gene for inward rectifier K channel, complete cds. ACCESSION D50582 NID g1088444 KEYWORDS . SOURCE Homo sapiens (isolate:caucasian) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Inagaki,N., Gonoi,T., Clement,J.P. IV., Namba,N., Inazawa,J., Gonzalez,G., Aguilar-Bryan,L., Seino,S. and Bryan,J. TITLE Reconstitution of IKATP: an inward rectifier subunit plus the sulfonylurea receptor JOURNAL Science 270 (5239), 1166-1170 (1995) MEDLINE 96072967 REFERENCE 2 (bases 1 to 1635) AUTHORS Inagaki,N. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 1635) AUTHORS Inagaki,N. TITLE Direct Submission JOURNAL Submitted (16-MAY-1995) to the DDBJ/EMBL/GenBank databases. Nobuya Inagaki, Chiba University School of Medicine, Center for Biomedical Science; 1-8-1 Inohana, Chuo-ku, Chiba, Chiba 260, Japan (Tel:043-222-7171(ex.2223), Fax:043-221-7803) FEATURES Location/Qualifiers source 1..1635 /organism="Homo sapiens" /isolate="caucasian" /db_xref="taxon:9606" /chromosome="11" /tissue_type="placenta" CDS 210..1382 /codon_start=1 /product="inward rectifier K channel" /db_xref="PID:d1009769" /db_xref="PID:g1088445" /translation="MLSRKGIIPEEYVLTRLAEDPAEPRYRARQRRARFVSKKGNCNV AHKNIREQGRFLQDVFTTLVDLKWPHTLLIFTMSFLCSWLLFAMAWWLIAFAHGDLAP SEGTAEPCVTSIHSFSSAFLFSIEVQVTIGFGGRMVTEECPLAILSLIVQNIVGLMIN AIMLGCIFMKTAQAHRRAETLIFSKHAVIALRHGRLCFMLRVGDLRKSMIISATIHMQ VVRKTTSPEGEVVPLHQVDIPMENGVGGNSIFLVAPLIIYHVIDANSPLYDLAPSDLH HHQDLEIIVILEGVVETTGITTQARTSYLADEILWGQRFVPIVAEEDGRYSVDYSKFG NTIKVPTPLCTARQLDEDHSLLEALTLASARGPLRKRSVPMAKAKPKFSISPDSLS" BASE COUNT 314 a 548 c 459 g 314 t ORIGIN Chromosome 11. 1 ctgaggctgg tattaagaag tgaagtggga cccaggtgga ggtaaggaag agtctggtgg 61 ggagttatct cagaagtgag gccagcacag gctgagtgca gccccagggt gagaaggtgc 121 ccaccgagag gactctgcag tgaggcccta ggccacgtcc gaggggtgcc tccgatgggg 181 gaagcccctc cctgggggtc accggagcca tgctgtcccg caagggcatc atccccgagg 241 aatacgtgct gacacgcctg gcagaggacc ctgccgagcc caggtaccgt gcccgccagc 301 ggagggcccg ctttgtgtcc aagaaaggca actgcaacgt ggcccacaag aacatccggg 361 agcagggccg cttcctgcag gacgtgttca ccacgctggt ggacctcaag tggccacaca 421 cattgctcat cttcaccatg tccttcctgt gcagctggct gctcttcgcc atggcctggt 481 ggctcatcgc cttcgcccac ggtgacctgg cccccagcga gggcactgct gagccctgtg 541 tcaccagcat ccactccttc tcgtctgcct tccttttctc cattgaggtc caagtgacta 601 ttggctttgg ggggcgcatg gtgactgagg agtgcccact ggccatcctg agcctcatcg 661 tgcagaacat cgtggggctc atgatcaacg ccatcatgct tggctgcatc ttcatgaaga 721 ctgcccaagc ccaccgcagg gctgagaccc tcatcttcag caagcatgcg gtgatcgctc 781 tgcgccacgg ccgcctctgc ttcatgctac gtgtgggtga cctccgcaag agcatgatca 841 tcagcgccac catccacatg caggtggtac gcaagaccac cagccccgag ggcgaggtgg 901 tgcccctcca ccaggtggac atccccatgg agaacggcgt gggtggcaac agcatcttcc 961 tggtggcccc gctgatcatc taccatgtca ttgatgccaa cagcccactc tacgacctgg 1021 cacccagcga cctgcaccac caccaggacc tcgagatcat cgtcatcctg gaaggcgtgg 1081 tggaaaccac gggcatcacc acccaggccc gcacctccta cctggccgat gagatcctgt 1141 ggggccagcg ctttgtgccc attgtagctg aggaggacgg acgttactct gtggactact 1201 ccaagtttgg caacaccatc aaagtgccca caccactctg cacggcccgc cagcttgatg 1261 aggaccacag cctactggaa gctctgaccc tcgcctcagc ccgcgggccc ctgcgcaagc 1321 gcagcgtgcc catggccaag gccaagccca agttcagcat ctctccagat tccctgtcct 1381 gagccatggt ctctcgggcc ccccacacgc gtgtgtacac acggaccatg tggtatgtag 1441 cccagccagg gcctggtgtg aggctgggcc agcctcagct cagcctcccc ctgctgctca 1501 tccagggtgt tacaaggcac ttgtcactat gctatttctg gcctcagcag gaacctgtac 1561 tgggttattt ttgtccctgc tcctcccaac ccaatttagg actggctcac ccctctcccc 1621 cgcccaaggc tgcag // LOCUS HUMISK 436 bp DNA PRI 30-MAR-1994 DEFINITION Human IsK protein (exhibiting a slowly activating channel activity) gene, complete cds, clone phKI2. ACCESSION M26685 NID g186569 KEYWORDS IsK protein; transmembrane protein. SOURCE Human adult DNA, clone phKI2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 436) AUTHORS Murai,T., Kakizuka,A., Takumi,T., Ohkubo,H. and Nakanishi,S. TITLE Molecular cloning and sequence analysis of human genomic DNA encoding a novel membrane protein which exhibits a slowly activating potassium channel activity JOURNAL Biochem. Biophys. Res. Commun. 161, 176-181 (1989) MEDLINE 89273632 COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by S.Nakanishi, 07-SEP-1989. FEATURES Location/Qualifiers source 1..436 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" CDS 29..418 /note="IsK protein" /codon_start=1 /db_xref="PID:g386838" /translation="MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSSDGKLEA LYVLMVLGFFGFFTLGIMLSYIRSKKLEHSNDPFNVYIESDAWQEKDKAYVQARVLES YRSCYVVENHLAIEQPNTHLPETKPSP" BASE COUNT 103 a 139 c 112 g 82 t ORIGIN 5 bp upstream of PstI site. 1 ctgcagcagt ggaaccttaa tgcccaggat gatcctgtct aacaccacag cggtgacgcc 61 ctttctgacc aagctgtggc aggagacagt tcagcagggt ggcaacatgt cgggcctggc 121 ccgcaggtcc ccccgcagca gtgacggcaa gctggaggcc ctctacgtcc tcatggtact 181 gggattcttc ggcttcttca ccctgggcat catgctgagc tacatccgct ccaagaagct 241 ggagcactcg aacgacccat tcaacgtcta catcgagtcc gatgcctggc aagagaagga 301 caaggcctat gtccaggccc gggtcctgga gagctacagg tcgtgctatg tcgttgaaaa 361 ccatctggcc atagaacaac ccaacacaca ccttcctgag acgaagcctt ccccatgaac 421 cccaccactg gctaaa // LOCUS HUMJUNA 3622 bp DNA PRI 06-JAN-1995 DEFINITION Human c-jun proto oncogene (JUN), complete cds, clone hCJ-1. ACCESSION J04111 NID g186624 KEYWORDS jun oncogene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3622) AUTHORS Hattori,K., Angel,P., Le Beau,M.M. and Karin,M. TITLE Structure and chromosomal localization of the functional intronless human JUN protooncogene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (23), 9148-9152 (1988) MEDLINE 89057892 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.Hattori, 16-NOV-1988. FEATURES Location/Qualifiers source 1..3622 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /map="1p32-p31" gene 287..3622 /gene="JUN" exon 287..3622 /gene="JUN" /note="alternative mRNA start; G00-120-114" /number=1 exon 289..3622 /gene="JUN" /note="alternative mRNA start; G00-120-114" /number=1 exon 293..3622 /gene="JUN" /note="alternative mRNA start; G00-120-114" /number=1 CDS 1261..2256 /gene="JUN" /codon_start=1 /db_xref="GDB:G00-120-114" /db_xref="PID:g386839" /translation="MTAKMETTFYDDALNASFLPSESGPYGYSNPKILKQSMTLNLAD PVGSLKPHLRAKNSDLLTSPDVGLLKLASPELERLIIQSSNGHITTTPTPTQFLCPKN VTDEQEGFAEGFVRALAELHSQNTLPSVTSAAQPVNGAGMVAPAVASVAGGSGSGGFS ASLHSEPPVYANLSNFNPGALSSGGGAPSYGAAGLAFPAQPQQQQQPPHHLPQQMPVQ HPRLQALKEEPQTVPEMPGETPPLSPIDMESQERIKAERKRMRNRIAASKCRKRKLER IARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNHVNSGCQLMLTQQLQTF" BASE COUNT 851 a 949 c 1091 g 731 t ORIGIN Chromosome 1p31-p32. 1 cccggggagg ggaccgggga acagagggcc gagaggcgtg cggcaggggg gagggtagga 61 gaaagaaggg cccgactgta ggagggcagc ggagcattac ctcatcccgt gagcctccgc 121 gggcccagag aagaatcttc tagggtggag tctccatggt gacgggcggg cccgcccccc 181 tgagagcgac gcgagccaat gggaaggcct tggggtgaca tcatgggcta tttttagggg 241 ttgactggta gcagataagt gttgagctcg ggctggataa gggctcagag ttgcactgag 301 tgtggctgaa gcagcgaggc gggagtggag gtgcgcggag tcaggcagac agacagacac 361 agccagccag ccaggtcggc agtatagtcc gaactgcaaa tcttattttc ttttcacctt 421 ctctctaact gcccagagct agcgcctgtg gctcccgggc tggtggttcg ggagtgtcca 481 gagagccttg tctccagccg gccccgggag gagagccctg ctgcccaggc gctgttgaca 541 gcggcggaaa gcagcggtac cccacgcgcc cgccggggga cgtcggcgag cggctgcagc 601 agcaaagaac tttcccggcg gggaggaccg gagacaagtg gcagagtccc ggagcgaact 661 tttgcaagcc tttcctgcgt cttaggcttc tccacggcgg taaagaccag aaggcggcgg 721 agagccacgc aagagaagaa ggacgtgcgc tcagcttcgc tcgcaccggt tgttgaactt 781 gggcgagcgc gagccgcggc tgccgggcgc cccctccccc tagcagcgga ggaggggaca 841 agtcgtcgga gtccgggcgg ccaagacccg ccgccggccg gccactgcag ggtccgcact 901 gatccgctcc gcggggagag ccgctgctct gggaagtgag ttcgcctgcg gactccgagg 961 aaccgctgcg cccgaagagc gctcagtgag tgaccgcgac ttttcaaagc cgggtagcgc 1021 gcgcgagtcg acaagtaaga gtgcgggagg catcttaatt aaccctgcgc tccctggagc 1081 gagctggtga ggagggcgca gcggggacga cagccagcgg gtgcgtgcgc tcttagagaa 1141 actttccctg tcaaaggctc cggggggcgc gggtgtcccc cgcttgccag agccctgttg 1201 cggccccgaa acttgtgcgc gcacgccaaa ctaacctcac gtgaagtgac ggactgttct 1261 atgactgcaa agatggaaac gaccttctat gacgatgccc tcaacgcctc gttcctcccg 1321 tccgagagcg gaccttatgg ctacagtaac cccaagatcc tgaaacagag catgaccctg 1381 aacctggccg acccagtggg gagcctgaag ccgcacctcc gcgccaagaa ctcggacctc 1441 ctcacctcgc ccgacgtggg gctgctcaag ctggcgtcgc ccgagctgga gcgcctgata 1501 atccagtcca gcaacgggca catcaccacc acgccgaccc ccacccagtt cctgtgcccc 1561 aagaacgtga cagatgagca ggaggggttc gccgagggct tcgtgcgcgc cctggccgaa 1621 ctgcacagcc agaacacgct gcccagcgtc acgtcggcgg cgcagccggt caacggggca 1681 ggcatggtgg ctcccgcggt agcctcggtg gcagggggca gcggcagcgg cggcttcagc 1741 gccagcctgc acagcgagcc gccggtctac gcaaacctca gcaacttcaa cccaggcgcg 1801 ctgagcagcg gcggcggggc gccctcctac ggcgcggccg gcctggcctt tcccgcgcaa 1861 ccccagcagc agcagcagcc gccgcaccac ctgccccagc agatgcccgt gcagcacccg 1921 cggctgcagg ccctgaagga ggagcctcag acagtgcccg agatgcccgg cgagacaccg 1981 cccctgtccc ccatcgacat ggagtcccag gagcggatca aggcggagag gaagcgcatg 2041 aggaaccgca tcgctgcctc caagtgccga aaaaggaagc tggagagaat cgcccggctg 2101 gaggaaaaag tgaaaacctt gaaagctcag aactcggagc tggcgtccac ggccaacatg 2161 ctcagggaac aggtggcaca gcttaaacag aaagtcatga accacgttaa cagtgggtgc 2221 caactcatgc taacgcagca gttgcaaaca ttttgaagag agaccgtcgg gggctgaggg 2281 gcaacgaaga aaaaaaataa cacagagaga cagacttgag aacttgacaa gttgcgacgg 2341 agagaaaaaa gaagtgtccg agaactaaag ccaagggtat ccaagttgga ctgggttcgg 2401 tctgacggcg cccccagtgt gcacgagtgg gaaggacttg gtcgcgccct cccttggcgt 2461 ggagccaggg agcggccgcc tgcgggctgc cccgctttgc ggacgggctg tccccgcgcg 2521 aacggaacgt tggactttcg ttaacattga ccaagaactg catggaccta acattcgatc 2581 tcattcagta ttaaaggggg gagggggagg gggttacaaa ctgcaataga gactgtagat 2641 tgcttctgta gtactcctta agaacacaaa gcggggggag ggttggggag gggcggcagg 2701 agggaggttt gtgagagcga ggctgagcct acagatgaac tctttctggc ctgctttcgt 2761 taactgtgta tgtacatata tatatttttt aatttgatta aagctgatta ctgtcaataa 2821 acagcttcat gcctttgtaa gttatttctt gtttgtttgt ttgggtatcc tgcccagtgt 2881 tgtttgtaaa taagagattt ggagcactct gagtttacca tttgtaataa agtatataat 2941 ttttttatgt tttgtttctg aaaattccag aaaggatatt taagaaaata caataaacta 3001 ttggaaagta ctcccctaac ctcttttctg catcatctgt agatcctagt ctatctaggt 3061 ggagttgaaa gagttaagaa tgctcgataa aatcactctc agtgcttctt actattaagc 3121 agtaaaaact gttctctatt agacttagaa ataaatgtac ctgatgtacc tgatgctatg 3181 tcaggcttca tactccacgc tcccccagcg tatctatatg gaattgctta ccaaaggcta 3241 gtgcgatgtt tcaggaggct ggaggaaggg gggttgcagt ggagagggac agcccactga 3301 gaagtcaaac atttcaaagt ttggattgca tcaagtggca tgtgctgtga ccatttataa 3361 tgttagaaat tttacaatag gtgcttattc tcaaagcagg aattggtggc agattttaca 3421 aaagatgtat ccttccaatt tggaatcttc tctttgacaa ttcctagata aaaagatggc 3481 ctttgtctta tgaatattta taacagcatt ctgtcacaat aaatgtattc aaataccaat 3541 aacagatctt gaattgcttc cctttactac ttttttgttc ccaagttata tactgaagtt 3601 tttattttta gttgctgagg tt // LOCUS HUMKBF2 5360 bp DNA PRI 05-MAR-1996 DEFINITION Homo sapiens H2K binding factor 2 (KBF2) mRNA, complete cds. ACCESSION L08904 L09117 L09761 NID g1220319 KEYWORDS H-2K binding factor-2. SOURCE Homo sapiens (tissue library: lambda gt11) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5360) AUTHORS Tang,X., Gachelin,G., Yokoyama,K. and Israel,A. TITLE Nucleotide sequence and the chromosomal mapping of KBF-2 cDNA JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..5360 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="lambda gt11" gene 239..5341 /gene="KBF2" CDS 239..1501 /gene="KBF2" /codon_start=1 /product="H-2K binding factor-2" /db_xref="PID:g1220320" /translation="MGSGWKKIKLQMKCDGCSEQGSHPCAFIGIGNSDQEMQQLNLEG KNYCTAKTLYISDSDKQKHFMLSVKVFYGNGDDIGVFLSKSSKPSKKKQSLKNADLCI GSGTKVALFNRLRSQTVSTRYLHVEGGNFHASSQQWGAFTLFLDDDGSEGEEFTVRDG YIHYGQTVKLVCSVTGMALPRLIIRKVDKQTTLLDADDPVSQLHKCAFDLEDTERMYL CLSQERIIQFQATPCPTEPNKEMINDGASWAIISTHKAKYTFYERMGPVLALVMPMPV VESLKLNGGGDEAMLELTGQNFTPNLRVWFGDVEAETMYRCGESMLRVVPDVLHSEKV GDSSQQPVQVSVTLVRNDGIIYSTSLTFTYTPEAGPRPHCSVAGAILKASSSHVPPNE LNTNSDGSYTNASTNSTSVTSSTPTVVS" polyA_signal 5336..5341 /gene="KBF2" BASE COUNT 1552 a 978 c 1008 g 1822 t ORIGIN 1 ttgaggtgca ttgaaatgtt ccaagctgtt acttacctta acatgttctt gaggtaccat 61 ggcatggatt aaaaggaaat ttggtaagtg gcctccactt aaacgactta ctagggaagc 121 tatgtgaaat tatttaaaag ggcgagggga tcaaatagta cttatccttc atgcaaaagt 181 tgtacagaag tcatatggaa tgaaaaaggt tttttgccct cccccttgtg tatatcttat 241 gggcagtgga tggaagaaaa taaaattaca aatgaaatgc gatggttgtt ctgaacaagg 301 ctctcatcca tgtgcattta ttgggatagg aaatagtgac caagaaatgc agcagctaaa 361 cttggaagga aagaactatt gcacagccaa aacattgtac atatctgatt cagacaagca 421 aaagcacttc atgttgtctg taaaggtgtt ctatggcaac ggtgatgaca ttggtgtgtt 481 cctcagcaag tcgtccaaac cttccaaaaa gaagcagtca ttgaaaaatg ctgacttatg 541 cattggctca ggaacaaagg tggctctgtt taatcgacta cgatcccaga cagttagtac 601 cagatacttg catgtagaag gagggaattt tcatgccagt tcacagcagt ggggagcatt 661 tacattattc ttggatgatg atggatcaga aggagaagaa ttcacagtca gagatggcta 721 cattcattat ggacaaacag tcaagcttgt gtgctcagtt actggcatgg cactcccaag 781 attgataatt aggaaagttg ataagcagac cacattattg gatgcagatg atcctgtgtc 841 acaactccat aaatgtgcat ttgaccttga ggatacagaa agaatgtact tatgcctttc 901 tcaagaaaga ataattcaat ttcaggccac tccatgccca acagaaccaa ataaagagat 961 gataaatgat ggtgcttcct gggcaatcat tagcacacat aaggcgaagt atacatttta 1021 tgagagaatg ggccctgtcc ttgccctggt catgcctatg cctgtcgtag agagccttaa 1081 gttgaatggc ggtggggacg aagcaatgct tgaacttaca ggacagaatt tcactccaaa 1141 tttacgagtg tggtttgggg atgtagaagc tgaaactatg tacaggtgtg gagagagtat 1201 gctccgtgtt gtcccagacg ttctgcattc tgagaaggtt ggagatagtt cccagcaacc 1261 agtccaggtt tcagtaactt tggtccgaaa tgatggaatc atatattcca ccagccttac 1321 ctttacctac acaccagaag cagggccgcg gccacattgc agtgtagcag gcgcaatcct 1381 taaggccagt tcaagccacg tgccccctaa tgaattaaac acaaacagcg atggaagtta 1441 cacaaatgcc agcacaaatt caaccagtgt cacatcatct acaccaacag tggtatcctg 1501 aactaccgtc tttttgctaa gactcaaacg gcttgagtgc agcaaaaagt tgacaaaaaa 1561 ggaaaaaaaa atgaacagtc ttttgtggtt tattgggaaa cttttcatac caggtgatac 1621 tattctaaaa ccccgttgtc tccctgcaag tgctgatttg aaatgcagaa gccacagtaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaagaaa aaaaaatcaa aatgtataaa tattggaaat 1741 caagtttttc agctgttttg ttggttggtt ggttggtttt tgtttggttt tgtttaaagg 1801 gacaagaagt aaataatgtg gctggaatac aagttgaaca aactagaaga cacaaatcta 1861 acatagtttt tatggaccaa ggaacttgta tattgtataa gctttagtaa aaggtacatt 1921 ttcaccatac ctttttttat atcacggtat tatagtacac cttgttacca ataggttgtt 1981 ctcttcccca ccctcctttg agctttgctc taaaatacat tctggttcca agcctgacca 2041 tccttgttta atctatcata ctcttccagg tttttttttt tggtctaagg ctggaacttt 2101 tttctttttt tttcagctga agtcttatga ctttttcatg agtcaaaatt gtttggattt 2161 cacgaagtca aatcttgcaa aggcctgcat atttttttta agattatatg aagtctgtgc 2221 aaaaagcttt aaaaaattgc ctctgccttg cctgcataca tgcaatgtat gtaacttagt 2281 ctctcttctc agacactgtt gggtagttat ttctgtgttt tcttttttta aaaaaaaata 2341 tggacttatt gtggtttatc tgagaggttc taacattcac atgcaatttg gtgtggcatt 2401 tagctattat gagttattgg cgcgaacttg tttgatattt gaagtgtctc tccccttttc 2461 ccatgacgta atacataggt gtgttccagg atttgttcag gtttttcccc ccctcctaat 2521 cttgtacata acttgtattt tgtgtaagtt aaacatttta tttgaacttg gaatgttccc 2581 agtgatttca ttcagcaggg tattttctgc cttgttggca agtagcaaaa aatatgggaa 2641 gtatttgcta ccagttgtta gatggtgccc cttattggta gaatcaggaa aatgtccgca 2701 aaagcatgtt ttattatctt tacttttttg gggggttgga gggggtagcc tagccagaca 2761 tcatgtaatc ttaaaacata agatgctttt attagatgat caactaaaaa tagctggaag 2821 acagtacttt agaaacaaaa tagttagtaa gatatataat gcaaatgtaa cttatgtttt 2881 catttttttc tctgcctttt ttttttgttt tttttctttt tttccagtac tgagcatctc 2941 cacaaatgtc tcctactcag aaaatgtttc ttttctttca gttgagattt ggtgcattca 3001 gggttgtagg ttggccttgc ttgctaaccc gccggtttta ccgtgcttta ttcctgaact 3061 ttgtttatgc ctttgtttgg ttcttctgaa attgcagcag actcattggg ctacatttag 3121 tacaggaacc acgtgtgtaa tgttatacaa cacagtcagt aatacaatca tccctcttag 3181 agtaaaaact acctctagat tgtgtaagct ttttactgtc cataaaacag gagccacagt 3241 accttatgaa tgcaaaactg taacttccta cagtgtttcc ccacagaaca ttgtctttct 3301 ggtgtctggg ctgtttttga aaaagtttcc attaatagac tttttagaaa ttattattag 3361 tagcattttt tttccagctt tgcgtcttca tcactcactc aagtgtcaga ctatgcactg 3421 taaatatctt cctaacatct ttaaatcgcc ttttcctcag ttttcaaggg gaaggtcatt 3481 tgtaaagcac gttaggtggt taaatcagtt attgcggttt tctcttacag caagcctttt 3541 taatcacccc caggctgcat tttattctat atcgcctttt ttcttcaaat ctgctccaat 3601 catccacttc tctcttataa gctattcctg cctcacacct aaatctgttt cagtgatcaa 3661 gggcagaact cattgtggcc ttatctttct ttgttgtaat tgttcactgt ctctttctta 3721 cagaccactt attctgagta gtagttattc ctccctatgg agtcatggca ggaatcatta 3781 cacagtgctt ttgttcagag catggacatg ttccaggtgc tgctttgctt taacggccac 3841 aagtttcctc cacttctcag gtttggtatt tagtaaggaa tcaattaaat taaccaataa 3901 caaaagagat acttttgaag aacaaactat tctttaccca ttttgtagct caaaaataat 3961 ttttcaagtt catgacctta ttaaaatgaa cttgtgtttt tttaacaaac gtctatttta 4021 ttttgatagt ttctttccga agataattga aatattatac tgtaaccctt ttcttttctt 4081 ttttgaaaag tccaagaatg tacttataca ggatttttcc ccacctattt ttggccattc 4141 tcataccaca gacaaaagag tgaatgattg tcattgtagc ttattgttta tcagtagttc 4201 ttttgtagct gcttacattt tttctttcat ggtttgtgaa tcatttcagt atgtaattta 4261 taggaacctt gtcctctggt atagtagact gtgtgccctc ctccaggatg gcattattag 4321 acatgctggt catttaccct cagaaagact ctcttataga atggtgagtg cttcagttat 4381 agtatgtttg aattttaaaa aattcctgtt tagaatgtat ctatgctctc atgactatgc 4441 agtttctaac atacacatag aagctgagtc tctgatccaa tatgttttta tttgttccat 4501 taatttatca catagattgg gaaggcaagc taaaagcctt aaaaatgccc tttatatttt 4561 gagtgatttc agcgttgaac acagtatact atctaaattt gctgctcact ttcttaaact 4621 gttgcaatta aaggcatgtt tatacatgac taatcgtgaa atgtttgtca ctcttactgc 4681 acagacttat ctgcaatcaa actggttagt ttttttgttt tgttttgttt tattgttttt 4741 aatgaatctg gtaccatctg tgctttcaca aaaaacttcc aatgccattt ttgagaacta 4801 acctaacagt catgctaacc agaaaatcca ctggggagga ggttcctttg aaacaaaatg 4861 ctgttcagtt agtaaccaag ttactttgat tgcaaaagca gctgtgtttc tgataagtac 4921 tgaacaaatg tgtgtaattt tctgtgccag acttatgact ttgttttcaa gcactgtaat 4981 gtgggatgga tggttagaaa caataatata tagggtttct gttaaccctt tcaggactca 5041 actgtatctc cttttgttaa ttttcccctg tgttgtgata aattgtttgc cagcattcag 5101 tactgtgttg gtgcagatga ggtttatatc tcattttagc ttatttcttg tacctttcag 5161 catgcctacg cattcagtcc ttaaggggtt tattttacaa actgtgcgcc tgaagtttat 5221 tagcaataag atagaaaatg agcaagttta taccataatt ttgagaaaaa aagaatctgc 5281 tcagttccat atttcatccg tgaaaaactt gcaatacgag cagtttcaag gaataaataa 5341 aaaggaaatg aaaccattgt // LOCUS HUMLACFE 4896 bp DNA PRI 09-JAN-1995 DEFINITION Human neutrophil lactoferrin mRNA, complete cds and 5' promoter region. ACCESSION M73700 NID g619784 KEYWORDS lactoferrin. SOURCE Homo sapiens male adult bone marrow DNA; and Homo sapiens (tissue library: lambda gt10) adult bone marrow cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4896) AUTHORS Wei,X., Han,J. and Rado,T.A. TITLE Human neutrophil lactoferrin coding and 5' flanking region DNA sequences JOURNAL Unpublished (1991) COMMENT Data kindly submitted by T.A.Rado, 02-JUL-1991. FEATURES Location/Qualifiers source 1..4896 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="myelocyte" /dev_stage="adult" /sex="male" /tissue_type="bone marrow" repeat_unit 233..264 /note="8 tandem repeats of GAAT" CAAT_signal complement(2493..2497) TATA_signal 2520..2525 mRNA 2550..4896 /gene="neutrophil lactoferrin" gene 2550..4896 /gene="neutrophil lactoferrin" CDS 2589..4721 /gene="neutrophil lactoferrin" /codon_start=1 /product="neutrophil lactoferrin" /db_xref="PID:g186818" /translation="MKLVFLVLLFLGALGLCLAGRRRSVQWCAVSQPEATKCFQWQRN MRKVRGPPVSCIKRDSPIQCIQAIAENRADAVTLDGGFIYEAGLAPYKLRPVAAEVYG TERQPRTHYYAVAVVKKGGSFQLNELQGLKSCHTGLRRTAGWNVPIGTLRPFLNWTGP PEPIEAAVARFFSASCVPGADKGQFPNLCRLCAGTGENKCAFSSQEPYFSYSGAFKCL RDGAGDVAFIRESTVFEDLSDEAERDEYELLCPDNTRKPVDKFKDCHLARVPSHAVVA RSVNGKEDAIWNLLRQAQEKFGKDKSPKFQLFGSPSGQKDLLFKDSAIGFSRVPPRID SGLYLGSGYFTAIQNLRKSEEEVAARRARVVWCAVGEQELRKCNQWSGLSEGSVTCSS ASTTEDCIALVLKGEADAMSLDGGYVYTAGKCGLVPVLAENYKSQQSSDPDPNCVDRP VEGYLAVAVVRRSDTSLTWNSVKGKKSCHTAVDRTAGWNIPMGLLFNQTGSCKFDEYF SQSCAPGSDPRSNLCALCIGDEQGENKCVPNSNERYYGYTGAFRCLAENAGDVAFVKD VTVLQNTDGNNNEAWAKDLKLADFALLCLDGKRKPVTEARSCHLAMAPNHAVVSRMDK VERLKQVLLHQQAKFGRNGSDCPDKFCLFQSETKNLLFNDNTECLARLHGKTTYEKYL GPQYVAGITNLKKCSTSPLLEACEFLRK" BASE COUNT 1141 a 1230 c 1306 g 1219 t ORIGIN 1 ggtacccagg ctaatcttct ggagttcttc tggagccctc atgatgggtg ctggtcattc 61 ctcctctggg ccctcacact cacttgcttt atgctgcttg gcctcttctt agggagcctc 121 tgtgatagaa gtgcattcct gtgactgtcc ctcccccata gtgctgtgag ctcctgaggg 181 caaggacaga cccctcttct ctgtctctca tggcatctac cagttctggc atgaatgaat 241 gaatgaatga atgaatgaat gaatgctgtt aatagatcag tgaaacttca ttgttttttg 301 gggcagactc atcccctaca tcagcctcct ctgcaaatgg ccttgagggg ctgctctgcc 361 tgtgtccaga tgctcacatc cctgccctgg gcctgggctg ttccaacagg cacagcagga 421 gaacagtctg ccctgttgcc tacccatgct gccttcaggg cactccttaa gctgagcctc 481 ttggtagtgg ccccaggttc tctgtgttct tgccaacact gtaacatact taagagggcc 541 cagggcttga cctcccagtc atcctttttt aaaatttgga ctcagaaaaa gagacacggt 601 tatgatgttt cttaattctt ttataatgat gaaaaggcaa agtcttgttg ccaatttagg 661 atcaaagatg cttcagcact cctgggaatt agtcaatttt gtatttcttc agtatttttg 721 aaagaactta ttgcaattat tgatgatggc aactttaaat ggtgcaatat catgtttcca 781 aacaatgaga gaccttggat ctgtcacccc caaaacccag ctggtgattc tagcaccaaa 841 tcttcagacc ccagtcttca tggcaggcaa gcatgatccc tgattagctc accctgtgct 901 tggcaggatc atgccagaaa tgaggggctc ccagctctta tgcttcacct tgggctgagc 961 tgcttctcca tggctaggtg cccactgcat gctcactctt ggcagagctg gctccctggg 1021 gcattgccta tctgctccag gaaatgcttt ttagtaccaa gtagtctaag cagagtcaag 1081 accagctttt cagaataagc agatttcaga gtcacatatt ggtcagactc acctgccaag 1141 aaggtgttca ggtgacctat tatttccctc actgttagta cctttttttc ttcacttagt 1201 tacccttttt ttttttggtg ggggttacag agtcttactc tattgcccag gctgcagtgc 1261 agtgacatga tcatggctca ctgccacctt catctcccag gctcaaatgg tcctcccact 1321 ttagcctccc aagtagctgg gaccataggc atacaccacc atgctggcta atttttgtat 1381 tttttgtaga gatggggggt ttccctatgt tgcccaggct agtcttgaac tcctgggctc 1441 aagcgatcct cccatcttgg cctcccaaag tgctgggatt acaggcatga gccactgtgc 1501 cctgccctag ttactcttgg gctaagttca catccataca cacaggtctt tctgaggccc 1561 ccaatgtgtc ccacaggtgc catgctgtat gtgacactcc cctagagatg gatgtttagt 1621 ttgcttccaa ctgattaatg gcatgcagtg gtggctggaa acatttgtac ctggggtgct 1681 gtgtgtcatg ggaatgtatt tacgagatgt attcttagaa gctctagctt ttgaattttt 1741 aaatctgaga tttatggcga ttgttaaaat gaggttacca tttcctactg aatactatca 1801 acaccaaaaa agaagaagga ggagatggag aaaaaaaaga caaaaaaaaa agtggtaggg 1861 catcttagcc atagggcatc tttctcattg gcaaataaga acatggaacc agccttgggt 1921 ggtggcattc ccctctgagg tccctgtctg ttttctggga gctgtattgt gggtctcagc 1981 agggcaggga gataccccat gggcagcttg cctgagactc tgggcagcct gctcttttct 2041 ctgtcagctg tccctaggct gctgctgggg gtggtcgggt catcttttcc aactctcagc 2101 tcactgctga gacaaggtga aagctaaaac ccacctgccc taactggctc ctaggcacct 2161 tcaaggtcat ctgctgaaga agatagcagt ctcacaggtc aaggcgatct tcaagtaaag 2221 accctctgct ctgtgtcctg ccctctagaa ggcactgaga ccagagactg ggacagggct 2281 cagggggctt gcgagactcc taggggcttg cagactagtg ggagagaaag aacatcgcag 2341 cagccaggca gaaccaggac aggtgaggtg caggctggct ttcctctcgc agcgcggtgt 2401 ggagtcctgt cctgcctcag ggcttttcgg agcctggatc ctcaaggaac aagtagacct 2461 ggccgcgggg agtggggagg gaaggggtgt ctattgggca acagggcggc aaagccctga 2521 ataaaggggc gcagggcagg cgcaagtggc agagccttcg tttgccaagt cgcctccaga 2581 ccgcagacat gaaacttgtc ttcctcgtcc tgctgttcct cggggccctc ggactgtgtc 2641 tggctggccg taggaggagt gttcagtggt gcgccgtatc ccaacccgag gccacaaaat 2701 gcttccaatg gcaaaggaat atgagaaaag tgcgtggccc tcctgtcagc tgcataaaga 2761 gagactcccc catccagtgt atccaggcca ttgcggaaaa cagggccgat gctgtgaccc 2821 ttgatggtgg tttcatatac gaggcaggcc tggcccccta caaactgcga cctgtagcgg 2881 cggaagtcta cgggaccgaa agacagccac gaactcacta ttatgccgtg gctgtggtga 2941 agaagggcgg cagctttcag ctgaacgaac tgcaaggtct gaagtcctgc cacacaggcc 3001 ttcgcaggac cgctggatgg aatgtcccta taggaacact tcgtccattc ttgaattgga 3061 cgggtccacc tgagcccatt gaggcagctg tggccaggtt cttctcagcc agctgtgttc 3121 ccggtgcaga taaaggacag ttccccaacc tgtgtcgcct gtgtgcgggg acaggggaaa 3181 acaaatgtgc cttctcctcc caggaaccgt acttcagcta ctctggtgcc ttcaagtgtc 3241 tgagagacgg ggctggagac gtggctttta tcagagagag cacagtgttt gaggacctgt 3301 cagacgaggc tgaaagggac gagtatgagt tactctgccc agacaacact cggaagccag 3361 tggacaagtt caaagactgc catctggccc gggtcccttc tcatgccgtt gtggcacgaa 3421 gtgtgaatgg caaggaggat gccatctgga atcttctccg ccaggcacag gaaaagtttg 3481 gaaaggacaa gtcaccgaaa ttccagctct ttggctcccc tagtgggcag aaagatctgc 3541 tgttcaagga ctctgccatt gggttttcga gggtgccccc gaggatagat tctgggctgt 3601 accttggctc cggctacttc actgccatcc agaacttgag gaaaagtgag gaggaagtgg 3661 ctgcccggcg tgcgcgggtc gtgtggtgtg cggtgggcga gcaggagctg cgcaagtgta 3721 accagtggag tggcttgagc gaaggcagcg tgacctgctc ctcggcctcc accacagagg 3781 actgcatcgc cctggtgctg aaaggagaag ctgatgccat gagtttggat ggaggatatg 3841 tgtacactgc aggcaaatgt ggtttggtgc ctgtcctggc agagaactac aaatcccaac 3901 aaagcagtga ccctgatcct aactgtgtgg atagacctgt ggaaggatat cttgctgtgg 3961 cggtggttag gagatcagac actagcctta cctggaactc tgtgaaaggc aagaagtcct 4021 gccacaccgc cgtggacagg actgcaggct ggaatatccc catgggcctg ctcttcaacc 4081 agacgggctc ctgcaaattt gatgaatatt tcagtcaaag ctgtgcccct gggtctgacc 4141 cgagatctaa tctctgtgct ctgtgtattg gcgacgagca gggtgagaat aagtgcgtgc 4201 ccaacagcaa tgagagatac tacggctaca ctggggcttt ccggtgcctg gctgagaatg 4261 ctggagacgt tgcatttgtg aaagatgtca ctgtcttgca gaacactgat ggaaataaca 4321 atgaggcatg ggctaaggat ttgaagctgg cagactttgc gctgctgtgc ctcgatggca 4381 aacggaagcc tgtgactgag gctagaagct gccatcttgc catggccccg aatcatgccg 4441 tggtgtctcg gatggataag gtggaacgcc tgaaacaggt gctgctccac caacaggcta 4501 aatttgggag aaatggatct gactgcccgg acaagttttg cttattccag tctgaaacca 4561 aaaaccttct gttcaatgac aacactgagt gtctggccag actccatggc aaaacaacat 4621 atgaaaaata tttgggacca cagtatgtcg caggcattac taatctgaaa aagtgctcaa 4681 cctcccccct cctggaagcc tgtgaattcc tcaggaagta aaaccgaaga agatggccca 4741 gctccccaag aaagcctcag ccattcactg cccccagctc ttctccccag gtgtgttggg 4801 gccttggctc ccctgctgaa ggtggggatt gcccatccat ctgcttacaa ttccctgctg 4861 tcgtcttagc aagaagtaaa atgagaaatt ttgttg // LOCUS HUMLORI 3321 bp DNA PRI 06-OCT-1992 DEFINITION Human loricrin gene exons 1 and 2, complete cds. ACCESSION M94077 NID g187186 KEYWORDS loricrin. SOURCE Homo sapiens (library: EMBL3 from Dr. Gonzales, NCI, NIH) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3321) AUTHORS Yoneda,K., Hohl,D., McBride,O.W., Wang,M.G., Cehrs,K.U, Idler,W.W. and Steinert,P.M. TITLE The human loricrin gene JOURNAL J. Biol. Chem. 267, 18060-18066 (1992) MEDLINE 92388173 FEATURES Location/Qualifiers source 1..3321 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="EMBL3 from Dr. Gonzales, NCI, NIH" CAAT_signal 235..237 /note="first putative CAAT site" CAAT_signal 248..250 /note="second putative CAAT site" TATA_signal 339..346 exon 378..416 /number=1 intron 417..1604 exon 1605..2814 /number=2 CDS 1628..2578 /codon_start=1 /product="loricrin" /db_xref="PID:g187187" /translation="MSYQKKQPTPQPPVDCVKTSGGGGGGGGTGGGGCGFFGGGGSGG GSSGSGCGYSGGGGYSGGGCGGGSSGGGGGGGIGGCGGGSGGSVKYSGGGGSSGGGSG CFSSGGGGSGCFSSGGGGSSGGGSGCFSSGGGGSSGGGSGCFSSGGGGFSGQAVQCQS YGGVSSGGSSGGGSGCFSSGGGGGSVCGYSGGGSGGGSGCGGGSSGGSGSGYVSSQQV TQTSCAPQPSYGGGSSGGGGSGGSGCFSSGGGGGSSGCGGGSSGIGSGCIISGGGSVC GGGSSGGGGGGSSVGGSGSGKGVPICHQTQQKQAPTWPSK" polyA_signal 2791..2796 BASE COUNT 687 a 895 c 1008 g 731 t ORIGIN 1 agatcttcag tttgactctc ttagggcacc tcaagactct gggacctatt cctcaagcac 61 agccctgtgg tcacctggta tgcgtcctgt gccagaggtt ttggaaacaa tgtctgccat 121 ccactctgac tgggtgaccc cactgatgag cctgccacac tgttgcatca gagaaggggc 181 cagtcacaca ccaggctgcc catctcaaga atgccaaaac cttcatgaat gggccatcct 241 gtgcctgcat cacagggagg tggggccgac agccacgggt cacgtaactg aggccaaaca 301 caagaagctg gcctggatca atgagtcagg gagagctcta tatataacct caggagatca 361 gtcgtcctca cattgccagc atcttctctc ctcactcacc cttcctggtg ctttgggtaa 421 gtgtggttct actgactctc tcattttccc agctggtctt gcccaggcct gactagatta 481 gatggaccag ggcctctttc ccctttggga gctataggac ctctgccttc ccaaaagcac 541 tcacatttag agggcggtca ggaaaggagc aggggatgag ctgctgccat gcagatggtg 601 tttctaggtc ttctggccag aatgtaaact ccacaaagac aagactatct cctgcctctc 661 tggcacccgc atagggcagg catggtgccg ggcacagaag gactctgcag aggctgtcca 721 aggcagcctg tgcacaggct gagcagacct tgtgaacctg tcaggaggag aggctgagcc 781 actctcaaga gagtagggag aatgatccaa aaaagttgca gatgggaggg atttcaggtg 841 ccacagaaat ccaattgctg ttttacagag ttagaagttc tgagagggaa atacaacttc 901 ctactagtca aagtcccttc taccaaactt ggacttgaga aaatcatgga agagatggct 961 aatagcttct ggtcggggat gttggtccaa aggaccacca gggtccttcc cctgtttcca 1021 ggcagggcca cagcagagcc tgtcttttct agtgactagc ccttggttca tggcctagct 1081 cagtgggaca gaccaagagt ttgaactaag agcttctgca gaggatggag tgcaacagcc 1141 ctcaatagaa tgaagtccga caaacctctc tctgttgtgc tgggaatggg taaaatctcc 1201 tatcctaggt agaggtttgt ggtagttttc tatttgcact cagagaatca gtgttgagat 1261 tggaagaggc ctgaaagatg aagtgttcaa accctttcat ttaacaggaa atgagaaaga 1321 ggcgggcgtg tgggtgatct gccctcaaat cacacaacag tggtcagagg cagagctggg 1381 agaaagaatg ggcactgcta actgggctgt ttaaagcata atgaaggctt tctgcagggg 1441 aatgaggaac tcaacttttc caaaagcagc aatcataaga aacccgctga ggctctggca 1501 cctgaaggag ccctgggagg atggcgatgt tgcctatgga tgcagctgcc tccgtaagta 1561 tcctcttgca gctggtccag cgatgctctc ttctcccctt ccaggctctc cttccttctc 1621 agacaagatg tcttatcaga aaaagcagcc cacccctcag cccccagtgg actgcgtgaa 1681 gacctctggc ggcggtggcg gtggcggcgg cacgggcggt ggtggctgcg gcttcttcgg 1741 cggcggcggc tcagggggcg gtagcagcgg ttctggctgc ggctactccg gcggcggtgg 1801 ctactctggc ggcggctgcg gcgggggctc ctccggcggc gggggcgggg gcggcattgg 1861 aggctgcgga gggggctccg gtgggagcgt caagtactcc ggaggcggcg gctcctccgg 1921 cgggggctct ggctgtttct ccagcggtgg gggcggctcc ggctgcttct cctccggtgg 1981 cggcggctcc tccgggggcg gctccggctg cttctccagc ggtgggggcg gctcctccgg 2041 gggcggctcc ggctgcttct cctccggcgg cggcggcttc tcgggccagg cggtccagtg 2101 ccagagctac ggaggcgtct ctagcggcgg ctcctccggg ggcggctccg gctgcttctc 2161 cagcggcggg ggcggcggct ctgtctgcgg ctactctggc ggcggctctg gcggcggctc 2221 tggctgcggc ggaggctcct ctggcggcag cggctccggc tacgtctcct cgcagcaggt 2281 cactcagacc tcgtgcgcgc cccagccgag ttacggaggg gggtcgtccg gcggcggcgg 2341 cagcggcgga agcggctgct tctccagcgg cgggggcggc gggagctccg gctgcggcgg 2401 cggctcctcc gggattggca gcggctgcat catcagtggc gggggctccg tctgcggagg 2461 tggttcctct ggaggcggcg gcggcggctc ctccgtgggt ggctccggga gtggcaaggg 2521 cgtcccgatc tgccaccaga cccagcagaa gcaggcgcct acctggccgt ccaaatagat 2581 cccccagggt accacggagg cgaaggagtt ggaggtgttt tccaggggca ccgatgggct 2641 tagagctctc atgatgctac ccgaggtttg caaatccttc atgtcttaac ctacctggaa 2701 gaagccattg agctctccgg ctgcatctag ttctgctgtt tagcctcttt ggtttctgta 2761 caactacctc ccaaccccag tgcctcagtc aataaatttg caaattcatg agaatcttta 2821 ggtctccaag agtatttgta gtgttcaagt tcatccattc cattccattc ttctcctttc 2881 ctgcattcgc tcattcacac ttacattcat tttacaaatg cacgcactct attggcggaa 2941 aggtgagtgg atagatggga gtcaccagga gagggaggaa tggctggcaa acgacggtca 3001 aacatatgaa cattctagtg tacggcttaa aatcatgacg tgatcccaaa acacaaagat 3061 cctgaggaga gacttcagta aacacctccc tgtggattga gtagctactg tgggtctggt 3121 tactgaactc tgttttttaa agacaggctg ccctggaaag acgctgtctc tcacagtcca 3181 aagccaattc tgcaaggtct gaaagctggg tggctgcaca ttaggagcac cagctcatgc 3241 tactcagact cacggagaaa taaaaagcat atcagatgtt tacgggcccc tagggagagc 3301 aagaccactt tagtaggtac c // LOCUS HUMMC5R 1262 bp DNA PRI 07-JAN-1995 DEFINITION Human melanocortin 5 receptor (MC5R) gene, complete cds. ACCESSION L27080 NID g435599 KEYWORDS melanocortin 5 receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1262) AUTHORS Griffon,N., Mignon,V., Facchinetti,P., Diaz,J., Schwartz,J.C. and Sokoloff,P. TITLE Molecular cloning and characterization of the rat fifth melanocortin receptor JOURNAL Biochem. Biophys. Res. Commun. 200 (2), 1007-1014 (1994) MEDLINE 94234987 FEATURES Location/Qualifiers source 1..1262 /organism="Homo sapiens" /db_xref="taxon:9606" gene 184..1161 /gene="MC5R" CDS 184..1161 /gene="MC5R" /codon_start=1 /product="melanocortin 5 receptor" /db_xref="PID:g435600" /translation="MNSSFHLHFLDLNLNATEGNLSGPNVKNKSSPCEDMGIAVEVFL TLGVISLLENILVIGAIVKNKNLHSPMYFFVCSLAVADMLVSMSSAWETITIYLLNNK HLVIADAFVRHIDNVFDSMICISVVASMCSLLAIAVDRYVTIFYALAYHHIMTARRSG AIIAGIWAFCTGCGIVFILYSESTYVILCLISMFFAMLFLLVSLYIHMFLLARTHVKR IAALPGASSARQRTSMQGAVTVTMLLGVFTVCWAPFFLHLTLMLSCPQNLYCSRFMSH FNMYLILIMCNSVMDPLIYAYRSQEMRKTFKEIICCRGFRIACSFPRRD" BASE COUNT 256 a 353 c 299 g 354 t ORIGIN 1 agactgagtg agcgagccag tcctctgatg cactgtgtat tcatcccctt tcttaggcgg 61 ctgtgttggt tctaggctag ctgctgtctt tctttggtag gctgctaacc tctttggatt 121 gtgaatttaa aacatgtttt acagtaaatt tgctgccaag acaagaggtg tatttctcca 181 gcaatgaatt cctcatttca cctgcatttc ttggatctca acctgaatgc cacagagggc 241 aacctttcag gacccaatgt caaaaacaag tcttcaccat gtgaagacat gggcattgct 301 gtggaggtgt ttctcactct gggtgtcatc agcctcttgg agaacatctt ggtcataggg 361 gccatagtga agaacaaaaa cctgcactcc cccatgtact tcttcgtgtg cagcctggca 421 gtggcggaca tgctggtgag catgtccagt gcctgggaga ccatcaccat ctacctactc 481 aacaacaagc acctagtgat agcagacgcc tttgtgcgcc acattgacaa tgtgtttgac 541 tccatgatct gcatttccgt ggtggcatcc atgtgcagct tactggccat tgcagtggat 601 aggtacgtca ccatcttcta cgccctggcc taccaccaca tcatgacggc gaggcgctca 661 ggggccatca tcgccggcat ctgggctttc tgcacgggct gcggcattgt cttcatcctg 721 tactcagaat ccacctacgt catcctgtgc ctcatctcca tgttcttcgc tatgctgttc 781 ctcctggtgt ctctgtacat acacatgttc ctcctggcgc ggactcacgt caagcggatc 841 gcggctctgc ccggggccag ctctgcgcgg cagaggacca gcatgcaggg cgcggtcacc 901 gtcaccatgc tgctgggcgt gtttaccgtg tgctgggccc cgttcttcct tcatctcact 961 ttaatgcttt cttgccctca gaacctctac tgctctcgct tcatgtctca cttcaatatg 1021 tacctcatac tcatcatgtg taattccgtg atggaccctc tcatatatgc ctaccgcagc 1081 caagagatgc ggaagacctt taaggagatt atttgctgcc gtggtttcag gatcgcctgc 1141 agctttccca gaagggatta agcacaaagt gctcctctct gtggctctgt tctcctttgt 1201 ttgctcacct atgacaaagc gacagcaagc gggtaggcta gggagtgcta gcatccattt 1261 tt // LOCUS HUMMFAP 1330 bp DNA PRI 27-APR-1995 DEFINITION Homo sapiens extracellular matrix protein (MFAP3) gene, complete cds. ACCESSION L35251 NID g786118 KEYWORDS elastic microfibrillar component; extracellular matrix protein. SOURCE Homo sapiens (tissue library: genomic) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1330) AUTHORS Abrams,W.R., Ma,R.I., Kucich,U., Bashir,M.M., Decker,S., Tsipouras,P., McPherson,J.D., Wasmuth,J.J. and Rosenbloom,J. TITLE Molecular cloning of the microfibrillar protein MFAP3 and assignment of the gene to human chromosome 5q32-q33.2 JOURNAL Genomics 26 (1), 47-54 (1995) MEDLINE 95301292 FEATURES Location/Qualifiers source 1..1330 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lung and dermal fibroblasts" /tissue_lib="genomic" /map="5q31.2-q33.3" mRNA 1..>1330 /gene="MFAP3" /note="G00-371-694" exon 1..467 /gene="MFAP3" /note="G00-371-694" /number=1 gene 1..1330 /gene="MFAP3" CDS 173..1261 /gene="MFAP3" /standard_name="microfibrillar-associated protein 3" /codon_start=1 /db_xref="GDB:G00-371-694" /product="extracellular matrix protein" /db_xref="PID:g786119" /translation="MKLHCCLFTLVASIIVPAAFVLEDVDFDQMVSLEANRSSYNASF PSSFELSASSHSDDDVIIAKEGTSVSIECLLTASHYEDVHWHNSKGQQLDGRSRGGKW LVSDNFLNITNVAFDDRGLYTCFVTSPIRASYSVTLRVIFTSGDMSVYYMIVCLIAFT ITLILNVTRLCMMSSHLRKTEKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELA KVTQFKTMEFARYIEELARSVPLPPLILNCRAFVEEMFEAVRVDDPDDLGERIKERPA LNAQGGIYVINPEMGRSNSPGGDSDDGSLNEQGQEIAVQVSVHLQSETKSIDTESQGS SHFSPPDDIGSAESNCNYKDGAYENCQL" exon 468..>1330 /gene="MFAP3" /note="G00-371-694" /number=2 BASE COUNT 373 a 295 c 283 g 379 t ORIGIN 1 cgtcgggttc tctactcaca tcttttaatc ttgaagacta gaaaatataa ctggatctgc 61 cacttgtttg gaaaatatct ctaccaagca ataaattacc cgctgtgctt ttgttgtagt 121 gtagaagttt ttgagttctc caaatctaaa caagattttg tcccattttc ccatgaagct 181 acattgttgc ttattcactt tagtggcaag tattattgtg ccagctgctt ttgttttgga 241 agatgtggac ttcgaccaaa tggtttcact ggaagcaaat cgtagttctt acaatgcatc 301 ctttccctca agctttgaac tctcagcaag ttcccactcg gatgatgacg tcatcatagc 361 caaagaggga actagcgttt caattgagtg tcttctcaca gccagtcact atgaagatgt 421 ccattggcac aattcaaaag gacagcaact ggatggcaga agcagaggtg gaaagtggtt 481 ggtttctgat aacttcctaa acatcaccaa tgtagctttt gatgaccgtg ggctctatac 541 ctgtttcgtc acctctccaa ttcgtgcctc ctactctgtc accctacgtg ttatcttcac 601 ctcgggagac atgagtgtct attacatgat tgtttgcctg attgccttta caatcacact 661 catcttgaat gtcacacggc tgtgcatgat gagcagccat cttcgcaaga ctgagaaggc 721 catcaatgag ttctttagaa ctgaaggggc tgagaaactt cagaaggcct ttgagattgc 781 aaaacgtatc cccatcatta cctcagccaa aactctggag ctcgccaaag tcacacaatt 841 taagaccatg gagtttgctc gttatattga agaactggca agaagtgtcc ctcttccacc 901 tcttattcta aactgtcgag cctttgttga ggagatgttt gaggctgtgc gagtggatga 961 ccctgatgac ctgggtgaaa gaattaaaga gagacctgcc ttgaatgctc aaggtggcat 1021 ctatgtcatt aacccagaga tgggacggag taattcacca ggaggagatt cagatgatgg 1081 ctctctgaat gaacaaggcc aggaaatagc agttcaggtt tctgtccacc ttcagtcaga 1141 aaccaaaagt attgatacag agtctcaagg cagcagtcat ttcagtccac ctgatgatat 1201 aggatctgca gaatctaact gtaactacaa agatggggca tatgaaaact gtcagctgta 1261 acctacaatg ctgtaaccca gtacctacaa aatcagctcg ctctcagaaa aggaacctgt 1321 ttcttagaag // LOCUS HUMMHDRB5A 1172 bp DNA PRI 07-JAN-1995 DEFINITION Human MHC class II HLA-DR-beta DR2 gene, complete cds. ACCESSION M35159 NID g188286 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Homo sapiens (strain African American) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1172) AUTHORS Demopulos,J.T., Hodge,T.W., Wooten,V. and Acton,R.T. TITLE A novel DRB1 allele in DR2-positive American blacks JOURNAL Unpublished (1990) COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.W.Hodge, 12-JUN-1990. A thymidine to cytosine transition at nucleotide position 88 results in an amino acid change from tyrosine to histidine at residue 30. Author address: T.W.Hodge; A-25 Bldg 1 Rm 1226 Centers for Disease Control 1600 Clifton Rd. Atlanta, GA 30333. FEATURES Location/Qualifiers source 1..1172 /organism="Homo sapiens" /strain="African American" /db_xref="taxon:9606" /haplotype="DR2" /map="6p21.3" sig_peptide 6..92 /gene="HLA-DRB1" CDS 6..806 /gene="HLA-DRB1" /note="MHC HLA-DR-beta chain precursor" /codon_start=1 /db_xref="GDB:G00-120-642" /db_xref="PID:g386931" /translation="MVCLKLPGGSYMAKLTVTLMVLSSPLASAGDTRPRFLQQDKYEC HFFNGTERVRFLHRDIYNQEEDLRFDSDVGEYRAVTELGRPDAEYWNSQKDFLEDRRA AVDTYCRHNYGVGESFTVQRRVEPKVTVYPARTQTLQHHNLLVCSVNGFYPGSIEVRW FRNSQEEKAGVVSTGLIQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSVTSPLTVEWR AQSESAQSKMLSGVGGFVLGLLFLGAGLFIYFKNQKGHSGLHPTGLVS" gene 6..806 /gene="HLA-DRB1" mat_peptide 93..803 /gene="HLA-DRB1" /note="MHC-HLA-DR-beta chain" BASE COUNT 272 a 315 c 324 g 261 t ORIGIN 1 ccagcatggt gtgtctgaag ctccctggag gttcctacat ggcaaagctg acagtgacac 61 tgatggtgct gagctcccca ctggcttcgg ctggggacac ccgaccacgt ttcttgcagc 121 aggataagta tgagtgtcat ttcttcaacg ggacggagcg ggtgcggttc ctgcacagag 181 acatctataa ccaagaggag gacttgcgct tcgacagcga cgtgggggag taccgggcgg 241 tgacggagct ggggcggcct gacgctgagt actggaacag ccagaaggac ttcctggaag 301 acaggcgcgc cgcggtggac acctactgca gacacaacta cggggttggt gagagcttca 361 cagtgcagcg gcgagttgag cctaaggtga ctgtgtatcc tgcaaggacc cagaccctgc 421 agcaccacaa cctcctggtc tgctctgtga atggtttcta tccaggcagc attgaagtca 481 ggtggttccg gaacagccag gaagagaagg ctggggtggt gtccacaggc ctgattcaga 541 atggagactg gaccttccag accctggtga tgctggaaac agttcctcga agtggagagg 601 tttacacctg ccaagtggag cacccaagcg tgacgagccc tctcacagtg gaatggagag 661 cacagtctga atctgcacag agcaagatgc tgagtggagt cgggggcttt gtgctgggcc 721 tgctcttcct tggggccggg ctattcatct acttcaagaa tcagaaaggg cactctggac 781 ttcacccaac aggactcgtg agctgaagtg cagatgacca cattcaaggg ggaaccttct 841 gccccagctt tgcatgatga aaagctttcc tgcttggctc ttattcttcc acaagagagg 901 actttctcag gccctggttg ctaccggttc agcaactctg cagaaaatgt ccatccttgt 961 ggcttcctca gctcctgccc cttggcctga agtcccagca ttgatggcag tgcctcatct 1021 tcaactttag tgctcccctt tacctaaccc tacggcctcc catgcatctg tactccccct 1081 gtgtgccaca aatgcactac gttattaaat ttttctgaag cccagagtta aaaatcatct 1141 gtccacctgg ctccaaagac aaaaaataaa aa // LOCUS HUMMHHSPHO 3330 bp DNA PRI 07-MAR-1995 DEFINITION Human MHC class III HSP70-HOM gene (HLA), complete cds. ACCESSION M59829 M34268 NID g188491 KEYWORDS class III gene; complement system protein; heat shock-induced protein; major histocompatibility complex. SOURCE Human DNA, clone H92. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3330) AUTHORS Milner,C.M. and Campbell,R.D. TITLE Structure and expression of the three MHC-linked HSP70 genes JOURNAL Immunogenetics 32 (4), 242-251 (1990) MEDLINE 91055806 FEATURES Location/Qualifiers source 1..3330 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H92" /haplotype="HLA:A2,B7,C2C,Bfs,C4A3,C4BQ0,DR2" gene 960..2885 /gene="HSP70-HOM" CDS 960..2885 /gene="HSP70-HOM" /codon_start=1 /product="heat shock-induced protein" /db_xref="PID:g188492" /translation="MATAKGIAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYV AFTDTERLIGDAAKNQVAMNPQNTVFDAKRLIGRKFNDPVVQADMKLWPFQVINEGGK PKVLVSYKGENKAFYPEEISSMVLTKLKETAEAFLGHPVTNAVITVPAYFNDSQRQAT KDAGVIAGLNVLRIINEPTAAAIAYGLDKGGQGERHVLIFDLGGGTFDVSILTIDDGI FEVKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQNKRAVRRLRTACERAKRTL SSSTQANLEIDSLYEGIDFYTSITRARFEELCADLFRGTLEPVEKALRDAKMDKAKIH DIVLVGGSTRIPKVQRLLQDYFNGRDLNKSINPDEAVAYGAAVQAAILMGDKSEKVQD LLLLDVAPLSLGLETVGGVMTALIKRNSTIPPKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFDLTGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKVNKITITND KGRLSKEEIERMVLDAEKYKAEDEVQREKIAAKNALESYAFNMKSVVSDEGLKGKISE SDKNKILDKCNELLSWLEVNQLAEKDEFDHKRKELEQMCNPIITKLYQGGCTGPACGT GYVPGRPATGPTIEEVD" BASE COUNT 951 a 738 c 867 g 774 t ORIGIN Chromosome 6p21.3. 1 ggatcctatg agcctgggag gtcaggactg cagtgagcca tgattacacc actgcagtgc 61 agcctgcgtg acaaaacgag accctgtctc taaaaaatga gaaaaaaaaa tggttgttac 121 caggcgataa agggagggga aaacgggagt tacttaatga gtatacagtt tcagttttgc 181 gagatgaaca gaattctgga aattggttga acaccgctgt gattgaactc actaccaaac 241 tctacactta aaaatggtta agatggtaca atttgtatgt attttaccac aataaaaaat 301 aaaaaaaagg ctgggcgaga tgttcactcc tgtaatccca gtacttgggg aggctggggc 361 tgaaggatcg tttgagccct gaaggagttt gagaccagcc tgagcaacat aaggagaccc 421 catctgtaca caaaattaaa acattagcca ggcagagagc tggtcacggt ggctcacgta 481 tgtaatccca gcactttggg aggccgaggc gggcgggcgg atcacctgag gtcaggagtt 541 tgagaccagc ctggccaaca tagtgaaacc gtgaaacccc atctctacta aaaatacaaa 601 aattagctgg gcgtggtggt gccctcataa tcccagccac tcgggaggct gagacaggag 661 aatcgcttga actcaggagg tggaggttgc agtgagccta gatcacacca ctgcagtcca 721 aagcaagact ccgtctcaaa aaaaaaaaaa attagcccgg ctgttgtctc cagttattct 781 ggaggctaag gcaggaagat tgctggagcc taggagatca aagctgcagt gagctatgac 841 tgcgcctctg cactccaacc tgggtgacag aggaagaccc tgtctcaaaa aaataaataa 901 cattgaaaag gaactctccc aaaagtatct tattctttct ccataggcct cagagaacca 961 tggctactgc caagggaatc gccataggaa tcgacctggg caccacctac tcctgtgtgg 1021 gggtgttcca gcacggcaag gtggagatca tcgccaacga ccagggcaac cgcaccaccc 1081 ccagctacgt ggccttcaca gacaccgagc ggctcattgg ggatgcggcc aagaaccagg 1141 tagcaatgaa tccccagaac actgtttttg atgctaaacg tctgatcggc aggaaattta 1201 atgatcctgt tgtacaagca gatatgaaac tttggccttt tcaagtgatt aatgaaggag 1261 gcaagcccaa agtccttgtg tcctacaaag gggagaataa agctttctac cctgaggaaa 1321 tctcttcgat ggtattgact aagttgaagg agactgctga ggcctttttg ggccaccctg 1381 tcaccaatgc agtgattacc gtgccagcct atttcaatga ctctcaacgt caggctacta 1441 aggatgcagg tgtgattgct ggacttaatg tgctaagaat catcaatgag cccacggctg 1501 ctgccattgc ctatggttta gataaaggag gtcaaggaga acgacatgtc ctgatttttg 1561 atctgggtgg aggcacattt gatgtgtcaa ttctgaccat agatgatggg atttttgagg 1621 taaaggccac tgctggggac actcacctgg gtggggagga ctttgacaac aggcttgtga 1681 gccacttcgt ggaggagttc aagaggaaac acaaaaagga catcagccag aacaagcgag 1741 ccgtgaggcg gctgcgcacc gcctgcgaga gggccaagag gaccctgtcg tccagcaccc 1801 aggccaacct agaaattgat tcactttatg aaggcattga cttctataca tccatcacca 1861 gagctcgatt tgaagagttg tgtgcagacc tgtttagggg taccctggag cctgtagaaa 1921 aagcgcttcg ggatgccaag atggataagg ctaaaatcca tgacattgtt ttagtagggg 1981 gctccacccg catccccaag gtgcagcggc tgcttcagga ctacttcaat ggacgtgatc 2041 tcaacaagag catcaaccct gatgaggccg tagcatatgg ggctgcggta caagcagcca 2101 tcctgatggg ggacaagtct gagaaggtac aggacctgct gctgctggac gtggctcccc 2161 tgtccctggg tctggagacg gttgggggcg tgatgactgc cctgataaag cgcaactcca 2221 ccatcccacc caagcagaca cagattttca ccacctactc tgacaaccaa cccggggtgc 2281 tgatccaggt gtatgagggc gagagggcca tgacaaagga caacaacctg ctggggcggt 2341 ttgatctgac tggaatccct ccagcaccca ggggagttcc tcagatcgag gtgacgtttg 2401 acattgatgc caatggtatt ctcaatgtca cagccacgga caagagcacc ggcaaggtga 2461 acaagatcac catcaccaat gacaagggcc gcctgagcaa ggaggagatt gagcggatgg 2521 ttctggatgc tgagaaatat aaagctgaag atgaggtcca gagggagaaa attgctgcaa 2581 agaatgcctt agaatcctat gcttttaaca tgaagagtgt tgtgagtgat gaaggtttga 2641 agggcaagat tagtgagtct gataaaaata aaatattgga taaatgcaac gagctccttt 2701 cgtggctgga ggtcaatcaa ctggcagaga aagatgagtt tgatcataag agaaaggaat 2761 tggagcagat gtgtaaccct atcatcacaa aactctacca aggaggatgc actgggcctg 2821 cctgcggaac agggtatgtg cctggaaggc ctgccacagg ccccacaatt gaagaagtag 2881 attaattctt tttagaactg aagcatccta ggatgcctct acatgtattt cattcccctc 2941 atgttgaaac atcattatta ttcttgacca gacctgaatc taagttacca tcccttggaa 3001 attctggaga aggagtctca tgcaccacct atcacactcc ctcacatcct gtttctgact 3061 ttggaatgga ctcaggaaaa ctaggcccct ctttaaccgt gtgatgtatt tgaatgtctg 3121 ttatttccag ccaccctaac attcttcttc ctgtgtggat gcttatttgt caatcagtaa 3181 atttgttcgt aaagaaaatt acttctggta tttaggctgt gaatgtacct tgaaggggag 3241 agttcatgga gagagcatgt gttctctgat tgtgaggtca ctgtgaatga ttaaattggt 3301 aagggtaaag tatttgaatt ttcatgaact // LOCUS HUMMR 1083 bp DNA PRI 18-FEB-1993 DEFINITION Homo sapiens melanortin receptor gene, complete cds. ACCESSION L06155 NID g188673 KEYWORDS melanortin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1083) AUTHORS Gantz,I., Konda,Y., Tashiro,T., Shimoto,Y., Munzert,G., DelValle,J. and Yamada,T. TITLE Molecular cloning of a novel melanortin receptor JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1083 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1083 /codon_start=1 /product="melanortin receptor" /db_xref="PID:g188674" /translation="MSIQKKYLEGDFVFPVSSSSFLRTLLEPQLGSALLTAMNASCCL PSVQPTLPNGSEHLQAPFFSNQSSSAFCEQVFIKPEIFLSLGIVSLLENILVILAVVR NGNLHSPMYFFLCSLAVADMLVSVSNALETIMIAIVHSDYLTFEDQFIQHMDNIFDSM ICISLVASICNLLAIAVDRYVTIFYALRYHSIMTVRKALTLIVAIWVCCGVCGVVFIV YSESKMVIVCLITMFFAMMLLMGTLYVHMFLFARLHVKRIAALPPADGVAPQQHSCMK GAVTITILLGVFIFCWAPFFLHLVLIITCPTNPYCICYTAHFNTYLVLIMCNSVIDPL IYAFRSLELRNTFREILCGCNGMNLG" BASE COUNT 208 a 361 c 251 g 263 t ORIGIN 1 atgagcatcc aaaagaagta tctggaggga gattttgtct ttcctgtgag cagcagcagc 61 ttcctacgga ccctgctgga gccccagctc ggatcagccc ttctgacagc aatgaatgct 121 tcgtgctgcc tgccctctgt tcagccaaca ctgcctaatg gctcggagca cctccaagcc 181 cctttcttca gcaaccagag cagcagcgcc ttctgtgagc aggtcttcat caagcccgag 241 attttcctgt ctctgggcat cgtcagtctg ctggaaaaca tcctggttat cctggccgtg 301 gtcaggaacg gcaacctgca ctccccgatg tacttctttc tctgcagcct ggcggtggcc 361 gacatgctgg taagtgtgtc caatgccctg gagaccatca tgatcgccat cgtccacagc 421 gactacctga ccttcgagga ccagtttatc cagcacatgg acaacatctt cgactccatg 481 atctgcatct ccctggtggc ctccatctgc aacctcctgg ccatcgccgt cgacaggtac 541 gtcaccatct tttacgcgct ccgctaccac agcatcatga ccgtgaggaa ggccctcacc 601 ttgatcgtgg ccatctgggt ctgctgcggc gtctgtggcg tggtgttcat cgtctactcg 661 gagagcaaaa tggtcattgt gtgcctcatc accatgttct tcgccatgat gctcctcatg 721 ggcaccctct acgtgcacat gttcctcttt gcgcggctgc acgtcaagcg catagcagca 781 ctgccacctg ccgacggggt ggccccacag caacactcat gcatgaaggg ggcagtcacc 841 atcaccattc tcctgggcgt gttcatcttc tgctgggccc ccttcttcct ccacctggtc 901 ctcatcatca cctgccccac caacccctac tgcatctgct acactgccca cttcaacacc 961 tacctggtcc tcatcatgtg caactccgtc atcgacccac tcatctacgc tttccggagc 1021 ctggaattgc gcaacacctt tagggagatt ctctgtggct gcaacggcat gaacttggga 1081 tag // LOCUS HUMMTALD 6074 bp DNA PRI 15-APR-1996 DEFINITION Human mitochondrial aldehyde dehydrogenase x gene, complete cds. ACCESSION M63967 NID g337184 KEYWORDS aldehyde dehydrogenase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6074) AUTHORS Hsu,L.C. and Chang,W.C. TITLE Cloning and characterization of a new functional human aldehyde dehydrogenase gene JOURNAL J. Biol. Chem. 266 (19), 12257-12265 (1991) MEDLINE 91286241 FEATURES Location/Qualifiers source 1..6074 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1301..1317 exon 2257..3966 CDS 2266..3819 /codon_start=1 /product="aldehyde dehydrogenase" /db_xref="PID:g1263008" /translation="MLRFLAPRLLSLQGRTALYSSAAALPSPILNPDIPYNQLFINNE WQDAVSKKTFPTVNPTTGEVIGHVAEGDRADVDRAVKAAREAFRLGSPWRRMDASERG RLLNLLADLVERDRVYLASLETLDNGKPFQESYALDLDEVIKVYRYFAGWADKWHGKT IPMHGQHFCFTRHEPVGVCGQIIPWNFPLVMQGWKLAPALATGNTVVMKVAEQTPLSA LYLASLIKEAGFPPGVVNIITGYGPTAGAAIAQHMDVDKVAFTGSTEVGHLIQKAAGD SNLKRVTLELGGKSPSIVLADADMEHAVEQCHEALFFNMGQCCCAGSRTFVEESIYNE FLERTVEKAKQRKVGNPFELDTQQGPQVDKEQFERVLGYIQLGQKEGAKLLCGGERFG ERGFFIKPTVFGGVQDDMRIAKEEIFGPVQPLFKFKKIEEVVERANNTRYGLAAAVFT RDLDKAMYFTQALQAGTVWVNTYNIVTCHTPFGGFKESGNGRELGEDGLKAYTEVKTV TIKVPQKNS" mutation 2448 mutation 2552 mutation 2585 repeat_region 4803..5137 repeat_region 5613..5924 BASE COUNT 1475 a 1505 c 1594 g 1500 t ORIGIN 1 ctgcagcctc cattcaaaga gcaggaagcc aggtcaagga gcggctacag ccacagcggc 61 cggtccacag ccagaaagct ggagaaggga caaggaatgc tgccgcagaa catctcccat 121 ttctgctatc ctgctagtgc caccctacag gggcaggaca ataccctcca atgcgtgccc 181 gccttccaca tccggcagag tactactaat tggcagcacc tagtccacat ccagaactct 241 aactgcaaat gaggctggga aacatggcgg ttagctttgg gacctcccta gccagaagga 301 aggtgcagcg agggtggagc gagtgagatg catgttcagt ggtccatctt ggcaaccttt 361 atagcaacaa aatggagaca attaaatgtc caacagttta cccctacaac agaattctat 421 gccaccttgg agaggatgat gtcatctgta attagtgata ggagaaaact tcaaaataca 481 gcgctaagtt tttaaatttt ttttaaagca gcacaacaga ggttactata tgattttatt 541 tttattaaaa taattttgaa agatgcagag gaaacaatac tagcaggacg aacaccgtaa 601 taatggctac cgtaggtgaa gagattacag gtgatatttt tcttcttttt tcatcctttc 661 ctgtgttttc caagctctac tgcatacctt ctttttgtcc ttaggacaaa aatgtacttt 721 gaaaaatgcc caaatataaa atcttcttga aggaactgaa gaaaggagca atgaaagtgg 781 aggacagaaa ttagtgaatc ctggacaaaa aaagtgaggg agaggcgagg cagagaaggg 841 gttaagggga gtctcaagga tggcagtcga ttaagcagcg tcttgcagcg gggagaaatg 901 caaagtgtag ttggggatcg gtctccgaaa actttgctgt gtgacctggg gaagcccctg 961 cccctctctg ggtatctctg tttccgctcc caaaccagtg ccttaggacc ctagatctga 1021 gcaacgtgga cactgggctt gcggagaccc gagggaggga gcgctgaagc ggtgcgggct 1081 gccgggggta gaggggcggg gaccgggaaa gaccgggctg ggcgagggag gagcggcctt 1141 ggccggcgac aggacgtagg agcgccccag gcgcagcgga gcctcattgg ccgctgagcc 1201 ccgggctgcg cggaggcggg acctgcggcc agccctgggc ggccatgtgg acagagctgg 1261 gagggccgga accagaaccc aagcgtgatc ctgaaccgga gcccgagcct gctgcaggta 1321 actaacgctg gtctcccctc ggctccctcg ggaagccgca gctcctggct ccgcgtgggg 1381 ggctttcctc tctgggccgc gtcgacactc ttcagagttg tagcttttct tctcagctgc 1441 ctcctgactt gctgcacctc tgggaaagct gaaaagggat tctgagacct gtggttgggg 1501 gatgctgggc taggtcaatt tctgaggcac ggccagttct gatgtcctga gtggagcttg 1561 ccagggtttt tatatgcaag acatcattca acataacccc atgaggcatt tcacatatgg 1621 gggaactgag gcagagaggt gaaatgaagc aagtccctct cagagaacaa ccatctcacc 1681 gtccctttga tttctactca agtcttaggt caattcttag acagcctttg tggcaatttc 1741 agtgttttct cagtgtatca ttcagtactg tgtgttgagc cgctgctctg tagcaggcat 1801 acctctaggc tcggggacat agcagtgaat ggaacagaaa aataagttcc tgtgctcatg 1861 gagcttgcat tttattgggg ggagacagac aaacatataa gcctgtaaag atagtatttg 1921 ggatgtgaga tagagaagga gtaaagcagg gtagcgggga tagagagtgt tggggtggga 1981 agggttgcaa ttttttaaag ggtggtcagg gaaggccttg ctgagacagt ggcttttgag 2041 acccagcaga ggtgagggag tgagctgagg ataggcggga cttgatggag ttggcccaga 2101 gagttcatca gggccctcac agctcttaca gtctgtgttt ttagaggtga cagtccttta 2161 tgctggaatc ttgaaatgtt tgagctggtg ggcccttggg taccgccacc tgccttctcc 2221 cacctgttca ccctggtttc ttttgtccct ctccagagtg tcagcatgct gcgcttcctg 2281 gcaccccggc tgcttagcct ccagggcagg accgccctct actcctcggc agcagccctc 2341 ccaagcccca ttctgaaccc agacatcccc tacaaccagc tgttcatcaa caatgaatgg 2401 caagatgcag tcagcaagaa gaccttcccg acggtcaacc ctaccaccgg ggaggtcatc 2461 gggcacgtgg ctgaaggtga ccgggctgat gtggatcggg ccgtgaaagc agcccgggaa 2521 gccttccgcc tggggtcccc atggcgccgg atggatgcct ctgagcgggg ccggctgctg 2581 aacctcctgg cagacctagt ggagcgggat cgagtctact tggcctcact cgagaccttg 2641 gacaatggga agcctttcca agagtcttac gccttggact tggatgaggt catcaaggtg 2701 tatcggtact ttgctggctg ggctgacaag tggcatggca agaccatccc catgcatggc 2761 cagcatttct gcttcacccg gcatgagccc gttggtgtct gtggccagat catcccgtgg 2821 aacttcccct tggtcatgca gggttggaaa cttgccccgg cactcgccac aggcaacact 2881 gtggttatga aggtggcaga gcagaccccc ctctctgccc tgtatttggc ctccctcatc 2941 aaggaggcag gctttccccc tggggtggtg aacatcatca cggggtatgg cccaacagca 3001 ggtgcggcca tcgcccagca catggatgtt gacaaagttg ccttcaccgg ttccaccgag 3061 gtgggccacc tgatccagaa agcagctggc gattccaacc tcaagagagt caccctggag 3121 ctgggtggta agagccccag catcgtgctg gccgatgctg acatggagca tgccgtggag 3181 cagtgccacg aagccctgtt cttcaacatg ggccagtgct gctgtgctgg ctcccggacc 3241 ttcgtggaag aatccatcta caatgagttt ctcgagagaa ccgtggagaa agcaaagcag 3301 aggaaagtgg ggaacccctt tgagctggac acccagcagg ggcctcaggt ggacaaggag 3361 cagtttgaac gagtcctagg ctacatccag cttggccaga aggagggcgc aaaactcctc 3421 tgtggcggag agcgtttcgg ggagcgtggt ttcttcatca agcctactgt ctttggtggc 3481 gtgcaggatg acatgagaat tgccaaagag gagatctttg ggcctgtgca gcccctgttc 3541 aagttcaaga agattgagga ggtggttgag agggccaaca acaccaggta tggcctggct 3601 gcggctgtgt tcacccggga tctggacaag gccatgtact tcacccaggc actccaggcc 3661 gggaccgtgt gggtaaacac ctacaacatc gtcacctgcc acacgccatt tggagggttt 3721 aaggaatctg gaaacgggag ggagctgggt gaggatgggc ttaaggccta cacagaggta 3781 aagacggtca ccatcaaggt tcctcagaag aactcgtaag gcagctgtca gggaggccca 3841 gtcacagtcc agcaattcca caaccacctt gacgaatgct tgccaagctg ttttaaagcc 3901 aagaacaccc tttctttgtt ccaaattaac tcttagaaga aaccccacaa ataaagcaat 3961 tcaatcaagg ctgttctatt taaatcagag atggggacca ggctcagagt tctacctatc 4021 taacccccaa ccacagcccc cttggtggcc catgagttgc ttccatgaaa tcttaggagt 4081 ctctggagga cagattaaaa accagtgatc tgtaatttgt agctcttcct gctgatccaa 4141 ggactttccc atgggtgcgc ttgatggttt agtggatcga ctcaactcag aacacaagct 4201 tggaaagtgt taggggtttg aactaggtgg atactaaatc tcggccccac tcttcattgg 4261 cttaacctaa aaaccagagg tgcttttcct tgtctgtgtg ccagttgctg gctgttttag 4321 ttgcttgccc ttcattttgc tactgatttt ccttaatttg tgggaaggag taggcaaaga 4381 atatgcttac atgattacac ctgtaaagta agcccaaaca tcccaaatgt ccgtcaactg 4441 atgagtggat taataaaatg tttccatgga atattccttg gattactcag ccataaaaag 4501 gaatgaagta ctgacacatg ctgtgacatc agtgaaccct gaaaacatcc ttctcagtga 4561 aacaagccag agagatgtac aaggctacag actgtatgat tccatttata tgaaatatac 4621 atactaggca aatccatgga gatggaacat agattagtgg ttgccagcgg atagaggagt 4681 aattgttagt gggcatggga tttgtttttg ggaggttttg aaaatgttct ggagttgaac 4741 aatagtaatg gttgcatgat ttggtaaaaa tactaaaaac tatggaattg tttaattatg 4801 atatgaatta atttctttct ttttttcttt tttttttttt ttttttgaca cggagtctca 4861 ctctgtcgcc caggctggag tgcagtggtg tgatctcgtc tcactcgaac ctccgcctcc 4921 cagattcaag cgattctcct gtctcagcct cctgagtagc tgggattata ggtacatgcc 4981 atcacacctg gctaattttt gtatttttag tagagatgag gtttcaccat tttggccagg 5041 ctgatcttga actcctgacc tcaggtgatc cacccgcctc agcctcccaa agtgctggga 5101 ttacaggtgg gagccacaac acccggccat gaattaactc cgttaaaaaa taaacgtata 5161 cattctgtga gcaaatccat tttgtctcca tattgtcctg ctgctgaaca ttttatagag 5221 tgtgggggat ggaaggaccc aggtggtcta gtcgaccagt caggtgcaaa atcccctccc 5281 caatgtttct gtttttgttt tctcttggat gggtgacaaa gtgcaactgt aagacccgta 5341 gagaaaactc tggttcctgc tcagaatggg cccatcttgt tggactcgtt tccacagccc 5401 cccacccctt ccaaaccata cccacccttc aagccaagcc tagcatgggg gccaccttga 5461 catctgggga tttccagagt ttcaaacctc agagttcata atgtttattg ttagtcttgc 5521 tacattcatt ccccaactga tgggagtgaa ggcaaatcca agccagccag gccacaatac 5581 agaagcagaa tgctgagcct ccctctctcg gtccatcttt ctagaccatg ctgcttacaa 5641 gccaccttac gttcaagtcc ccccattcct gtaaatgcac accccactca tctctgcttc 5701 ccaggctcaa gcaatcctcc cacctgagcc tcccaagtag ctgggactac aggcactcac 5761 caccatgcct ggctaatttt tttttttttt ttggtatttt ttgtagagat gaggtttcac 5821 catgtttccc aggcttgtct caaactcctg gtctcaagca accctcctgc cacggcctcc 5881 caaagtgctg gaattacagg catgagccac catgcctggc cccagtcaca tcctattgtc 5941 ttacctaggc aacatgatga ttttgtttat tatcataact tgggtttggc tggaaactgc 6001 tgcttgaggg ttcagagtct gtttggttcc ctcacctccc ttcctaggac ctcctttcta 6061 caccatttgt ctac // LOCUS HUMMYCL2A 3854 bp DNA PRI 07-JAN-1995 DEFINITION Human MYCL2 gene, complete cds. ACCESSION J03069 NID g188952 KEYWORDS c-myc proto-oncogene; proto-oncogene; repeat region. SOURCE Human peripheral blood leukocytes DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3854) AUTHORS Morton,C.C., Nussenzweig,M.C., Sousa,R., Sorenson,G.D., Pettengill,O.S. and Shows,T.B. TITLE Mapping and characterization of an X-linked processed gene related to MYCL1 JOURNAL Genomics 4 (3), 367-375 (1989) MEDLINE 89233129 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.Morton, 18-JAN-1989. FEATURES Location/Qualifiers source 1..3854 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /tissue_type="peripheral blood" /map="Xq22-q28" repeat_region 209..479 /note="Alu repetitive element" mRNA 920..3854 /gene="MYCL2" /note="G00-120-209" gene 920..3854 /gene="MYCL2" CDS 1040..2113 /gene="MYCL2" /codon_start=1 /db_xref="GDB:G00-120-209" /db_xref="PID:g188953" /translation="MDRDSYHHYFYDYDGGEDFYRSTTPSEDIWKKFELVPPPWTWVR SREPSPQLWSPGTWPVGCAGDETESQDYWKAWDANYASLIRRDCMWSGFSTQEPLERA VSDLLAVGAPSGYSPKEFATPDYTPELEAGNLAPIFPCLLGEPKIQACSRSESPSDSE GEEIDVTVKKRQSLSTRKPVIIAVRADLLDPRMNLFHISIHQQQHNYAAPFPPESCFQ EGAPKRMPPKEALEREAPGGKDDKEDEEIVSLPPVESEAAQSCQPKPIHYDTENWTKK KYHSYLERKRRNDQRSRFLALRDEVPALASCSRVSKVMILVKATEYLHELAEAEERMA TEKRQLECQRRQLQKRIEYLSSY" repeat_region 3483..3734 /note="Alu repetitive element" BASE COUNT 1065 a 928 c 944 g 917 t ORIGIN 204 bp upstream of SacI site; chromosome Xq22-q25. 1 tgattcatct ggtgaaagac gagactccaa tatctaaaaa aaagaaaaaa aaatagggct 61 aataaatgat ttaagcaaag atctgtagga tacaagggca acatacaaaa atcatcattt 121 atatttatat acactaacag tgaacaatct gaaaaagaaa tattaaaaaa ttataattaa 181 aatatccgca ctgcaggtgg agctcattgc tcatgcctgt aatcccaaca ctttaggagg 241 ctgaggcata ccgaccactt gcggtcagga gtcaagacca gcctggccaa catggcgaaa 301 cctcgtctct actagaaata caaaaaataa aaataaaaat aaattaacca ggcgtggtgg 361 cccacgcgcc cctgtagtcg cagctacttt ggaggctgag gtgggagaat cacttgaact 421 cgggaggcgg aggtcgcagc gagcagagat tgagccactg cactccaacc tgggtgacac 481 aagaaagaaa gaaaatgaag gaaagaagaa ggaaggaaag aaagaaggaa agaaggaagg 541 aaggaagaaa ggaaggaagg aaggaaggaa aaaaatagct ggacatgatg gaggactagc 601 atttctcaat ttcaaaacgt actacaaacc acactaatca aaacaatgtg gtactggcat 661 aaggatagac atatagatca atggaataga attgagagtc agaaacccat acatctaagg 721 tcaactgatt ttcaaagaga tgtcaagacc atgcaattgg aaaagaataa tctcttcaac 781 aaatggtgct ggaatacttg gatactcaca tgcaaaagaa tgaagctggg cccttacctc 841 acgccattta caaaaaataa ctcaaaatga accaaagacc taaatataag agctaaaatt 901 gtaagcctct tagaaataaa cagagggcgg gtcgcgcgct cggtggccgt tgtgcgcgtg 961 tgtggagtgc cctgctgccc ccagctggag gggaactagt ctgctccagg tggcaagctg 1021 cgtgagcaag caagccaaca tggaccgcga ctcgtaccat cactatttct acgactatga 1081 cggcggggag gatttctacc gctccacgac gcccagcgag gacatctgga agaaattcga 1141 gttggtgccg ccgccctgga cttgggtccg cagccgggaa cccagccctc agctttggtc 1201 tcctggaacg tggccggtag ggtgcgctgg ggacgagacg gaatcccagg actactggaa 1261 agcttgggac gcgaactacg cctccctcat ccgccgtgac tgcatgtgga gcggcttctc 1321 cacccaggag ccgctggaga gagcggtgag tgacctgctt gccgttggcg cgccctcggg 1381 atactcgccc aaggagttcg ccacccccga ctacactccc gagctcgaag ccggcaacct 1441 agcgcccatc ttcccctgtt tgttgggcga gcccaagatc caggcctgct ccaggtctga 1501 gagcccaagc gactccgagg gtgaagaaat cgacgtgaca gtaaagaaga ggcagtcttt 1561 gagtacgcgg aagccagtca tcatcgcggt gcgtgcagac cttctggatc cccgcatgaa 1621 tctcttccac atctccatcc accagcaaca gcacaactat gctgcccctt ttcctccaga 1681 aagctgcttc caagaagggg ctccaaagag gatgccccca aaagaggctc tagagagaga 1741 agctccaggg ggaaaggatg ataaggaaga tgaagagatt gtgagcctcc cacctgtaga 1801 aagtgaggct gcccagtcct gccagcccaa acccatccat tatgatactg agaattggac 1861 caagaagaag taccacagct acctggagcg caagagacgg aatgatcaac gttcgcggtt 1921 cttggccctg agggacgagg tacccgccct ggccagctgc tctagggttt ccaaagtaat 1981 gatcctagtc aaggccacgg aatacttaca tgaactggcg gaagccgagg agaggatggc 2041 tacggagaaa aggcagctcg aatgccagcg acggcaattg cagaaaagaa ttgagtacct 2101 cagtagctac tgaccaaaaa gcctgaccat tctgtcttaa aaagacacaa gttttctttt 2161 tgatctccct ctccccttta gtaacttgta catttttgtt acagcaggac actctggaca 2221 gtagattgca gaatcgattg cagccagtgc acaaacaata taaaggcttg cattcttgga 2281 aactttgaaa cccagctctc tctcttccct gacttatggg agtgctttgt gttttctggc 2341 acctttggct tctcagcagg cagctgactg aggagacttg gggtcttcct ggctcactat 2401 ctccaaagaa aaggctgaca gatggtatgc aacaggtggt ggatgttgtt gggggctcca 2461 gcctggagga aatctcacac tctacatgaa ctttaggcta ggaaaggatg tctctggggt 2521 gatgcaagga cagctgggtg tggacgctct cctgcggctc catttttttc caggagacac 2581 acaagctgcc ttgggtgaaa acaagctcag agacttgatc aacgtggacc attacctcac 2641 tgtcagacac tacagctagc tgaggagttg gaaaccttac atatatgtat atatatatgt 2701 atgtatatat gtatatatgt atatatatat gtatgtatat atgtatatat gtatatatat 2761 atgtatgtat atatgtatat atgtatatat atatgtatgt atatatgtat atattatgat 2821 gttggctgac ccccttcctc ccactctcaa tgctgtgact cagaacattt aagagaactt 2881 cgtctgtaag taatttgtct taaagccctc tgggctctct tctctgagtg agggaacttt 2941 ctgtcttcac aagggacttt gtctcattct gcctctgtta tgcaatgggt tctacagcac 3001 cctttcccgc aggttagaaa tatttcccta agacacaggg aaatgggtct tagcctgggg 3061 cctggggaaa gttcccaagc cctggctcat gaactcaatc cctgcccagg tgttttctga 3121 ggggcccttg aggccaatct tttctcaaga cagtgtgagg caccttagaa gggagaactg 3181 taacactttc tctttcgcac ctgcctctca tctcaatcct tgactgatga atttgaagtt 3241 ctactagaac catgaaaact tgttcctttc gtgcatctcc aaggagcttg ctggctctgc 3301 agccacgctt gggccctcgc accagcctgc aatgaatcag atgtctgtca cagaatctgg 3361 gcctctctga agttttctgg agagctgttg ggactcatcc agtgctccac aacgtggact 3421 tgcctcctgg tgtgttttaa aggatcctcc aggagctctg cttagccaat catcatgatg 3481 gatttttttt tttttttttg agacggagtc tcaactcttg tcgcccaggc tggaggttaa 3541 tggcatgatc tcggctcact gcaacctctg cctcccgggt tcaagcgatt ctcctgcctg 3601 agccttccga gtagctggga tcgcaggcgc ctgccaccac gcctagctaa tttctgtatt 3661 tttagtagag atggggtttc accacattgg ccaggctggt cttgacctcc tgacctaggt 3721 gatccactgc ctccatgata gattttgccc cagctggact ctgcagctcc acgtggaatc 3781 caggtgcctg cctccagtct gggaaagtca ccaacccgca gcttgtcatg tgggtaactt 3841 ctgaacccta agcc // LOCUS HUMMYCTM 2222 bp DNA PRI 07-JAN-1995 DEFINITION Human translocation-associated myc allele of SKW-3 oncogene, complete cds. ACCESSION M20605 NID g188974 KEYWORDS Myc protein. SOURCE Human leukemia T-cell line SKW-3 DNA, clone lambda-SKW22. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2222) AUTHORS Finver,S.N., Nishikura,K., Finger,L.R., Haluska,F.G., Finan,J., Nowell,P.C. and Croce,C.M. TITLE Sequence analysis of the MYC oncogene involved in the t(8;14)(q24;q11) chromosome translocation in a human leukemia T-cell line indicates that putative regulatory regions are not altered JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (9), 3052-3056 (1988) MEDLINE 88203638 FEATURES Location/Qualifiers source 1..2222 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SKW-3" /cell_type="T cell" /tissue_type="leukemia" /map="8q24" mutation 283..286 /note="cccc in SKW22 and myc; cc in SKW13" mutation 1236 /note="t in SKW22 and myc; g in SKW13" mutation 1405 /note="g in SKW22 and SKW13; c in myc" gene 1576..2142 /gene="MYC" CDS 1576..2142 /gene="MYC" /codon_start=1 /db_xref="GDB:G00-120-208" /product="myc protein" /db_xref="PID:g188975" /translation="MRGSGRLRTPELVCSRPPPPGPGRPWLPSCLEKGRASQRLGGKK NGGRDRAEYKSRFSGLYLTRCSNSSERQRERAGGRLGWKSRASRAALRASWEGRSGAN RGLRLWPSPPADPPASRPQPLPHPRNFAHSSGRALCTGTYNTRARTRLSRRGEAILPI WGHFPAAARTRFSERLSLQLLRRWIFFG" BASE COUNT 508 a 619 c 601 g 494 t ORIGIN 1666 bp upstream of XhoI site; chromosome 8q24. 1 caaggatgag aagaatgttt tttgtttttc atgccgtgga ataacacaaa ataaaaaatc 61 ccgagggaat atacattata tattaaatat agatcatttc agggagcaaa caaatcatgt 121 gtggggctgg gcaactagct gatgcgaagc gtaaataaaa tgtgaataca cgtttgcggg 181 ttacatacag tgcactttca ctagtattca gaaaaaattg tgagtcagtg aactaggaaa 241 ttaatgcctg gaaggcagcc aaattttaat tagctcaaga ctcccccccc cccccaaaaa 301 aaggcacgga agtaatactc ctctcctctt ctttgatcag aatcgatgca ttttttgtgc 361 atgaccgcat ttccaataat aaaaggggaa agaggacctg gaaaggaatt aaacgtccgg 421 tttgtccggg gaggaaagag ttaacggttt ttttcacaag ggtctctgct gactcccccg 481 gctcggtcca caagctctcc acttgcccct tttaggaagt ccggtcccgc ggttcgggta 541 ccccctgccc ctcccatatt ctcccgtcta gcacctttga tttctcccaa acccggcagc 601 ccgagactgt tgcaaaccgg cgccacaggg cgcaaagggg atttgtctct tctgaaacct 661 ggctgagaaa ttgggaactc cgtgtgggag gcgtgggggt gggacggtgg ggtacagact 721 ggcagagagc aggcaacctc cctctcgccc tagcccagct ctggaacagg cagacacatc 781 tcagggctaa acagacgcct cccgcacggg gccccacgga agcctgagca ggcggggcag 841 gaggggcggt atctgctgct ttggcagcaa attgggggac tcagtctggg tggaaggtat 901 ccaatccaga tagctgtgca tacataatgc ataatacatg actcccccca acaaatgcaa 961 tgggagttta ttcataacgc gctctccaag tatacgtggc aatgcgttgc tgggttattt 1021 taatcattct aggcatcgtt ttcctcctta tgcctctatc attcctccct atctacacta 1081 acatcccacg ctctgaacgc gcgcccatta atacccttct ttcctccact ctccctggga 1141 ctcttgatca aagcgcggcc ctttccccag ccttagcgag gcgccctgca gcctggtacg 1201 cgcgtggcgt ggcggtgggc gcgcagtgcg ttctctgtgt ggagggcagc tgttccgcct 1261 gcgatgattt atactcacag gacaaggatg cggtttgtca aacagtactg ctacggagga 1321 gcagcagaga aagggagagg gtttgagagg gagcaaaaga aaatggtagg cgcgcgtagt 1381 taattcatgc ggctctctta ctctgtttac atcctagagc tagagtgctc ggctgcccgg 1441 ctgagtctcc tccccacctt ccccaccctc ccccaccctc cccataagcg cccctcccgg 1501 gttcccaaag cagagggcgt gggggaaaag aaaaaagatc ctctctcgct aatctccgcc 1561 caccggccct ttataatgcg agggtctgga cggctgagga cccccgagct ggtctgctcg 1621 cggccgccac cgccgggccc cggccgtccc tggctcccct cctgcctcga gaagggcagg 1681 gcttctcaga ggcttggcgg gaaaaagaac ggagggaggg atcgcgctga gtataaaagc 1741 cggttttcgg ggctttatct aactcgctgt agtaattcca gcgagaggca gagggagcga 1801 gcgggcggcc ggctagggtg gaagagccgg gcgagcagag ctgcgctgcg ggcgtcctgg 1861 gaagggagat ccggagcgaa tagggggctt cgcctctggc ccagccctcc cgctgatccc 1921 ccagccagcc gtccgcaacc cttgccgcat ccacgaaact ttgcccatag cagcgggcgg 1981 gcactttgca ctggaactta caacacccga gcaaggacgc gactctcccg acgcggggag 2041 gctattctgc ccatttgggg acacttcccc gccgctgcca ggacccgctt ctctgaaagg 2101 ctctccttgc agctgcttag acgctggatt tttttcgggt agtggaaaac caggtaagca 2161 ccgaagtcca cttgcctttt aatttatttt tttatcactt taatgctgag atgagtcgaa 2221 tg // LOCUS HUMNGFBA2 5778 bp DNA PRI 07-JAN-1995 DEFINITION Human nerve growth factor beta (beta-NGF) gene, segment 2. ACCESSION M21062 M14806 NID g189201 KEYWORDS nerve growth factor. SEGMENT 2 of 2 SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5778) AUTHORS Ullrich,A., Gray,A., Berman,C., Coussens,L. and Dull,T.J. TITLE Sequence homology of human and mouse beta-NGF subunit genes JOURNAL Cold Spring Harb. Symp. Quant. Biol. 48 Pt 1, 435-442 (1983) MEDLINE 84206565 FEATURES Location/Qualifiers source 1..5778 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p13" gene join(M21061:1..2938,1..4043) /gene="NGFB" CDS 3318..4043 /gene="NGFB" /codon_start=1 /db_xref="GDB:G00-120-233" /product="nerve growth factor beta" /db_xref="PID:g189203" /translation="MSMLFYTLITAFLIGIQAEPHSESNVPAGHTIPQVHWTKLQHSL DTALRRARSAPAAAIAARVAGQTRNITVDPRLFKKRRLRSPRVLFSTQPPREAADTQD LDFEVGGAAPFNRTHRSKRSSSHPIFHRGEFSVCDSVSVWVGDKTTATDIKGKEVMVL GEVNINNSVFKQYFFETKCRDPNPVDSGCRGIDSKHWNSYCTTTHTFVKALTMDGKQA AWRFIRIDTACVCVLSRKAVRRA" mat_peptide 3681..4034 /gene="NGFB" /note="G00-120-233" /product="nerve growth factor beta" BASE COUNT 1681 a 1284 c 1436 g 1377 t ORIGIN About 2.8 kb after segment 1. 1 ggcagagcag tccctactgg cttacagaca agggacttgg caccagctcc tttgagcagg 61 gtgatatgat cagagcaggc tttaagaagc taaatggggt ggccagggat aggaggaatt 121 ggaagaagag agatgagggt ccaggatatc aggtgggaga ctgttcccat gacctgaact 181 aggggcccag agatgagaat ggaagataaa atacagtttc aagataatgc taccaaacca 241 gaaccaaaag ggctggatgg aggtcaggga ggaaaaagca gagggataag cagaggtatg 301 gagctagtgg atgaatgaat gatggttcca gtaacagaaa aaggaaactg tgaagaagcg 361 tgtttaggga gatgaggaga gaggatcaag agagatggta gctgtgaaaa ctatcatcaa 421 atggtttcaa aatggggctg ttgctattgt atgcagagaa ggaagatggg gagaagatga 481 tcctaattat gattatgagt atgaagattt ttgttttggt agtcttgcat ttaaaatgcc 541 agtatctatg gagaattgcc ctcaggaagc aggagggcag aactagcttc aaaagactag 601 ttgattagag atccaggtat gggagtcaat tgcacagaaa tttcgatggg ctccttagaa 661 gaggaggagc ccaccaaagg agagaccaag aatcgaagtg ggaacctctg gtaatgggga 721 atagggagag gtatgtccct tatgtccatc gaaacccttt tatttagtta atttcatttt 781 cctcttgaaa ttctagcaag tgttactaat ttggatataa gggaatgaat ttcagttttc 841 tttcttctcc tttaatgtat ctgataagag atattatgta ccgagaagaa caaagaagtg 901 aaacatgaaa aggagcaaaa gtaaaaggtt tagatttgct gccaactgaa tattcagtcc 961 gctaagagtt gaccggacat ttatcgcaag accattaaca gatctggtaa tggtcaaaac 1021 cccagatatg ccattgcctc ctttgcatgg tcagctggag aactggagca aacagaaaca 1081 ttcttcaaag aaaccaaaga gaaaagaatt tggtgatttc cttcttggta tctcaccaaa 1141 gagctgtcta tttccagctg aataaaagca cagtatggag gagaagaatg tgttgcctta 1201 aaacagatta gaaaagcagt gtgtagccat atgcacaaag gccaaaagaa accacacact 1261 gtgatggaaa gaaggcagat ggtgagcttc tcttgccact ctcattcatt ctttctttca 1321 ttcattctcc ttaaattttt tgagtaccta ctacatgcca gacacggcaa ctggccctgg 1381 aaatattgaa atgcaaattg tgcagcttgc aaagccctct cagggagaca gacaagttaa 1441 tgattatcca caaccctagg agaggtgata gggttatgca caaactgctg tggaatacaa 1501 gggatggagt gcccagatac acaagtggat gagacacggc tccaccctcg agtacctcac 1561 agtcggggtg agacaataaa aatggctttg aaggaggtct gagagaggta aaaccgcgtg 1621 aaagcacaaa agacaaagtc cttgaccaca aggagacaga aataactcat gatttacaca 1681 aattttgtga gaacctactt catcctaaca actactctgg gagatagtgt tttttgccca 1741 cctccgatca atgggaacgc taaggctctg ggaaattaaa tgggtttctt ctgctaacaa 1801 gtagtggagt aggaatccaa acacgatttt ctaaccccca gacccaatgc agagttcttg 1861 tcattacact gatttagtag gagagaacac acacactcac actcacatga acacacacat 1921 atgcacatac aattacgcta ttctgaagtg cagaatggca aaagatcatt tttgattgga 1981 gtgatctgag aaggcttcga ggactaggtc gtatctgtgt ggaaacggag cagaatgacc 2041 cataagagtg ggaaggagag atgggagaaa ggatagaatt ggaagtttgt agagcactct 2101 tacaggagat atttctgatc ctaccaggaa gatctatgga agctttacag aggagctgac 2161 gtttgctaca catctacaag tatgcatagg agctccgcgg aggccagtga gaggccctcc 2221 aggagcagaa ctaattccac aattacttga ccaagttggg ggattatttg tggggtaact 2281 gcagtgcagt atggagtcct cttggggaca gttagagcca taccatttga tctatagtca 2341 cataagaaca aacaataaaa agaaaagaca tgcttaagag tgaaagagaa agggagggag 2401 aaaagaagga agggtggatg gaaggacact agcttagtaa ggggtcaact ttggattcta 2461 tttctggttc agttttcatt tgtgacttca gtgctttagt gttagttcat tttctgtgaa 2521 tcagtttcct tatatgtgaa ataaatatga taaatcctaa ttgaactcac agactcatga 2581 gaagatagaa gtgaacacat tttaaaaaca tcacacaaag aggaactatt atgtggtcca 2641 catttatata tgtggggtag cgtctgaaga ggtgcctgga ctaagatggt cccagagcca 2701 caaggttttt gccaaacatg acgctttgtg aattcataac aagggctcca agtcaccaga 2761 tcttagagct gacccagtgc actgtctgaa agggggtacc agttctgagg cttcaagaca 2821 tgtccccagc agatcttccc cgtgccttcc cagaggattc aaaactgttg agcaggacgg 2881 caccatcaca tcaaggcaca agtgccaggg agaggtgtta aactctcccc accaacctcc 2941 ctggtacaca catggacact tgccacctcc ctcagccgcc ttaagcttca gagaactcaa 3001 aggactctgt aagtgatgtc tccaagctca tatcgaacta ctgggcaaaa tttcaggggc 3061 tctgtcactt cctggagaag ctcggatggg gtgaccacac atccatactg cctgagtcag 3121 ccccgggtta cgcctgttgt cccggtataa ccattgctag cacacccttt ccctctcaga 3181 agtgccccgg tttgaatgaa acctcttcgt gatccccttg gaggtcaact ctgagggacc 3241 cagaaactgc cttttgactg catttagtac tccatgaagt caccctcatt tctttttcat 3301 tccaggtgca tagcgtaatg tccatgttgt tctacactct gatcacagct tttctgatcg 3361 gcatacaggc ggaaccacac tcagagagca atgtccctgc aggacacacc atcccccaag 3421 tccactggac taaacttcag cattcccttg acactgccct tcgcagagcc cgcagcgccc 3481 cggcagcggc gatagctgca cgcgtggcgg ggcagacccg caacattact gtggacccca 3541 ggctgtttaa aaagcggcga ctccgttcac cccgtgtgct gtttagcacc cagcctcccc 3601 gtgaagctgc agacactcag gatctggact tcgaggtcgg tggtgctgcc cccttcaaca 3661 ggactcacag gagcaagcgg tcatcatccc atcccatctt ccacaggggc gaattctcgg 3721 tgtgtgacag tgtcagcgtg tgggttgggg ataagaccac cgccacagac atcaagggca 3781 aggaggtgat ggtgttggga gaggtgaaca ttaacaacag tgtattcaaa cagtactttt 3841 ttgagaccaa gtgccgggac ccaaatcccg ttgacagcgg gtgccggggc attgactcaa 3901 agcactggaa ctcatattgt accacgactc acacctttgt caaggcgctg accatggatg 3961 gcaagcaggc tgcctggcgg tttatccgga tagatacggc ctgtgtgtgt gtgctcagca 4021 ggaaggctgt gagaagagcc tgacctgccg acacgctccc tccccctgcc ccttctacac 4081 tctcctgggc ccctccctac ctcaacctgt aaattatttt aaattataag gactgcatgg 4141 taatttatag tttatacagt tttaaagaat cattatttat taaatttttg gaagcatcct 4201 gtgtgctgat gctggttatt ttttttgagt aaaatcatct gcaagtctga ggaagatgca 4261 gggggaattg tctgaagcac cccctggctc cttctaaggc ccacctgtaa cccttacccc 4321 acccccagcc agtgctgcaa cttcaggaaa ggctaactgg tttcagatat tagcctaagc 4381 caggcatgat tagcaaagga agcctgctgg attgcaactt tgttctacat tccagagcca 4441 aagctaccta atcattggat tacctgccca cggcctggaa gcatgggccc tggctcctcc 4501 cttggagaag ttagtggagc ctcatcagga ggcagatcac cactgggcat ggggctgccc 4561 tggcctcaga ggcacagtct cagtcctggg ggcactcgat agacaggaag aggcttgtac 4621 caaagatctg tgagctgctc atttgtagag ggagtgtgct gtctggagag ttgtaacaga 4681 gtaactcacg aagcctgcca aagtaagatc tagggtttca tgttcagctg gccttagtac 4741 tcgttttatc cctcagcagc ctcctgcaga aagactttca ttaccccaag aaaacattag 4801 tgcctccgac actcttgcaa cccactaaca gccacagtag caacttccag gggccccacc 4861 atcagcccac agccagggaa gccagagacg ggagagaact tctgaaaata tttttattct 4921 taattggatg atttccataa gtgaccaaag gtgactgggg ccgtgggacc tgcagagaag 4981 agggtggggc agaaagacta agtaaatgga gtaagaacca aataaacctg gagatgtgga 5041 ggacaggtgg gccggcagga accacggctt tctgcttctc ccaaacaggg gaaatgaaac 5101 aaggatgaca aagagaaggc cagaggtgga tgtgaagaag gggaggggag gacagagcag 5161 gaggaggagg gaggtgtgat ctctcatctt taccagcact ctccagcctc caggaaagtt 5221 agcaacctgg gtggttcttg tggggctttt tggatgtgct aataaacact gactccatcc 5281 agagcatctt aggagtgaag gtgacagagg gactccaggg ttcagcctgc agcagggctc 5341 cacccagcct cacacgaccc ccttcccctg caactttcat tttgtatttt ccttgaagcc 5401 acaatattcc cccagacaga gcacttatat taagacagaa aatttcccct cctatccatc 5461 ctgtctcccc ccgaggatga aatgcctaca ctggttttca agagaacagg cccaaattgt 5521 ccatccgaga aaagagcaaa tggtcttatt cattccctct gtaatcctcg ctgtccggcc 5581 ccacctccaa cgccatccaa atagggcaaa ttagaaacgt agggagcttt ggcatgagag 5641 ccatttgatt agggtagaag tatgatgaaa tgtgccctgt tttttctatc agagccactg 5701 caattggaat aaaggtttat gctcaagaaa caccagctag aacattctgt gctgagattc 5761 taagtgagat aaaggcat // LOCUS HUMNOS3A 2200 bp DNA PRI 17-AUG-1993 DEFINITION Human consitutive endothelial nitric oxide synthase gene, complete cds. ACCESSION L23210 NID g349703 KEYWORDS nitric oxide synthase. SOURCE Homo sapiens (library: Stratagene in lambda FIX) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2200) AUTHORS Robinson,L.J., Morton,C. and Michel,T. TITLE Isolation and chromosomal localization of the human endothelial nitric oxide synthase gene JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..2200 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="Stratagene in lambda FIX" /map="7q36" exon 2008..2186 CDS 2030..2191 /codon_start=1 /product="consitutive endothelial nitric oxide sythase" /db_xref="PID:g349704" /translation="MGNLKSVAQEPGPPCGLGLGLGLGLCGKQGPATPAPEPSRAPAS LLPPAPEHR" BASE COUNT 466 a 678 c 613 g 443 t ORIGIN 1 tcacctgagg tcaggagttc gagaccagcc tggtcaacat ggtgaaaccc tgtctctaat 61 aaaattataa aaattagccg ggcgtggtgg tgggtacctg taatctcagc tactcaggag 121 gctgggtcag gagaatcgct tgaacccagg aggcggaggt tacagtgagc tgagatagca 181 ccattgcatt ccagcctgga caacaaaagc gagactctgt ctcaaaaaaa aaaaaaaatt 241 agccaggcgt ggtggtgggt gcctgtcgtc ctcgggaggc tgaggcatga gaatcactcc 301 gggaggcaga ggttgcaatg aaccaagatc acaccactgc actccagcct gggtgacaga 361 gcaagactct gtctaaaaaa aaaaaaaaga cagaaggatg tcagcatctg atgctgcctg 421 tcaccttgac cctgaggatg ccagtcacag ctccattaac tgggacctag gaaaatgagt 481 catccttggt catgcacatt tcaaatggtg gcttaatatg gaagccacac ttgggatctg 541 ttgtctcctc cagcatggta gaagatgcct gaaaagtagg ggctggatcc catcccctgc 601 ctcactggga aggcgaggtg gtggggtggg gtggggcctc aggcttgggg tcatgggaca 661 aagcccaggc tgaatgccgc ccttccatct ccctcctcct gagacagggg cagcagggca 721 cactagtgtc caggagcagc ttatgaggcc ccttcaccct ccgatcctcc aaaactggca 781 gaccccacct tcttcggtgt gaccccagag ctctgagcac agcccgttcc ttccgcctgc 841 cggcccccca cccaggccca ccccaacctt atcctccact gcttttcaga ggagtctggc 901 caacacaaat cctcttgttt gtttgtctgt ctgtctgctg ctcctagtct ctgcctctcc 961 cagtctctca gcttccgttt ctttcttaaa ctttctctca gtctctgagg tctcgaaatc 1021 acgaggcttc gacccctgtg gaccagatgc ccagctagtg gcctttctcc agcccctcag 1081 atggcacaga actacaaacc ccagcatgca ctctggcctg aagtgcctgg agagtgctgg 1141 tgtaccccac ctgcattctg ggaactgtag tttccctagt cccccatgct cccaccaggg 1201 catcaagctc ttccctggcc ggctgaccct gcctcagccc tagtctctct gctgacctgc 1261 ggccccggga agcgtgcgtc actgaatgac agggtggggg tggaggcact ggaaggcagc 1321 ttcctgctct tttgtgtccc ccacttgagt catgggggtg tgggggttcc aggaaattgg 1381 ggctgggagg ggaagggata ccctaatgtc agactcaagg acaaaaagtc actacatcct 1441 tgctgggcct ctatccccaa gaacccaaaa ggactcaagg gtggggatcc aggagttctt 1501 gtatgtatgg ggggaggtga aggagagaac ctgcatgacc ctagaggtcc ctgtggtcac 1561 tgagagtgtg ggctgccatc ccctgctaca gaaacggtgc tcaccttctg cccaaccctc 1621 cagggaaagg cacacagggg tgaggccgaa ccttccgtct ggtgccacat cacagaagga 1681 cctttatgac cccctggtgg ctctaccctg ccactcccca atgccccagc ccccatgctg 1741 cagccccagg gctctgctgg acacctgggc tcccacttat cagcctcagt cctcacagcg 1801 gaacccaggc gtccggcccc ccacccttca ggccagcggg cgtggagctg aggctttaga 1861 gcctcccagc cgggcttgtt cctgtcccat tgtgtatggg ataggggcgg ggcgagggcc 1921 agcactggag agccccctcc cactgccccc tcctctcggt cccctccctc ttcctaagga 1981 aaaggccagg gctctgctgg agcaggcagc agagtggacg cacagtaaca tgggcaactt 2041 gaagagcgtg gcccaggagc ctgggccacc ctgcggcctg gggctggggc tgggccttgg 2101 gctgtgcggc aagcagggcc cagccacccc ggcccctgag cccagccggg ccccagcatc 2161 cctactccca ccagcgccag aacacaggta agggccaggc // LOCUS HUMNTF9 2800 bp DNA PRI 04-DEC-1995 DEFINITION Homo sapiens nuclear factor p97 (NTF97) gene, complete cds. ACCESSION L39793 NID g755649 KEYWORDS cytoplasmic factor; nuclear transport protein; p97 gene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2800) AUTHORS Chi,N.C., Adam,E.J. and Adam,S.A. TITLE Sequence and characterization of cytoplasmic nuclear protein import factor p97 JOURNAL J. Cell Biol. 130 (2), 265-274 (1995) MEDLINE 95340629 FEATURES Location/Qualifiers source 1..2800 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="HFBEK78 library (Stratagene)" /dev_stage="fetal" /sex="female" /tissue_type="brain" gene 53..2683 /gene="NTF97" CDS 53..2683 /gene="NTF97" /codon_start=1 /product="nuclear factor p97" /db_xref="PID:g1100994" /translation="MELITILEKTVSPDRLELEAAQKFLERAAVENLPTFLVELSRVL ANPGNSQVARVAAGLQIKNSLTSKDPDIKAQYQQRWLAIDANARREVKNYVLHTLGTE TYRPSSASQCVAGIACAEIPVNQWPELIPQLVANVTNPNSTEHMKESTLEAIGYICQD IDPEQLQDKSNEILTAIIQGMRKEEPSNNVKLAATNALLNSLEFTKANFDKESERHFI MQVVCEATQCPDTRVRVAALQNLVKIMSLYYQYMETYMGPALFAITIEAMKSDIDEVA LQGIEFWSNVCDEEMDLAIEASEAAEQGRPPEHTSKFYAKGALQYLVPILTQTLTKQD ENDDDDDWNPCKAAGVCLMLLATCCEDDIVPHVLPFIKEHIKNPDWRYRDAAVMAFGC ILEGPEPSQLKPLVIQAMPTLIELMKDPSVVVRDTAAWTVGRICELLPEAAINDVYLA PLLQCLIEGLSAEPRVASNVCWAFSSLAEAAYEAADVADDQEEPATYCLSSSFELIVQ KLLETTDRPDGHQNNLRSSAYESLMEIVKNSAKDCYPAVQKTTLVIMERLQQVLQMES HIQSTSDRIQFNDLQSLLCATLQNVLRKVQHQDALQISDVVMASLLRMFQSTAGSGGV QEDALMAVSTLVEVLGGEFLKYMEAFKPFLGIGLKNYAEYQVCLAAVGLVGDLCRALQ SNIIPFCDEVMQLLLENLGNENVHRSVKPQILSVFGDIALAIGGEFKKYLEVVLNTLQ QASQAQVDKSDYDMVDYLNELRESCLEAYTGIVQGLKGDQENVHPDVMLVQPRVEFIL SFIDHIAGDEDHTDGVVACAAGLIGDLCTAFGKDVLKLVEARPMIHELLTEGRRSKTN KAKTLARWATKELRKLKNQA" BASE COUNT 785 a 606 c 723 g 686 t ORIGIN 1 caaaggccgg gccgtcgtct taggaggagt cgccgccgcc gccacctccg ccatggagct 61 gatcaccatt ctcgagaaga ccgtgtctcc cgatcggctg gagctggaag cggcgcagaa 121 gttcctggag cgtgcggccg tggagaacct gcccactttc cttgtggaac tgtccagagt 181 gctggcaaat ccaggaaaca gtcaggttgc cagagttgca gctggtctac aaatcaagaa 241 ctctttgaca tctaaagatc cagatatcaa ggcacaatat cagcagaggt ggcttgctat 301 tgatgctaat gctcgacgag aagtcaagaa ctatgttttg cacacattgg gtacagaaac 361 ttaccggcct agttctgcct cacagtgtgt ggctggtatt gcttgtgcag agatcccagt 421 aaaccagtgg ccagaactca ttcctcagct ggtggccaat gtcacaaacc ccaacagcac 481 agagcacatg aaggagtcga cattggaagc catcggttat atttgccaag atatagaccc 541 agagcagcta caagataaat ccaatgagat tctgactgcc ataatccagg ggatgaggaa 601 agaagagcct agtaataatg tgaagctagc tgctacaaat gcactcctga actcattgga 661 gttcaccaaa gcaaactttg ataaagagtc tgaaaggcac tttattatgc aggtggtctg 721 tgaagccaca cagtgtccag atacgagggt acgagtggct gctttacaga atctggtgaa 781 gataatgtcc ttatattatc agtacatgga gacatatatg ggtcctgctc tttttgcaat 841 cacaatcgaa gcaatgaaaa gtgacattga tgaggtggct ttacaaggga tagaattctg 901 gtccaatgtc tgtgatgagg aaatggattt ggccattgaa gcttcagagg cagcagaaca 961 aggacggccc cctgagcaca ccagcaagtt ttatgcgaag ggagcactac agtatctggt 1021 tccaatcctc acacagacac taactaaaca ggacgaaaat gatgatgacg atgactggaa 1081 cccctgcaaa gcagcagggg tgtgcctcat gcttctggcc acctgctgtg aagatgacat 1141 tgtcccacat gtcctcccct tcattaaaga acacatcaag aacccagatt ggcggtaccg 1201 ggatgcagca gtgatggctt ttggttgtat cttggaagga ccagagccca gtcagctcaa 1261 accactagtt atacaggcta tgcccaccct aatagaatta atgaaagacc ccagtgtagt 1321 tgttcgagat acagctgcat ggactgtagg cagaatttgt gagctgcttc ctgaagctgc 1381 catcaatgat gtctacttgg ctcccctgct acagtgtctg attgagggtc tcagtgctga 1441 acccagagtg gcttcaaatg tgtgctgggc tttctccagt ctggctgaag ctgcttatga 1501 agctgcagac gttgctgatg atcaggaaga accagctact tactgcttat cttcttcatt 1561 tgaactcata gttcagaagc tcctagagac tacagacaga cctgatggac accagaacaa 1621 cctgaggagt tctgcatatg aatctctgat ggaaattgtg aaaaacagtg ccaaggattg 1681 ttaccctgct gtccagaaaa cgactttggt catcatggaa cgactgcaac aggttcttca 1741 gatggagtca catatccaga gcacatccga tagaatccag ttcaatgacc ttcagtcttt 1801 actctgtgca actcttcaga atgttcttcg gaaagtgcaa catcaagatg ctttgcagat 1861 ctctgatgtg gttatggcct ccctgttaag gatgttccaa agcacagctg ggtctggggg 1921 agtacaagag gatgccctga tggcagttag cacactggtg gaagtgttgg gtggtgaatt 1981 cctcaagtac atggaggcct ttaaaccctt cctgggcatt ggattaaaaa attatgctga 2041 ataccaggtt tgtttggcag ctgtgggctt agtgggagac ttgtgccgtg ccctgcaatc 2101 caacatcata cctttctgtg acgaggtgat gcagctgctt ctggaaaatt tggggaatga 2161 gaacgtccac aggtctgtga agccgcagat tctgtcagtg tttggtgata ttgcccttgc 2221 tattggagga gagtttaaaa aatacttaga ggttgtattg aatactcttc agcaggcctc 2281 ccaagcccag gtggacaagt cagactatga catggtggat tatctgaatg agctaaggga 2341 aagctgcttg gaagcctata ctggaatcgt ccagggatta aagggggatc aggagaacgt 2401 acacccggat gtgatgctgg tacaacccag agtagaattt attctgtctt tcattgacca 2461 cattgctgga gatgaggatc acacagatgg agtagtagct tgtgctgctg gactaatagg 2521 ggacttatgt acagcatttg ggaaggatgt actgaaatta gtagaagcta ggccaatgat 2581 ccatgaattg ttaactgaag ggcggagatc gaagactaac aaagcaaaaa cccttgctag 2641 atgggcaaca aaagaactga ggaaactgaa gaaccaagct tgatctgtta ccaattggga 2701 tgatagcctg aggaccccca ctggaaatct cccatctttt gaaaaacctg gagtgaagga 2761 gtgtgcacga tgctgaatgt ttgggaatgc agaggatgca // LOCUS HUMNUP358G 10677 bp DNA PRI 09-JUN-1995 DEFINITION Homo sapiens nucleoporin (NUP358) gene, complete cds. ACCESSION L41840 NID g857367 KEYWORDS nucleoporin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10677) AUTHORS Wu,J., Matunis,M.J., Kraemer,D., Blobel,G. and Coutavas,E. TITLE Nup358, a cytoplasmically exposed nucleoporin with peptide repeats, Ran-GTP binding sites, zinc fingers, a cyclophilin A homologous domain, and a leucine-rich region JOURNAL J. Biol. Chem. 270 (23), 14209-14213 (1995) MEDLINE 95294031 FEATURES Location/Qualifiers source 1..10677 /organism="Homo sapiens" /note="(vector lambda EXlox)" /db_xref="taxon:9606" /cell_line="HeLa" mRNA <114..>9788 /gene="NUP358" gene 114..9788 /gene="NUP358" CDS 114..9788 /gene="NUP358" /codon_start=1 /product="nucleoporin" /db_xref="PID:g857368" /translation="MRRSKADVERYIASVQGSTPSPRQKSMKGFYFAKLYYEAKEYDL AKKYICTYINVQERDPKAHRFLGLLYELEENTDKAVECYRRSVELNPTQKDLVLKIAE LLCKNDVTDGRAKYWLERAAKLFPGSPAIYKLKEQLLDCEGEDGWNKLFDLIQSELYV RPDDVHVNIRLVEVYRSTKRLKDAVAHCHEAERNIALRSSLEWNSCVVQTLKEYLESL QCLESDKSDWRATNTDLLLAYANLMLLTLSTRDVQESRELLQSFDSALQSVKSLGGND ELSATFLEMKGHFYMHAGSLLLKMGQHSSNVQWRALSELAALCYLIAFQVPRPKIKLI KGEAGQNLLEMMACDRLSQSGHMLLNLSRGKQDFLKEIVETFANKSGQSALYDALFSS QSPKDTSFLGSDDIGNIDVREPELEDLTRYDVGAIRAHNGSLQHLTWLGLQWNSLPAL PGIRKWLKQLFHHLPHETSRLETNAPESICILDLEVFLLGVVYTSHLQLKEKCNSHHS SYQPLCLPLPVCKQLCTERQKSWWDAVCTLIHRKAVPGNVAKLRLLVQHEINTLRAQE KHGLQPALLVHWAECLQKTGSGLNSFYDQREYIGRSVHYWKKVLPLLKIIKKKNSIPE PIDPLFKHFHSVDIQASEIVEYEEDAHITFAILDAVNGNIEDAVTAFESIKSVVSYWN LALIFHRKAEDIENDALSPEEQEECKNYLRKTRDYLIKIIDDSDSNLSVVKKLPVPLE SVKEMLNSVMQELEDYSEGGPLYKNGSLRNADSEIKRSTPSPTRYSLSPSKSYKYSPK TPPRWAEDQNSLLKMICQQVEAIKKEMQELKLNSSNSASPHRWPTENYGPDSVPDGYQ GSQTFHGAPLTVATTGPSVYYSQSPAYNSQYLLRPAANVTPTKGPVYGMNRLPPQQHI YAYPQQMHTPPVQSSSACMFSQEMYGPPALRFESPATGILSPRGDDYFNYNVQQTSTN PPLPEPGYFTKPPIAAHASRSAESKTIEFGKTNFVQPMPGEGLRPSLPTQAHTTQPTP FKFNSNFKSNDGDFTFSSPQVVTQPPPAAYSNSESLLGLLTSDKPLQGDGYSGAKPIP GGQTIGPRNTFNFGSKNVSGISFTENMGSSQQKNSGFRRSDDMFTFHGPGKSVFGTPT LETANKNHETDGGSAHGDDDDDGPHFEPVVPLPDKIEVKTGEEDEEEFFCNRAKLFRF DVESKEWKERGIGNVKILRHKTSGKIRLLMRREQVLKICANHYISPDMKLTPNAGSDR SFVWHALDYADELPKPEQLAIRFKTPEEAALFKCKFEEAQSILKAPGTNVAMASNQAV RIVKEPTSHDNKDICKSDAGNLNFEFQVAKKEGSWWHCNSCSLKNASTAKKCVSCQNL NPSNKELVGPPLAETVFTPKTSPENVQDRFALVTPKKEGHWDCSICLVRNEPTVSRCI ACQNTKSANKSGSSFVHQASFKFGQGDLPKPINSDFRSVFSTKEGQWDCSACLVQNEG SSTKCAACQNPRKQSLPATSIPTPASFKFGTSETSKTLKSGFEDMFAKKEGQWDCSSC LVRNEANATRCVACQNPDKPSPSTSVPAPASFKFGTSETSKAPKSGFEGMFTKKEGQW DCSVCLVRNEASATKCIACQNPGKQNQTTSAVSTPASSETSKAPKSGFEGMFTKKEGQ WDCSVCLVRNEASATKCIACQNPGKQNQTTSAVSTPASSETSKAPKSGFEGMFTKKEG QWDCSVCLVRNEASATKCIACQCPSKQNQTTAISTPASSEISKAPKSGFEGMFIRKGQ WDCSVCCVQNESSSLKCVACDASKPTHKPIAEAPSAFTLGSEMKLHDSSGSQVGTGFK SNFSEKASKFGNTEQGFKFGHVDQENSPSFMFQGSSNTEFKSTKEGFSIPVSADGFKF GISEPGNQEKKSEKPLENGTGFQAQDISGQKNGRGVIFGQTSSTFTFADLAKSTSGEG FQFGKKDPNFKGFSGAGEKLFSSQYGKMANKANTSGDFEKDDDAYKTEDSDDIHFEPV VQMPEKVELVTGEEDEKVLYSQRVKLFRFDAEVSQWKERGLGNLKILKNEVNGKLRML MRREQVLKVCANHWITTTMNLKPLSGSDRAWMWLASDFSDGDAKLEQLAAKFKTPELA EEFKQKFEECQRLLLDIPLQTPHKLVDTGRAAKLIQRAEEMKSGLKDFKTFLTNDQTK VTEEENKGSGTGAAGASDTTIKPNPENTGPTLEWDNYDLREDALDDSVSSSSVHASPL ASSPVRKNLFRFGESTTGFNFSFKSALSPSKSPAKLNQSGTSVGTDEESDVTQEEERD GQYFEPVVPLPDLVEVSSGEENEQVVFSHRAKLYRYDKDVGQWKERGIGDIKILQNYD NKQVRIVMRRDQVLKLCANHRITPDMTLQNMKGTERVWLWTACDFADGERKVEHLAVR FKLQDVADSFKKIFDEAKTAQEKDSLITPHVSRSSTPRESPCGKIAVAVLEETTRERT DVIQGDDVADATSEVEVSSTSETTPKAVVSPPKFVFGSESVKSIFSSEKSKPFAFGNS SATGSLFGFSFNAPLKSNNSETSSVAQSGSESKVEPKKCELSKNSDIEQSSDSKVKNL FASFPTEESSINYTFKTPEKAKEKKKPEDSPSDDDVLIVYELTPTAEQKALATKLKLP PTFFCYKNRPDYVSEEEEDDEDFETAVKKLNGKLYLDGSEKCRPLEENTADNEKECII VWEKKPTVEEKAKADTLKLPPTFFCGVCSDTDEDNGNGEDFQSELQKVQEAQKSQTEE ITSTTDSVYTGGTEVMVPSFCKSEEPDSITKSISSPSVSSETMDKPVDLSTRKEIDTD STSQGESKIVSFGFGSSTGLSFADLASSNSGDFAFGSKDKNFQWANTGAAVFGTQSVG TQSAGKVGEDEDGSDEEVVHNEDIHFEPIVSLPEVEVKSGEEDEEILFKERAKLYRWD RDVSQWKERGVGDIKILWHTMKNYYRILMRRDQVFKVCANHVITKTMELKPLNVSNNA LVWTASDYADGEAKVEQLAVRFKTKEVADCFKKTFEECQQNLMKLQKGHVSLAAELSK ETNPVVFFDVCADGEPLGRITMELFSNIVPRTAENFRALCTGEKGFGFKNSIFHRVIP DFVCQGGDITKHDGTGGQSIYGDKFEDENFDVKHTGPGLLSMANQGQNTNNSQFVITL KKAEHLDFKHVVFGFVKDGMDTVKKIESFGSPKGSVCRRITITECGQI" BASE COUNT 3475 a 1907 c 2331 g 2964 t ORIGIN 1 ggatcgaatc gcggccgcgt cgacggtttg caggcgcttt cctcttggaa gtggcgactg 61 ctgcgggcct gagcgctggt ctcacgcgcc tcgggagcca ggttggcggc gcgatgaggc 121 gcagcaaggc tgacgtggag cggtacatcg cctcggtgca gggctccacc ccgtcgcctc 181 gacagaagtc aatgaaagga ttctattttg caaagctgta ttatgaagct aaagaatatg 241 atcttgctaa aaaatacata tgtacttaca ttaatgtgca agagagggat cccaaagctc 301 acagatttct gggtcttctt tatgaattgg aagaaaacac agacaaagcc gttgaatgtt 361 acaggcgttc agtggaatta aacccaacac aaaaagatct tgtgttgaag attgcagaat 421 tgctttgtaa aaatgatgtt actgatggaa gagcaaaata ctggcttgaa agagcagcca 481 aacttttccc aggaagtcct gcaatttata aactaaagga acagcttcta gattgtgaag 541 gtgaagatgg atggaataaa ctttttgact tgattcagtc agaactttat gtaagacctg 601 atgacgtcca tgtgaacatc cggctagtgg aggtgtatcg ctcaactaaa agattgaagg 661 atgctgtggc ccactgccat gaggcagaga ggaacatagc tttgcgttca agtttagaat 721 ggaattcgtg tgttgtacag acccttaagg aatatctgga gtctttacag tgtttggagt 781 ctgataaaag tgactggcga gcaaccaata cagacttact gctggcctat gctaatctta 841 tgcttcttac gctttccact agagatgtgc aggaaagtag agaattactg caaagttttg 901 atagtgctct tcagtctgtg aaatctttgg gtggaaatga tgaactgtca gctactttct 961 tagaaatgaa aggacatttc tacatgcatg ctggttctct gcttttgaag atgggtcagc 1021 atagtagtaa tgttcaatgg cgagctcttt ctgagctggc tgcattgtgc tatctcatag 1081 catttcaggt tccaagacca aagattaaat taataaaagg tgaagctgga caaaatctgc 1141 tggaaatgat ggcctgtgac cgactgagcc aatcagggca catgttgcta aacttaagtc 1201 gtggcaagca agatttttta aaagagattg ttgaaacttt tgccaacaaa agcgggcagt 1261 ctgcattata tgatgctctg ttttctagtc agtcacctaa ggatacatct tttcttggta 1321 gcgatgatat tggaaacatt gatgtacgag aaccagagct tgaagatttg actagatacg 1381 atgttggtgc tattcgagca cataatggta gtcttcagca ccttacttgg cttggcttac 1441 agtggaattc attgcctgct ttacctggaa tccgaaaatg gctaaaacag cttttccatc 1501 atttgcccca tgaaacctca aggcttgaaa caaatgcacc tgaatcaata tgtattttag 1561 atcttgaagt atttctcctt ggagtagtat ataccagcca cttacaatta aaggagaaat 1621 gtaattctca ccacagctcc tatcagccgt tatgcctgcc ccttcctgtg tgtaaacagc 1681 tttgtacaga aagacaaaaa tcttggtggg atgcggtttg tactctgatt cacagaaaag 1741 cagtacctgg aaacgtagca aaattgagac ttctagttca gcatgaaata aacactctaa 1801 gagcccagga aaaacatggc cttcaacctg ctctgcttgt acattgggca gaatgccttc 1861 agaaaacggg cagcggtctt aattcttttt atgatcaacg agaatacata gggagaagtg 1921 ttcattattg gaagaaagtt ttgccattgt tgaagataat aaaaaagaag aacagtattc 1981 ctgaacctat tgatcctctg tttaaacatt ttcatagtgt agacattcag gcatcagaaa 2041 ttgttgaata tgaagaagac gcacacataa cttttgctat attggatgca gtaaatggaa 2101 atatagaaga tgctgtgact gcttttgaat ctataaaaag tgttgtttct tattggaatc 2161 ttgcactgat ttttcacagg aaggcagaag acattgaaaa tgatgccctt tctcctgaag 2221 aacaagaaga atgcaaaaat tatctgagaa agaccaggga ctacctaata aagattatag 2281 atgacagtga ttcaaatctt tcagtggtca agaaattgcc tgtgcccctg gagtctgtaa 2341 aagagatgct taattcagtc atgcaggaac tcgaagacta tagtgaagga ggtcctctct 2401 ataaaaatgg ttctttgcga aatgcagatt cagaaataaa acgttctaca ccgtctccta 2461 ccagatattc actatcacca agtaaaagtt acaagtattc tcccaaaaca ccacctcgat 2521 gggcagaaga tcagaattct ttactgaaaa tgatttgcca acaagtagag gccattaaga 2581 aagaaatgca ggagttgaaa ctaaatagca gtaactcagc atcccctcat cgttggccca 2641 cagagaatta tggaccagac tcggtgcctg atggatatca ggggtcacag acatttcatg 2701 gggctccact aacagttgca actactggcc cttcagtata ttatagtcag tcaccagcat 2761 ataattccca gtatcttctc agaccagcag ctaatgttac tcccacaaag ggcccagtct 2821 atggcatgaa taggcttcca ccccaacagc atatttatgc ctatccgcaa cagatgcaca 2881 caccgccagt gcaaagctca tctgcttgta tgttctctca ggagatgtat ggtcctcctg 2941 cattgcgttt tgagtctcct gcaacgggaa ttctatcgcc caggggtgat gattacttta 3001 attacaatgt tcaacagaca agcacaaatc cacctttgcc agaaccagga tatttcacaa 3061 aacctccgat tgcagctcat gcttcaagat ctgcagaatc taagactata gaatttggga 3121 aaactaattt tgttcagccc atgccgggtg aaggattaag gccatctttg ccaacacaag 3181 cacacacaac acagccaact ccttttaaat ttaactcaaa tttcaaatca aatgatggtg 3241 acttcacgtt ttcctcacca caggttgtga cacagccccc tcctgcagct tacagtaaca 3301 gtgaaagcct tttaggtctc ctgacttcag ataaaccctt gcaaggagat ggctatagtg 3361 gagccaaacc aattcctggt ggtcaaacca ttgggcctcg aaatacattc aattttggaa 3421 gcaaaaatgt gtctggaatt tcatttacag aaaacatggg gtcgagtcag caaaagaatt 3481 ctggttttcg gcgaagtgat gatatgttta ctttccatgg tccagggaaa tcagtatttg 3541 gaacacccac tttagagaca gcaaacaaga atcatgagac agatggagga agtgcccatg 3601 gggatgatga tgatgacggt cctcactttg agcctgtagt acctcttcct gataagattg 3661 aagtaaaaac tggtgaggaa gatgaagaag aattcttttg caaccgcgcg aaattgtttc 3721 gtttcgatgt agaatccaaa gaatggaaag aacgtgggat tggcaatgta aaaatactga 3781 ggcataaaac atctggtaaa attcgccttc taatgagacg agagcaagta ttgaaaatct 3841 gtgcaaatca ttacatcagt ccagatatga aattgacacc aaatgctgga tcagacagat 3901 cttttgtatg gcatgccctt gattatgcag atgagttgcc aaaaccagaa caacttgcta 3961 ttaggttcaa aactcctgag gaagcagcac tttttaaatg caagtttgaa gaagcccaga 4021 gcattttaaa agccccagga acaaatgtag ccatggcgtc aaatcaggct gtcagaattg 4081 taaaagaacc cacaagtcat gataacaagg atatttgcaa atctgatgct ggaaacctga 4141 attttgaatt tcaggttgca aagaaagaag ggtcttggtg gcattgtaac agctgctcat 4201 taaagaatgc ttcaactgct aagaaatgtg tatcatgcca aaatctaaac ccaagcaata 4261 aagagctcgt tggcccacca ttagctgaaa ctgtttttac tcctaaaacc agcccagaga 4321 atgttcaaga tcgatttgca ttggtgactc caaagaaaga aggtcactgg gattgtagta 4381 tttgtttagt aagaaatgaa cctactgtat ctaggtgcat tgcgtgtcag aatacaaaat 4441 ctgctaacaa aagtggatct tcatttgttc atcaagcttc atttaaattt ggccagggag 4501 atcttcctaa acctattaac agtgatttca gatctgtttt ttctacaaag gaaggacagt 4561 gggattgcag tgcatgtttg gtacaaaatg aggggagctc tacaaaatgt gctgcttgtc 4621 agaatccgag aaaacagagt ctacctgcta cttctattcc aacacctgcc tcttttaagt 4681 ttggtacttc agagacaagt aaaactctaa aaagtggatt tgaagacatg tttgctaaga 4741 aggaaggaca gtgggattgc agttcatgct tagtgcgaaa tgaagcaaat gctacaagat 4801 gtgttgcttg tcagaatccg gataaaccaa gtccatctac ttctgttcca gctcctgcct 4861 cttttaagtt tggtacttca gagacaagca aggctccaaa gagcggattt gagggaatgt 4921 tcactaagaa ggagggacag tgggattgca gtgtgtgctt agtaagaaat gaagccagtg 4981 ctaccaaatg tattgcttgt cagaatccag gtaaacaaaa tcaaactact tctgcagttt 5041 caacacctgc ctcttcagag acaagcaagg ctccaaagag cggatttgag ggaatgttca 5101 ctaagaagga gggacagtgg gattgcagtg tgtgcttagt aagaaatgaa gccagtgcta 5161 ccaaatgtat tgcttgtcag aatccaggta aacaaaatca aactacttct gcagtttcaa 5221 cacctgcctc ttcagagaca agcaaggctc caaagagcgg atttgaggga atgttcacta 5281 agaaggaagg acagtgggat tgcagtgtgt gcttagtaag aaatgaagcc agtgctacca 5341 aatgtattgc ttgtcagtgt ccaagtaaac aaaatcaaac aactgcaatt tcaacacctg 5401 cctcttcgga gataagcaag gctccaaaga gtggatttga aggaatgttc atcaggaaag 5461 gacagtggga ttgtagtgtt tgctgtgtac aaaatgagag ttcttcctta aaatgtgtgg 5521 cttgtgatgc ctctaaacca actcataaac ctattgcaga agctccttca gctttcacac 5581 tgggctcaga aatgaagttg catgactctt ctggaagtca ggtgggaaca ggatttaaaa 5641 gtaatttctc agaaaaagct tctaagtttg gcaatacaga gcaaggattc aaatttgggc 5701 atgtggatca agaaaattca ccttcattta tgtttcaggg ttcttctaat acagaattta 5761 agtcaaccaa agaaggattt tccatccctg tgtctgctga tggatttaaa tttggcattt 5821 cggaaccagg aaatcaagaa aagaaaagtg aaaagcctct tgaaaatggt actggcttcc 5881 aggctcagga tattagtggc cagaagaatg gccgtggtgt gatttttggc caaacaagta 5941 gcacttttac atttgcagat cttgcaaaat caacttcagg agaaggattt cagtttggca 6001 aaaaagaccc caatttcaag ggattttcag gtgctggaga aaaattattc tcatcacaat 6061 acggtaaaat ggccaataaa gcaaacactt ccggtgactt tgagaaagat gatgatgcct 6121 ataagactga ggacagcgat gacatccatt ttgaaccagt agttcaaatg cccgaaaaag 6181 tagaacttgt aacaggagaa gaagatgaaa aagttctgta ttcacagcgg gtaaaactat 6241 ttagatttga tgctgaggta agtcagtgga aagaaagggg cttggggaac ttaaaaattc 6301 tcaaaaacga ggtcaatggc aaactaagaa tgctgatgcg aagagaacaa gtactaaaag 6361 tgtgtgctaa tcattggata acgactacga tgaacctgaa gcctctctct ggatcagata 6421 gagcatggat gtggttagcc agtgatttct ctgatggtga tgccaaacta gagcagttgg 6481 cagcaaaatt taaaacacca gagctggctg aagaattcaa gcagaaattt gaggaatgcc 6541 agcggcttct gttagacata ccacttcaaa ctccccataa acttgtagat actggcagag 6601 ctgccaagtt aatacagaga gctgaagaaa tgaagagtgg actgaaagat ttcaaaacat 6661 ttttgacaaa tgatcaaaca aaagtcactg aggaagaaaa taagggttca ggtacaggtg 6721 cggccggtgc ctcagacaca acaataaaac ccaatcctga aaacactggg cccacattag 6781 aatgggataa ctatgattta agggaagatg ctttggatga tagtgtcagt agtagctcag 6841 tacatgcttc tccattggca agtagccctg tgagaaaaaa tcttttccgt tttggtgagt 6901 caacaacagg atttaacttc agttttaaat ctgctttgag tccatctaag tctcctgcca 6961 agttgaatca gagtgggact tcagttggca ctgatgaaga atctgatgtt actcaagaag 7021 aagagagaga tggacagtac tttgaacctg ttgttccttt acctgatcta gttgaagtat 7081 ccagtggtga ggaaaatgaa caagttgttt ttagtcacag ggcaaaactc tacagatatg 7141 ataaagatgt tggtcaatgg aaagaaaggg gcattggtga tataaagatt ttacagaatt 7201 atgataataa gcaagttcgt atagtgatga gaagggacca agtattaaaa ctttgtgcca 7261 atcacagaat aactccagac atgactttgc aaaatatgaa agggacagaa agagtatggt 7321 tgtggactgc atgtgatttt gcagatggag aaagaaaagt agagcattta gctgttcgtt 7381 ttaaactaca ggatgttgca gactcgttta agaaaatttt tgatgaagca aaaacagccc 7441 aggaaaaaga ttctttgata acacctcatg tttctcggtc aagcactccc agagagtcac 7501 catgtggcaa aattgctgta gctgtattag aagaaaccac aagagagagg acagatgtta 7561 ttcagggtga tgatgtagca gatgcaactt cagaagttga agtgtctagc acatctgaaa 7621 caacaccaaa agcagtggtt tctcctccaa agtttgtatt tggttcagag tctgttaaaa 7681 gcatttttag tagtgaaaaa tcaaaaccat ttgcattcgg caacagttca gccactgggt 7741 ctttgtttgg atttagtttt aatgcacctt tgaaaagtaa caatagtgaa actagttcag 7801 tagcccagag tggatctgaa agcaaagtgg aacctaaaaa atgtgaactg tcaaagaact 7861 ctgatatcga acagtcttca gatagcaaag tcaaaaatct ctttgcttcc tttccaacgg 7921 aagaatcttc aatcaactac acatttaaaa caccagaaaa ggcaaaagag aagaaaaaac 7981 ctgaagattc tccctcagat gatgatgttc tcattgtata tgaactaact ccaaccgctg 8041 agcagaaagc ccttgcaacc aaacttaaac ttcctccaac tttcttctgc tacaagaata 8101 gaccagatta tgttagtgaa gaagaggagg atgatgaaga tttcgaaaca gctgtcaaga 8161 aacttaatgg aaaactatat ttggatggct cagaaaaatg tagacccttg gaagaaaata 8221 cagcagataa tgagaaagaa tgtattattg tttgggaaaa gaaaccaaca gttgaagaga 8281 aggcaaaagc agatacgtta aaacttccac ctacattttt ttgtggagtc tgtagtgata 8341 ctgatgaaga caatggaaat ggggaagact ttcaatcaga gcttcaaaaa gttcaggaag 8401 ctcaaaaatc tcagacagaa gaaataacta gcacaactga cagtgtatat acaggtggga 8461 ctgaagtgat ggtaccttct ttctgtaaat ctgaagaacc tgattctatt accaaatcca 8521 ttagttcacc atctgtttcc tctgaaacta tggacaaacc tgtagatttg tcaactagaa 8581 aggaaattga tacagattct acaagccaag gggaaagcaa gatagtttca tttggatttg 8641 gaagtagcac agggctctca tttgcagact tggcttccag taattctgga gattttgctt 8701 ttggttctaa agataaaaat ttccaatggg caaatactgg agcagctgtg tttggaacac 8761 agtcagtcgg aacccagtca gccggtaaag ttggtgaaga tgaagatggt agtgatgaag 8821 aagtagttca taatgaagat atccattttg aaccaatagt gtcactacca gaggtagaag 8881 taaaatctgg agaagaagat gaagaaattt tgtttaaaga gagagccaaa ctttatagat 8941 gggatcggga tgtcagtcag tggaaggagc gcggtgttgg agatataaag attctttggc 9001 atacaatgaa gaattattac cggatcctaa tgagaagaga ccaggttttt aaagtgtgtg 9061 caaaccacgt tattactaaa acaatggaat taaagccctt aaatgtttca aataatgctt 9121 tagtttggac tgcctcagat tatgctgatg gagaagcaaa agtagaacag cttgcagtga 9181 gatttaaaac taaagaagta gctgattgtt tcaagaaaac atttgaagaa tgtcagcaga 9241 atttaatgaa actccagaaa ggacatgtat cactggcagc agaattatca aaggagacca 9301 atcctgtggt gttttttgat gtttgtgcgg acggtgaacc tctagggcgg ataactatgg 9361 aattattttc aaacattgtt cctcggactg ctgagaactt cagagcacta tgcactggag 9421 agaaaggctt tggtttcaag aattccattt ttcacagagt aattccagat tttgtttgcc 9481 aaggaggaga tatcaccaaa catgatggaa caggcggaca gtccatttat ggagacaaat 9541 ttgaagatga aaattttgat gtgaaacata ctggtcctgg tttactatcc atggccaatc 9601 aaggccagaa taccaataat tctcaatttg ttataacact gaagaaagca gaacatttgg 9661 actttaagca tgtagtattt gggtttgtta aggatggcat ggatactgtg aaaaagattg 9721 aatcatttgg ttctcccaaa gggtctgttt gtcgaagaat aactatcaca gaatgtggac 9781 agatataaaa tcattgttgt tcatagaaaa tttcatctgt ataagcagtt ggattgaagc 9841 ttagctatta caatttgata gttatgttca gcttttgaaa atggacgttt ccgatttaca 9901 aatgtaaaat tgcagcttat agctgttgtc actttttaat gtgttataat tgaccttgca 9961 tggtgtgaaa taaaagttta aacactggtg tatttcaggt gtacttgtgt ttatgtactc 10021 ctgacgtatt aaaatggaat aatactaatc ttgttaaaag caatagacct caaactattg 10081 aaggaatatg atatatgcaa tttaatttta attcctttta agatatttgg acttcctgca 10141 tggatatact taccatttga ataaagggac cacaacttgg ataatttaat tttaggtttg 10201 aaatatattt ggtaatctta actattggtg tactcattta tgcatagaga ctcgtttatg 10261 aatgggtaga gccacagaac gtatagagtt aaccaaagtg ctcttctcta gaatctttac 10321 acctcctgtg tggttacaag ttaactttgt aagtagcgta ccttccttcc ttaaaatatc 10381 tagcttcctg tgccctttca tagatattcg attaattttt acattttaaa caagttgact 10441 atttccttta ggggttttgt ttcaaacttt tctgtcatct gtctctacta cctcagaaac 10501 tgcagcttgg ttctgatgat agaaattgaa tttttccttg tagttattgt gataaagtat 10561 gaatattttt agaaagtcta taccatgttc tttcgttaaa gatttgcttt atacaagatt 10621 gttgcagtac ctttttctgg taaattttgt agcagaaata aaatgacaat tcctaag // LOCUS HUMORLMHC 1990 bp DNA PRI 25-NOV-1996 DEFINITION Human olfactory receptor-like gene, complete cds. ACCESSION L35475 NID g1041044 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1990) AUTHORS Fan,W., Cai,W., Parimoo,S., Lennon,G.G. and Weissman,S.M. TITLE Identification of seven new human MHC class I region genes around the HLA-F locus JOURNAL Immunogenetics 44 (2), 97-103 (1996) MEDLINE 96269983 REFERENCE 2 (bases 1 to 1990) AUTHORS Fan,W.-F. TITLE Direct Submission JOURNAL Submitted (27-SEP-1995) Wufang Fan, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA FEATURES Location/Qualifiers source 1..1990 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="FAT11" /map="6p21" /tissue_lib="YAC A146G11" CDS 500..1450 /codon_start=1 /product="olfactory receptor-like protein" /db_xref="PID:g601919" /translation="MDNQSSTPGFLLLGFSEHPGLGRTLFVDVITSYLLTLVGNTLII LLSALDTKLHSPMYFFLSNLSFLDLCFTTSCVPQMLANLWGPKKTISFLDCSVQIFIF LSLGTTECILMKVMAFDRYVAVCQPLHYATIIHPRLCWQLASVAWVIGLVGSVVQTPS TLHLPFCPDRQVDDFVCEVPALIRLSCEDTSYNEIQVAVASVFILVVPLSLILVSYGA ITWAVLRINSATAWRKAFGTCSSHLTVVTLFYSSVIAVYLQPKNPYAQGRGKFFGLFY AVGTPSLNPLVYTLRNKEIKRALRRLLGKERDSRESWRAA" BASE COUNT 508 a 535 c 440 g 507 t ORIGIN 1 tttacagagt tctataaaat tcattcaacc aatagagcaa taattgagcc tacagagaca 61 acttatcaga aaattcattc aatatacctt acgagatcat ccaatagata agagacaact 121 ctagaacagc attcagaaca tagtggcact caataaattt cccctgaatg aatgaattaa 181 tgaattagtg catattttaa tcagcctcct ttgccctcac ccaggaagtc agaggcacca 241 gtgtgagtat ccatctgctg tccagtacat tcatggattc ctcactctca ctagacaatg 301 tttgaccagg aagaacaggg aatgagaagg agctgctggg tggtgatgag ccttggaaag 361 ggaggctggg cgagcagaga cagaagagaa acacctacct gctgtgacct cacaaacacc 421 caggctgagt tttgataaga caggttgaat cacactgggg tgacagcctc atccctccag 481 gtacaaacaa gaacaggcca tggataacca aagctccaca ccgggcttcc tccttctggg 541 cttctctgaa cacccagggc tgggaaggac tctcttcgtg gatgtcatca cttcctacct 601 cctaacccta gtgggcaaca cactcatcat cctgctgtct gcgctggaca ccaagctcca 661 ctctccaatg tactttttcc tctccaacct ctccttcttg gacctctgtt tcaccacgag 721 ttgtgttccc caaatgctgg ccaacctctg gggcccaaag aagaccatca gcttcctgga 781 ctgctctgtc cagatcttca tcttcctgtc cctggggaca actgagtgca tcctcatgaa 841 agtgatggct tttgatcgct acgtggctgt ctgccagccc ctccactatg ccaccatcat 901 ccacccccgc ctgtgctggc agctggcatc tgtggcctgg gtcattgggc tagtggggtc 961 agtggtccag acaccatcca ccctgcacct gcccttctgc cccgatcggc aggtggatga 1021 ttttgtctgt gaggtcccag ctctaattcg actctcctgt gaagacacct cctacaatga 1081 gatccaggtg gctgttgcca gtgtcttcat cttggttgtg cctctcagcc tcatccttgt 1141 ctcttacgga gccattacct gggcagtgct gaggattaac tccgccacag catggagaaa 1201 ggcctttggg acctgctcct cccatctcac tgtggtcacc ctcttctaca gctcagtcat 1261 tgctgtctac ctccagccca aaaatccgta tgcccaaggg aggggcaagt tctttggtct 1321 cttctatgca gtgggcactc cttcacttaa ccctctcgta tacaccctga ggaacaagga 1381 gataaagcga gcactcagga ggttactagg gaaggaaaga gactccaggg aaagctggag 1441 agctgcttaa tatactttcg aaagtaagaa gagtttcttc aagatttatg aacatgttaa 1501 gttttccaga ctactaccct tcccacatac aacctggagc cactgtgggg gggtcacagg 1561 gtgggtatgt tatctatgag agggagaatg agaaagagag ggacagagag ataaaagaat 1621 ttgggtgaga ggagataggt agctccataa ggcacacaaa ttcagatatt atcattccta 1681 tcactgtcca tccttaatat ttctatcctc cattctgtcc tatttactgt catcactcct 1741 atagattccc taactccacc atgcctatct ctggttatat aattgctctc caatggtcat 1801 gtcagtgtag gggaactact ccatcatagc attctggaca cctcgcatgt atctacgtag 1861 gtcatgtcag cacaggcttg aaggaacagc tactctgaga tttaggaaga atgcttctgg 1921 gatcccccct cggcaatatg agaggatcgg gaggcccctt caggaacctg cctcaaatgc 1981 caccttctca // LOCUS HUMOTCEX1 815 bp DNA PRI 17-JAN-1992 DEFINITION Human ornithine transcarbamylase (OTC) gene, 5'-end region. ACCESSION D00095 NID g219960 KEYWORDS OTC; ornithine transcarbamylase. SOURCE Human peripheral white blood cell DNA, clone phOTC8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 815) AUTHORS Hata,A., Tsuzuki,T., Shimada,K., Takiguchi,M., Mori,M. and Matsuda,I. TITLE Isolation and characterization of the human ornithine transcarbamylase gene: structure of the 5'-end region JOURNAL J. Biochem. 100 (3), 717-725 (1986) MEDLINE 87057134 COMMENT In [J. Biochem. 100, 717-725 (1986)], they sequenced the 5'-end region of the OTC gene and found that it covered 665 bp of the 5'-flanking region, the complete first exon and a part of the first intron (150 bp). FEATURES Location/Qualifiers source 1..815 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 337..340 /note="CAAT box (putative)" TATA_signal 389..394 /note="TATA box (putative)" CDS 589..669 /partial /note="ornithine transcarbamylase (OTC), exon 1" /codon_start=1 /db_xref="PID:d1000502" /db_xref="PID:g219961" /translation="MLFNLRILLNNAAFRNGHNFMVRNFR" intron 666..>815 /note="OTC cds intron 1" BASE COUNT 230 a 166 c 168 g 251 t ORIGIN BanII site. 1 gagccccagg actgagatat ttttactata ccttctctat catcttgcac ccccaaaata 61 gcttccaggg cacttctatt tgtttttgtg gaaagactgg caattagagg tagaaaagtg 121 aaataaatgg aaatagtact actcagggct gtcacatcta catctgtgtt tttgcagtgc 181 caatttgcat tttctgagtg agttacttct actcaccttc acagcagcca gtaccgcagt 241 gccttgcata tattatatcc tcaatgagta cttgtcaatt gattttgtac atgcgtgtga 301 cagtataaat atattatgaa aaatgaggag gccaggcaat aaaagagtca ggatttcttc 361 caaaaaaaat acacagcggt ggagcttggc ataaagttca aatgctccta caccctgccc 421 tgcagtatct ctaaccaggg gactttgata aggaagctga agggtgatat tacctttgct 481 ccctcactgc aactgaacac atttcttagt ttttaggtgg cccccgctgg ctaacttgct 541 gtggagtttt caagggcata gaatcgtcct ttacacaatt aaaagaagat gctgtttaat 601 ctgaggatcc tgttaaacaa tgcagctttt agaaatggtc acaacttcat ggttcgaaat 661 tttcggtaag tgatggtcag agacttgggt ttgatttagg aatcatggtg atgcataaaa 721 ctatattctg cagtaaggcc tctttctgca gaatgtagtg ccacgctctg ctttactctt 781 atttgagaca gctgcctcta attccagcaa agctt // LOCUS HUMPCD 1820 bp DNA PRI 24-SEP-1992 DEFINITION Human potassium channel protein (HPCN3) gene, complete cds. ACCESSION M55515 NID g189672 KEYWORDS potassium channel protein. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Philipson,L.H., LaMendola,J., Bell,G.I. and Steiner,D.F. TITLE Genomic sequence of a human potassium channel related to RCK3 JOURNAL Unpublished (1990) REFERENCE 2 (bases 1 to 1820) AUTHORS Philipson,L.H., Hice,R.E., Schaefer,K., LaMendola,J., Bell,G.I. and Nelson,D.J. TITLE Sequence and functional expression in Xenopus oocytes of a human insulinoma and islet potassium channel JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 53-57 (1991) MEDLINE 91095456 FEATURES Location/Qualifiers source 1..1820 /organism="Homo sapiens" /db_xref="taxon:9606" gene 126..1697 /gene="HPCN3" CDS 126..1697 /gene="HPCN3" /codon_start=1 /function="ion transport" /product="potassium channel protein" /db_xref="PID:g189673" /translation="MTVVPGDHLLEPEVADGGGGPPQGGCGGGGCDRYEPVPPSLPAA GEQDCCGERVVINISGLRFETQLKTLCQFPETLLGDPKRRMRYFDPVRNEYFFDRNRP SFDAILYYYQSGGRIRRPVNVPIDIFSEEIRFYQLGEEAMEKFREDEGFLREEERPLP RRDFQRQVWLLFEYPESSGPARGIAIVSVLVILISIVIFCLETLPEFRDEKDYPASTS QDSFEAAGNSTSGSRAGASSFSDPFFVVETLCIIWFSFELLVRFFACPSKATFSRNIM NLIDIVAIIPYFITLGTELAERQGNGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQI LGQSLKASMRELGLLIFFLFIGVILFSSAVYFAEADDPTSGFSSIPDAFWWAVVTMTT VGYGDMHPVTIGGKIVGSLCAIAGVLSIALPVPVIVSNFNYFYHRETEGEEQSQYMHV GSCQHLSSSAEELRKARSNSTLSKSEYMVIEEGGMNHTAFPQTPFKTGNSTATCTTNN NPNSCVNIKKIFTDV" BASE COUNT 347 a 581 c 520 g 372 t ORIGIN 1 gccgccgccc tcagcccgcc accgcgccca ccctcctcag cgcccagcga gcagcggcgg 61 tgcccacacg ctggtgaacc acggctacgc ggagccccgc cgcaggccgc gagctgccgc 121 ccgacatgac cgtggtgccc ggggaccacc tgctggagcc ggaggtggcc gatggtggag 181 ggggcccgcc tcaaggcggc tgtggcggcg gcggctgcga ccgctacgag cccgtgccgc 241 cctcactgcc ggccgcgggc gagcaggact gctgcgggga gcgcgtggtc atcaacatct 301 ccgggctgcg cttcgagacg cagctgaaga ccctttgcca gttccccgag acgctgctgg 361 gcgaccccaa gcggcgcatg aggtacttcg accccgtccg caacgagtac ttcttcgacc 421 gcaaccggcc cagcttcgac gccatcctct actactatca gtccgggggc cgcatccgcc 481 ggccggtcaa cgtgcccatc gacattttct ccgaggagat ccgcttctac cagctgggcg 541 aggaggccat ggagaagttc cgcgaggacg agggcttcct gcgggaggag gagcggccct 601 tgccccgccg cgacttccag cgccaggtgt ggctgctctt cgagtacccc gagagctccg 661 ggccggcccg gggcatcgcc atcgtgtccg tgctggtcat cctcatctcc attgtcatct 721 tctgcctgga gacgctgccg gagttccgcg acgagaagga ctaccccgcc tcgacgtcgc 781 aggactcatt cgaagcagcc ggcaacagca cgtcggggtc ccgcgcagga gcctccagct 841 tctccgatcc cttcttcgtg gtggagacgc tgtgcatcat ctggttctcc ttcgaactgc 901 tggtgcggtt cttcgcttgt cctagcaaag ccaccttctc gcgaaacatc atgaacctga 961 tcgacattgt ggccatcatt ccttatttta tcactctggg taccgagctg gcggaacgac 1021 agggcaatgg acagcaggcc atgtctctgg ccatcctgag ggtcatccgc ctggtaaggg 1081 tcttccgcat cttcaagctg tcgcgccact ccaaggggct gcagatcctc gggcaaagcc 1141 tgaaggcgtc catgcgggag ctgggattgc tcatcttctt cctctttatt ggggtcatcc 1201 ttttctccag cgcggtctac tttgccgagg cagacgaccc cacttcaggt ttcagcagca 1261 tcccggatgc cttctggtgg gcagtggtaa ccatgacaac agtgggttac ggcgatatgc 1321 acccagtgac catagggggg aagattgtgg gatctctctg tgccatcgcc ggtgtcttgt 1381 ccatcgcatt gccagttccc gtgattgttt ccaacttcaa ttacttctac caccgggaga 1441 cagaagggga agagcaatcc cagtacatgc acgtgggaag ttgccagcac ctctcctctt 1501 cagccgagga gctccgaaaa gcaaggagta actcgactct gagtaagtcg gagtatatgg 1561 tgatcgaaga ggggggtatg aaccataccg ctttccccca gacccctttc aaaacgggca 1621 attccactgc cacctgcacc acgaacaata atcccaactc ttgtgtcaac atcaaaaaga 1681 tattcaccga tgtttaatat gtgatacaag tgacatgctg tgctcagtat tgtgtggaac 1741 gtgccccctt ggtctgccta tgcccttgtt ttatacattt ccagaccatt catcaaggaa 1801 aggacctgaa gaagtggaaa // LOCUS HUMPD1A 921 bp DNA PRI 27-DEC-1994 DEFINITION Human PD-1 gene, complete cds. ACCESSION L27440 NID g604540 KEYWORDS . SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 921) AUTHORS Shinohara,T., Taniwaki,M., Ishida,Y., Kawaichi,M. and Honjo,T. TITLE Structure and chromosomal localization of the human PD-1 gene (PDCD1) JOURNAL Genomics 23 (3), 704-706 (1994) MEDLINE 95154844 FEATURES Location/Qualifiers source 1..921 /organism="Homo sapiens" /db_xref="taxon:9606" gene 25..891 /gene="PD-1" CDS 25..891 /gene="PD-1" /note="Tyr-X-X-Leu motif 691..777; extracellular domain 25..525; immunoglobulin-like domain 172..405; intracellular domain 601..891; transmembrane region 526..600" /codon_start=1 /db_xref="PID:g604541" /translation="MQIPQAPWPVVWAVLQLGWRPGWFLDSPDRPWNPPTFSPALLVV TEGDNATFTCSFSNTSESFVLNWYRMSPSNQTDKLAAFPEDRSQPGQDCRFRVTQLPN GRDFHMSVVRARRNDSGTYLCGAISLAPKAQIKESLRAELRVTERRAEVPTAHPSPSP RSAGQFQTLVVGVVGGLLGSLVLLVWVLAVICSRAARGTIGARRTGQPLKEDPSAVPV FSVDYGELDFQWREKTPEPPVPCVPEQTEYATIVFPSGMGTSSPARRGSADGPRSAQP LRPEDGHCSWPL" BASE COUNT 163 a 326 c 280 g 152 t ORIGIN 1 cactctggtg gggctgctcc aggcatgcag atcccacagg cgccctggcc agtcgtctgg 61 gcggtgctac aactgggctg gcggccagga tggttcttag actccccaga caggccctgg 121 aaccccccca ccttctcccc agccctgctc gtggtgaccg aaggggacaa cgccaccttc 181 acctgcagct tctccaacac atcggagagc ttcgtgctaa actggtaccg catgagcccc 241 agcaaccaga cggacaagct ggccgccttc cccgaggacc gcagccagcc cggccaggac 301 tgccgcttcc gtgtcacaca actgcccaac gggcgtgact tccacatgag cgtggtcagg 361 gcccggcgca atgacagcgg cacctacctc tgtggggcca tctccctggc ccccaaggcg 421 cagatcaaag agagcctgcg ggcagagctc agggtgacag agagaagggc agaagtgccc 481 acagcccacc ccagcccctc acccaggtca gccggccagt tccaaaccct ggtggttggt 541 gtcgtgggcg gcctgctggg cagcctggtg ctgctagtct gggtcctggc cgtcatctgc 601 tcccgggccg cacgagggac aataggagcc aggcgcaccg gccagcccct gaaggaggac 661 ccctcagccg tgcctgtgtt ctctgtggac tatggggagc tggatttcca gtggcgagag 721 aagaccccgg agccccccgt gccctgtgtc cctgagcaga cggagtatgc caccattgtc 781 tttcctagcg gaatgggcac ctcatccccc gcccgcaggg gctcagctga cggccctcgg 841 agtgcccagc cactgaggcc tgaggatgga cactgctctt ggcccctctg accggcttcc 901 ttggccacca gtgttctgca g // LOCUS HUMPGMR 2292 bp DNA PRI 14-FEB-1996 DEFINITION Homo sapiens phosphoglucomutase-related protein (PGMRP) gene, complete cds. ACCESSION L40933 NID g1160964 KEYWORDS PGM-related protein; adherens junction; dystrophin; homologue; phosphoglucomutase-related protein; utrophin. SOURCE Homo sapiens (clone: A111;A85) (clone library: ZAPII) female uterus DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2292) AUTHORS Moiseeva,E.P., Belkin,A.M., Spurr,N.K., Koteliansky,V.E. and Critchley,D.R. TITLE A novel dystrophin/utrophin-associated protein is an enzymatically inactive member of the phosphoglucomutase superfamily JOURNAL Eur. J. Biochem. 235 (1-2), 103-113 (1996) MEDLINE 96202923 COMMENT PGMRP gene is located on 9q12-13. FEATURES Location/Qualifiers source 1..2292 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="A111;A85" /clone_lib="ZAPII" /sex="female" /tissue_type="uterus" /map="9qcen-q13" gene 92..1612 /gene="PGMRP" CDS 92..1612 /gene="PGMRP" /note="homologue to phosphoglucomutase 1 (PGM1)" /codon_start=1 /evidence=experimental /product="phosphoglucomutase-related protein" /db_xref="PID:g1160965" /translation="MVVGSDGRYFSRTAIEIVVQMAAANGIGRLIIGQNGILSTPAVS CIIRKIKAAGGIILTASHCPGGPGGEFGVKFNVANGGPAPDVVSDKIYQISKTIEEYA ICPDLRIDLSRLGRQEFDLENKFKPFRVEIVDPVDIYLNLLRTIFDFHAIKGLLTGPS QLKIRIDAMHGVMGPYVRKVLCDELGAPANSAINCVPLEDFGGQHPDPNLTYATTLLE AMKGGEYGFGAAFDADGDRYMILGQNGFFVSPSDSLAIIAANLSCIPYFRQMGVRGFG RSMPTSMALDRVAKSMKVPVYETPAGWRFFSNLMDSGRCNLCGEESFGTGSDHLREKD GLWAVLVWLSIIAARKQSVEEIVRDHWAKFGRHYYCRFDYEGLDPKTTYYIMRDLEAL VTDKSFIGQQFAVGSHVYSVAKTDSFEYVDPVDGTVTKKQGLRIIFSDASRLIFRLSS SSGVRATLRLYAESYERDPSGHDQEPQAVLSPLIAIALKISQIHERTGRRGPTVIT" BASE COUNT 595 a 566 c 583 g 548 t ORIGIN 1 cggcctcttc gagggccagc gcaactacct gcccaacttt atccagagcg tgctgtcgtc 61 catcgacctg cgcgaccgtc agggctgcac catggtggtg ggcagcgacg gcaggtactt 121 tagcaggacg gccatcgaga tcgtggtgca gatggccgcg gccaacggga ttggacgact 181 gattattgga cagaatggca tcttgtcgac acctgcggtc tcctgcatta tcaggaagat 241 caaggcagct ggtggaatca ttctaacagc cagccactgc cctggaggac cagggggaga 301 gtttggagtg aagtttaatg ttgccaatgg aggtcctgca cccgatgttg tctcagacaa 361 aatctaccaa atcagcaaaa cgattgagga atatgctata tgtcctgatc tccgaatcga 421 cctatctcga ctaggaagac aagaatttga cctagaaaac aaattcaaac cattcagagt 481 ggagatagtg gacccagtgg atatctatct taacctcctt cggaccatct ttgactttca 541 tgccatcaag ggtttgctga ctggacccag ccaactgaag attcgcattg acgcaatgca 601 cggagttatg ggaccttatg tgagaaaagt tctgtgtgat gagctggggg ccccagccaa 661 ttctgcaata aactgtgttc ccctggaaga ctttggaggg cagcaccctg acccaaacct 721 gacatatgca acgactcttc tggaagcaat gaaaggagga gaatatggat ttggagctgc 781 atttgatgct gatggggacc gttatatgat cctaggccaa aatggcttct ttgtgagccc 841 ttctgactcc ctggccatca ttgctgccaa cctctcttgc attccatatt tccgtcagat 901 gggggtccgc gggtttggga ggagtatgcc aaccagcatg gccctggaca gagtggccaa 961 atcaatgaag gtccctgtat atgagacccc agctggatgg agattcttct caaatctgat 1021 ggactcagga cgttgcaatc tgtgtgggga agagagcttt ggcactggct ctgaccacct 1081 ccgagagaag gatggcctgt gggctgtctt ggtctggctc tccattattg ctgcccggaa 1141 gcagagtgtg gaggaaattg tccgagatca ctgggccaaa tttggccgcc actactattg 1201 caggtttgac tatgaggggt tggatcccaa gacgacatat tatatcatga gggacctgga 1261 ggccctggtc acagacaaat ccttcattgg ccagcagttt gctgtgggga gccatgtcta 1321 cagcgtggcg aagacggata gttttgaata cgtggaccct gtggatggca ctgtgaccaa 1381 gaaacagggc ctaaggatca ttttctcgga tgcatcacgg ctcatcttcc ggctcagttc 1441 ctccagtggt gtgcgggcca ccctcagact gtacgcagag agctacgaga gggatcccag 1501 cggccatgac caggagccac aggcagtgct gagccctctc atagccatcg cactgaaaat 1561 atcccagatt catgagagaa ctggccggag gggacccact gtcatcacct gaatagagga 1621 aagatcactc accagggcca aagagagtgc tcagcgggag atgcttcact gatgccttct 1681 tgctacctgt ttgtgcctct tatgactttg gaaaaacaaa agatattttg cttttggggg 1741 atagagggtg ggtgggaaaa gaaaaaaaat ccatttggtt ttggttttgt cctattcctc 1801 caaatgcagc agggccttta gttgtctgtt aaagctgcac tataatttgg tatctacatt 1861 ttatcacaca aaggaacctc cccttttgac aacaactggg ctaggcagct gttaatcaca 1921 acatttgtgc atcacttgtg ccaagtgaga aaatgttcta aaatcacaag agagaacagt 1981 gccagaatga aactgaccct aagtcccagg tgcccctggg caggcagaag gagacactcc 2041 cagcatggag gagggtttat cttttcatcc taggtcaggt ctacaatggg ggaaggtttt 2101 attatagaac tcccaacagc ccacctcact cctgccaccc acccgatggc cctgcctccc 2161 ccatcccatc cccaacatcc ctgtaccacc ttctctcaca tcttctaaag ctttgtacaa 2221 atcacaatgg tgcacttcca acaaaatata tcaataggtg ttttcctctc tcaaaaaaaa 2281 aaaaaaaaaa aa // LOCUS HUMPP2AB 2574 bp DNA PRI 04-MAR-1991 DEFINITION Human protein phosphatase 2A catalytic subunit-beta gene, complete cds. ACCESSION M60484 J05297 NID g190225 KEYWORDS protein phosphatase-2A catalytic subunit-beta. SOURCE Human placenta leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2574) AUTHORS Khew-Goodall,Y., Mayer,R.E., Maurer,F., Stone,S.R. and Hemmings,B.A. TITLE Structure and transcriptional regulation of protein phosphatase 2A catalytic subunit genes JOURNAL Biochemistry 30, 89-97 (1991) MEDLINE 91105105 COMMENT Although this is a genomic sequence, the introns have been omitted. Intronic sequences have been requested. The data is presented as submitted by the author. FEATURES Location/Qualifiers source 1..2574 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 567..>2574 /product="protein phosphatase-2A catalytic subunit-beta" CDS 991..1920 /EC_number="3.1.3.16" /codon_start=1 /product="protein phosphatase-2A catalytic subunit-beta" /db_xref="PID:g190226" /translation="MDDKAFTKELDQWVEQLNECKQLNENQVRTLCEKAKEILTKESN VQEVRCPVTVCGDVHGQFHDLMELFRIGGKSPDTNYLFMGDYVDRGYYSVETVTLLVA LKVRYPERITILRGNHESRQITQVYGFYDECLRKYGNANVWKYFTDLFDYLPLTALVD GQIFCLHGGLSPSIDTLDHIRALDRLQEVPHEGPMCDLLWSDPDDRGGWGISPRGAGY TFGQDISETFNHANGLTLVSRAHQLVMEGYNWCHDRNVVTIFSAPNYCYRCGNQAAIM ELDDTLKYSFLQFDPAPRRGEPHVTRRTPDYFL" BASE COUNT 660 a 558 c 686 g 670 t ORIGIN 1 ttggggaggg gggtgcctag atggccccta agaggggtcc ctgttctgtc tctcaataaa 61 tatttgttga atgaacaaat cattacaact cagtacacat tgcagaaaat atagccaaga 121 gctctggagc tggaagggcc acagattatc ctacagaaca atcatttcac ttttctaatg 181 tcgaaaggga ggattcgaga tgttaggggc tacaggtgag gctggaaata attaatttac 241 tccatcaata ttgaatgcct gctattgagt gctagactct ggggagacaa tgttaaggga 301 agcccaagtt ttccaactcc ctgtccagag cgctgcaaag tactgcagga agctaaagtg 361 aggacaaagt tcccagagat caggatattt aaagggagaa ccagcagagc ttggtctggg 421 gcagggtggg gaaaagaggg accctggcct cctcggaccg tttctccgcc aagccacgcg 481 agggcgctgt tctgctccta gggcgccgtg tcccggcggc gccgcctgct cgccttttcc 541 cggcggaaat gcccgagcga tgacggaaac ccggaggagg ggagagaaag agcgagagaa 601 ggggaaagac aagtcgggag aggccggtag gcgtgaggcg ggcctgaagc ggcagcgggc 661 ggccttcgtc cggcgagagc taggccgagg acccgcgccg cgctccccgg cacctcaccg 721 cgtccttcac cgactcccgc ggcgcgcggc cgggcgggga agggcgggcg ggggtctcct 781 ccaggctgcg cgctcggagc cgcctgctgg gcttgggcgg ggcgcggggc ccgcggccgc 841 cctacccggc tcagtcctcc ccctgtggga cctggcgacg gcggcggagg gagaggggag 901 cggcgcccgg gccggggccg ggggcgggtg gggagggggg agggcggcgg ccgggctggg 961 gctcgggatc cgcatcggga tcgggccgcc atggacgaca aggcgttcac caaggagctg 1021 gaccagtggg tcgagcagct gaacgagtgt aagcagctga acgagaacca agtgcggacg 1081 ctgtgcgaga aggcaaagga aattttaaca aaagaatcaa atgtgcaaga ggttcgttgc 1141 cctgttactg tctgtggaga tgtgcatggt caatttcatg atcttatgga actctttaga 1201 attggtggaa aatcaccgga tacaaactac ttattcatgg gtgactatgt agacagagga 1261 tattattcag tggagactgt gactcttctt gtagcattaa aggtgcgtta tccagaacgc 1321 attacaatat tgagaggaaa tcacgaaagc cgacaaatta cccaagtata tggcttttat 1381 gatgaatgtc tgcgaaagta tgggaatgcc aacgtttgga aatattttac agatctcttt 1441 gattatcttc cacttacagc tttagtagat ggacagatat tctgcctcca tggtggcctc 1501 tctccatcca tagacacact ggatcatata agagccctgg atcgtttaca ggaagttcca 1561 catgagggcc caatgtgtga tctgttatgg tcagatccag atgatcgtgg tggatggggt 1621 atttcaccac gtggtgctgg ctacacattt ggacaagaca tttctgaaac ctttaaccat 1681 gccaatggtc tcacactggt ttctcgtgcc caccagcttg taatggaggg atacaattgg 1741 tgtcatgatc ggaatgtggt taccattttc agtgcaccca attactgtta tcgttgtggg 1801 aaccaggctg ctatcatgga attagatgac actttaaaat attccttcct tcaatttgac 1861 ccagcgcctc gtcgtggtga gcctcatgtt acacggcgca ccccagacta cttcctataa 1921 atttctcctg ggaaacctgc ctttgtatgt ggaagtatac ctggcttttt aaaatatatg 1981 tatttaaaaa caaaaagcaa cagtaatcta tgtgtttctg taacaaattg ggatctgtct 2041 tggcattaaa ccacatcatg gaccaaatgt gccatactaa tgatgagcat ttagcacaat 2101 ttgagactga aatttagtac actatgttct aggtcagtct aacagtttgc ctgctgtatt 2161 tatagtaacc attttccttt ggactgttca agcaaaaaag gtaactaact gcttcatctc 2221 cttttgcgct tatttggaaa ttttagttat agtgtttaac tggcatggat taatagagtt 2281 ggagttttat ttttaagaaa aattcacaag ctaacttcca ctaatccatt atcctttatt 2341 ttattgaaat gtataattaa cttaactgaa gaaaaggttc ttcttgggag tatgttgtca 2401 taacatttaa agagatttcc cttcatttaa actaaattac tgttttatgt tgatctgcat 2461 atttctgtat atttgtcatg acagtgcttg catcctattt ggtgtactca gcaaataaac 2521 ttttcatttt aaacaaaaca ttcatttatt gtgttgtgca ttaaatgaaa actt // LOCUS HUMPPNT4P 1404 bp DNA PRI 08-JAN-1995 DEFINITION Human neurotrophin-4 (NT-4) gene, complete cds. ACCESSION M86528 NID g190264 KEYWORDS nerve growth factor; neurotrophin-4. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1404) AUTHORS Ip,N.Y., Ibanez,C.F., Nye,S.H., McClain,J., Jones,P.F., Gies,D.R., Belluscio,L., Le Beau,M.M., Espinosa,R.III., Squinto,S.P., Persson,H. and Yancopoulos,G.D. TITLE Mammalian neurotrophin-4: structure, chromosomal localization, tissue distribution, and receptor specificity JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (7), 3060-3064 (1992) MEDLINE 92212967 FEATURES Location/Qualifiers source 1..1404 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 475..1107 /gene="NT-4" CDS 475..1107 /gene="NT-4" /note="pre-pro protein" /codon_start=1 /product="neurotrophin-4" /db_xref="PID:g190265" /translation="MLPLPSCSLPILLLFLLPSVPIESQPPPSTLPPFLAPEWDLLSP RVVLSRGAPAGPPLLFLLEAGAFRESAGAPANRSRRGVSETAPASRRGELAVCDAVSG WVTDRRTAVDLRGREVEVLGEVPAAGGSPLRQYFFETRCKADNAEEGGPGAGGGGCRG VDRRHWVSECKAKQSYVRALTADAQGRVGWRWIRIDTACVCTLLSRTGRA" mat_peptide 715..1104 /gene="NT-4" /function="nerve growth factor-related neurotrophic factor" /product="neurotrophin-4" BASE COUNT 273 a 405 c 386 g 340 t ORIGIN 1 cttgtcaccc aggtggcagg ggagtggtgc actctctgct cactgcaacc tcggcctcct 61 gggttcgagt gattctccta cctcagccta ctgagtagct gggattacag gcgtgcagca 121 ctatgcccgg ttaattttgg tatttttggt agagatgagg tttcaccatg ttgaccagct 181 gctctggaac tcctgacctc aagtcatcca cctgcctcag cctcccagag tgctgggatt 241 agaggtgtgg ggcacagtgc ctggcctgta gtagttgaat atttattatt aatctacaag 301 ttgcgcatta cgcaagccct agatataggg tcccccaaac ttctagaaca agggcttccc 361 cacaatcctg gcaggcaagc ctcccctggg gttcccaact tctttcccca ctgaagtttt 421 tacccccttc tctaatccca gcctccctct ttctgtctcc aggtgctccg agagatgctc 481 cctctcccct catgctccct ccccatcctc ctccttttcc tcctccccag tgtgccaatt 541 gagtcccaac ccccaccctc aacattgccc ccttttctgg cccctgagtg ggaccttctc 601 tccccccgag tagtcctgtc taggggtgcc cctgctgggc cccctctgct cttcctgctg 661 gaggctgggg cctttcggga gtcagcaggt gccccggcca accgcagccg gcgtggggtg 721 agcgaaactg caccagcgag tcgtcggggt gagctggctg tgtgcgatgc agtcagtggc 781 tgggtgacag accgccggac cgctgtggac ttgcgtgggc gcgaggtgga ggtgttgggc 841 gaggtgcctg cagctggcgg cagtcccctc cgccagtact tctttgaaac ccgctgcaag 901 gctgataacg ctgaggaagg tggcccgggg gcaggtggag ggggctgccg gggagtggac 961 aggaggcact gggtatctga gtgcaaggcc aagcagtcct atgtgcgggc attgaccgct 1021 gatgcccagg gccgtgtggg ctggcgatgg attcgaattg acactgcctg cgtctgcaca 1081 ctcctcagcc ggactggccg ggcctgagac ccatgcccag gaaaataaca gagctggatg 1141 ctgagagacc tcagggatgg cccagctgat ctaaggaccc cagtttggga actcatcaaa 1201 taatcacaaa atcacaattc tctgattttg agctcaatct ctgcaggatg ggtgaaacca 1261 catggggttt tggaggttga ataggagttc tcctggagca acttgagggt aataatgatg 1321 atgatataat aataatagcc actatttact gagtgtttac tgtttcttat ccctaataca 1381 taactcctca gatcaactct catg // LOCUS HUMPPT 2287 bp DNA PRI 11-JAN-1996 DEFINITION Homo sapiens palmitoyl-protein thioesterase gene, complete cds. ACCESSION L42809 NID g1160966 KEYWORDS CLN1 gene; infantile Batten's disease; palmitoyl-protein thioesterase; thioesterase. SOURCE Homo sapiens (clone library: Stratagene cat. no. 935205) female 2 yr old brain temportal cortex DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2287) AUTHORS Vesa,J., Hellsten,E., Verkruyse,L.A., Camp,L.A., Rapola,J., Santavuori,P., Hofmann,S.L. and Peltonen,L. TITLE Mutations in the palmitoyl protein thioesterase gene causing infantile neuronal ceroid lipofuscinosis JOURNAL Nature 376 (6541), 584-587 (1995) MEDLINE 95364950 FEATURES Location/Qualifiers source 1..2287 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene cat. no. 935205" /dev_stage="2 yr old" /sex="female" /tissue_type="brain temportal cortex" sig_peptide 15..89 CDS 15..935 /codon_start=1 /product="palmitoyl-protein thioesterase" /db_xref="PID:g1160967" /translation="MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDS CCNPLSMGAIKKMVEKKIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKD PKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHIC DFIRKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKN LMALKKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDN AGQLVFLATEGDHLQLSEEWFYAHIIPFLG" mat_peptide 90..932 /product="palmitoyl-protein thioesterase" BASE COUNT 629 a 507 c 519 g 632 t ORIGIN 1 gtgacacagc gaagatggcg tcgcccggct gcctgtggct cttggctgtg gctctcctgc 61 catggacctg cgcttctcgg gcgctgcagc atctggaccc gccggcgccg ctgccgttgg 121 tgatctggca tgggatggga gacagctgtt gcaatccctt aagcatgggt gctattaaaa 181 aaatggtgga gaagaaaata cctggaattt acgtcttatc tttagagatt gggaagaccc 241 tgatggagga cgtggagaac agcttcttct tgaatgtcaa ttcccaagta acaacagtgt 301 gtcaggcact tgctaaggat cctaaattgc agcaaggcta caatgctatg ggattctccc 361 agggaggcca atttctgagg gcagtggctc agagatgccc ttcacctccc atgatcaatc 421 tgatctcggt tgggggacaa catcaaggtg tttttggact ccctcgatgc ccaggagaga 481 gctctcacat ctgtgacttc atccgaaaaa cactgaatgc tggggcgtac tccaaagttg 541 ttcaggaacg cctcgtgcaa gccgaatact ggcatgaccc cataaaggag gatgtgtatc 601 gcaaccacag catcttcttg gcagatataa atcaggagcg gggtatcaat gagtcctaca 661 agaaaaacct gatggccctg aagaagtttg tgatggtgaa attcctcaat gattccattg 721 tggaccctgt agattcggag tggtttggat tttacagaag tggccaagcc aaggaaacca 781 ttcccttaca ggagacctcc ctgtacacac aggaccgcct ggggctaaag gaaatggaca 841 atgcaggaca gctagtgttt ctggctacag aaggggacca tcttcagttg tctgaagaat 901 ggttttatgc ccacatcata ccattccttg gatgaaaccc gtatagttca caatagagct 961 cagggagccc ctaactcttc caaaccacat gggagacagt ttccttcatg cccaagcctg 1021 agctcagatc cagcttgcaa ctaatccttc tatcatctaa catgccctac ttggaaagat 1081 ctaagatctg aatcttatcc tttgccatct tctgttacca tatggtgttg aatgcaagtt 1141 taattaccat ggagattgtt ttacaaactt ttgatgtggt caagttcagt tttagaaaag 1201 ggagtctgtt ccagatcagg gccagaactg tgcccaggcc caaaggagac aactaactaa 1261 agtagtgaga tagattctaa gggcaaacat ttttccaagt cttgccatat ttcaagcaaa 1321 gaggtgccca ggcctgaggt actcacataa atgctttgtt ttgctggtga tttaaccagt 1381 gcttggaaaa atcttgcttg gctatttctg catcatttct taaggctgcc ttcctctctg 1441 agtacgttgc cctctgtgct atcaatcatc ttatcatcaa ttattagaca aatcccactg 1501 gcctacagtc ttgcttctgc agcacccact ttgtctcctc aggtagtgat gaattagttg 1561 ctgtcacaaa aggagggaag tagcacccaa attaaattgc ttaagagagg aaatgtacat 1621 cttgtataac ttagggagcg aagaaaatgt aggcgcgaaa gtgaaaagtg aggcagctag 1681 ttcttcctat tccattctcg accaacctgc cctttcttaa tatgactagt ggtcttgatg 1741 ctagagtcaa cttactctgt tgctggcttt agcagagaat aggaggaacc atatgaaaaa 1801 gatcaggctt tctgacttcc atccccaaaa cacatttacc agcatactcc aaactgtttc 1861 tgatgtgttc catgagaaaa ggattgtttg ctcaaaaagc ttggaaaata ctacacactc 1921 cctttctcct tctggagatc aacccacatt agagtgtcta aggactcctg agaattcctg 1981 ttacagtaaa caaaactaac gtaatctacc atttcctaca ctatttgagc atggaaatca 2041 tagtccccac tctgtgaaaa cttaacgctt tttggaagac atttctgtag catgtcagtt 2101 tggagaaatg atgagctacg ccttgatgaa agaaccgtgt tggtgctgct aagtttagcc 2161 attatggttt ttcctttctc tctcttaagc cttattcttc aactaaaaga tgaggattaa 2221 gagcaagaag ttggggggga tgtgaaaata attttatgag gttgtctaaa ataaagagta 2281 gtttctt // LOCUS HUMPROAF 1455 bp DNA PRI 23-JUL-1992 DEFINITION Human prothymosin-alpha pseudogene, complete sequence. ACCESSION J04801 NID g190371 KEYWORDS prothymosin-alpha; pseudogene. SOURCE Human lymphocyte cell line RPMI 8226 DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1455) AUTHORS Eschenfeldt,W.H., Manrow,R.E., Krug,M.S. and Berger,S.L. TITLE Isolation and partial sequencing of the human prothymosin alpha gene family: Evidence against export of the gene products JOURNAL J. Biol. Chem. 264, 7546-7555 (1989) MEDLINE 89214202 FEATURES Location/Qualifiers source 1..1455 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RPMI 8226" /cell_type="lymphocyte" TATA_signal 31..36 repeat_unit 172..187 /rpt_type=direct CDS 352..705 /note="open reading frame A" /codon_start=1 /db_xref="PID:g190372" /translation="MSDAAVDTSSEITTEDLKEKKEVVEEAENGRDAPAHGNANEENG EPDDNEVDEEEEEGGEEEEEEEGDGEEEDGDEDEGAESATGKRAAEDDEDNDVDTQKQ KTDEDDQTAKKEKLN" polyA_signal 1331..1336 repeat_unit 1391..1406 /rpt_type=direct BASE COUNT 451 a 299 c 341 g 364 t ORIGIN 1 caacacgctg cccttcacag gtaagtgtaa tatatagaga aaaatttcat gttaagtgac 61 ttgccagagt atgcagttgt ggaatgtctt ttcgggggca tgcttttctc tgagagagag 121 agagaaaaaa aaaacaactc ttaccctcct ccaaatctaa ggacatttga tgaaaaacat 181 tgtcctgccc cactggctgc tctgaaaagc cgtctttgca ttgtgcgtcg tcagcctcct 241 tgctcgccgc agccgcctcg ccgccgcgga ctccggcagc tttatcgcca gagtccctga 301 actctcgctt tcttttttat cccctgcatc gcgtcaccgg cgtgccccac catgtcagac 361 gcagccgtag acaccagctc cgaaatcacc accgaggact taaaggagaa gaaggaagtt 421 gtggaagagg cggaaaatgg aagagacgcc cctgctcacg ggaatgctaa tgaggaaaat 481 ggggagccgg atgacaacga ggtagatgaa gaagaggaag aaggtgggga ggaagaggag 541 gaggaagaag gtgatggtga ggaagaggac ggagatgaag atgagggagc tgagtcagct 601 acgggcaagc gggcagctga agatgatgag gataacgatg tcgataccca gaagcagaag 661 accgacgagg atgaccagac agcaaaaaag gaaaagttaa actaaaaaaa aaaggccgcc 721 gtgacctatt caccctccac ttcccgtctc agaatctaaa cgtggtcacc ttcgagtaga 781 ggggcccgcc cgcccaccgt gggcagtgcc acccgcagat gacacgcgct ctccaccacc 841 caacccaaac catgagaatt tgcaacaggg gagggaaaaa gaaccaaaac ttccaaggcc 901 cgcttttttt ttttcttaaa agtactttaa aaaggaaact tgtatttttt atttacattt 961 tatatttttg tacatattgt tagggtcggc catttttaat gatctcggat gaccaaacca 1021 gccttcggag cgttctctgt cctacttctc actttacttg tggtgtggcc atgttcatta 1081 taatctcaaa ggagaaaaaa aaaacttgta aaaaatgcaa aaatgacaac agaaaaacca 1141 tcttattccg agcattccag taactttttt gtgtatgtac ttagctgtac tataagtagt 1201 tggtttgtat gagatggtta aaaaggccaa agataaaagg tttctttttt ttcctttttt 1261 gtctatgaag ttgctgttta tttttatttt ttggcctgtt tgatgtatgt gtgaaacaat 1321 gttgtccaac aataaacagg aattttattt tgctgagttg ttctagcaaa aaaaaaaaag 1381 aaaaaaaaaa gaaaaacatt gttctgatga aaatcacttg gaatgagcct cttagggaaa 1441 taaatagaaa gtact // LOCUS HUMPROLA 1404 bp DNA PRI 19-MAY-1995 DEFINITION Human cathepsin L gene, complete cds. ACCESSION M20496 NID g809235 KEYWORDS cathepsin L; collagenolytic lysosomal enzyme; elastinolytic lysosomal enzyme. SOURCE Human kidney, cDNA to mRNA, clone SL12.1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1404) AUTHORS Joseph,L.J., Chang,L.C., Stamenkovich,D. and Sukhatme,V.P. TITLE Complete nucleotide and deduced amino acid sequences of human and murine preprocathepsin L. An abundant transcript induced by transformation of fibroblasts JOURNAL J. Clin. Invest. 81 (5), 1621-1629 (1988) MEDLINE 88213715 FEATURES Location/Qualifiers source 1..1404 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q21-q22" sig_peptide 134..184 /gene="CTSL" /note="cathepsin L signal peptide; G00-119-824" CDS 134..1135 /gene="CTSL" /note="preprocathepsin L precursor" /codon_start=1 /db_xref="GDB:G00-119-824" /db_xref="PID:g190418" /translation="MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNE EGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRK GKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS VANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDH GVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV" gene 134..1135 /gene="CTSL" mat_peptide 473..1132 /gene="CTSL" /note="cathepsin L; G00-119-824" BASE COUNT 384 a 266 c 374 g 380 t ORIGIN 421 bp upstream of EcoRI site. 1 acctccacgt gccctgtttt tctggaggca catccttggc ctcttccaca gtccttgggt 61 aaatgcttgg gagaataatt taaatatttt tattctacca tggtggccct aatttttcag 121 ggggcagtaa gatatgaatc ctacactcat ccttgctgcc ttttgcctgg gaattgcctc 181 agctactcta acatttgatc acagtttaga ggcacagtgg accaagtgga aggcgatgca 241 caacagatta tacggcatga atgaagaagg atggaggaga gcagtgtggg agaagaacat 301 gaagatgatt gaactgcaca atcaggaata cagggaaggg aaacacagct tcacaatggc 361 catgaacgcc tttggagaca tgaccagtga agaattcagg caggtgatga atggctttca 421 aaaccgtaag cccaggaagg ggaaagtgtt ccaggaacct ctgttttatg aggcccccag 481 atctgtggat tggagagaga aaggctacgt gactcctgtg aagaatcagg gtcagtgtgg 541 ttcttgttgg gcttttagtg ctactggtgc tcttgaagga cagatgttcc ggaaaactgg 601 gaggcttatc tcactgagtg agcagaatct ggtagactgc tctgggcctc aaggcaatga 661 aggctgcaat ggtggcctaa tggattatgc tttccagtat gttcaggata atggaggcct 721 ggactctgag gaatcctatc catatgaggc aacagaagaa tcctgtaagt acaatcccaa 781 gtattctgtt gctaatgaca ccggctttgt ggacatccct aagcaggaga aggccctgat 841 gaaggcagtt gcaactgtgg ggcccatttc tgttgctatt gatgcaggtc atgagtcctt 901 cctgttctat aaagaaggca tttattttga gccagactgt agcagtgaag acatggatca 961 tggtgtgctg gtggttggct acggatttga aagcacagaa tcagataaca ataaatattg 1021 gctggtgaag aacagctggg gtgaagaatg gggcatgggt ggctacgtaa agatggccaa 1081 agaccggaga aaccattgtg gaattgcctc agcagccagc taccccactg tgtgagctgt 1141 ggacggtgat gaggaaggac ttgactgggg atggcgcatg catgggagga attcttcagt 1201 ctaccagccc ccgctgtgtc ggatacacac tcgaatcatt gaagatccga gtgtgatttg 1261 aattctgtga tattttcaca ctggtaaatg ttacctctat tttaattact gctataaata 1321 ggtttatatt attgattcac ttactgactt tgcattttcg tttttaaaag gatgtataaa 1381 tttttacctg tttaaataaa atcg // LOCUS HUMPTAFR 1467 bp DNA PRI 08-JAN-1995 DEFINITION Human platelet activating factor receptor (PTAFR) gene, complete cds. ACCESSION M88177 NID g190697 KEYWORDS platelet activating factor receptor. SOURCE Homo sapiens (tissue library: Charon 4A of T.Maniatis) fetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1467) AUTHORS Seyfried,C.E., Schweickart,V.L., Godiska,R. and Gray,P.W. TITLE The human platelet-activating factor receptor gene (PTAFR) contains no introns and maps to chromosome 1 JOURNAL Genomics 13 (3), 832-834 (1992) MEDLINE 92347886 FEATURES Location/Qualifiers source 1..1467 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="liver" /tissue_lib="Charon 4A of T.Maniatis" /map="Unassigned" gene 253..1281 /gene="PTAFR" CDS 253..1281 /gene="PTAFR" /codon_start=1 /db_xref="GDB:G00-128-806" /product="platelet activating factor receptor" /db_xref="PID:g190698" /translation="MEPHDSSHMDSEFRYTLFPIVYSIIFVLGVIANGYVLWVFARLY PCKKFNEIKIFMVNLTMADMLFLITLPLWIVYYQNQGNWILPKFLCNVAGCLFFINTY CSVAFLGVITYNRFQAVTRPIKTAQANTRKRGISLSLVIWVAIVGAASYFLILDSTNT VPDSAGSGNVTRCFEHYEKGSVPVLIIHIFIVFSFFLVFLIILFCNLVIIRTLLMQPV QQQRNAEVTGRALWMVCTVLAVFIICFVPHHVVQLPWTLAELGFQDSKFHQAINDAHQ VTLCLLSTNCVLDPVIYCFLTKKFRKHLTEKFYSMRSSRKCSRATTDTVTEVVVPFNQ IPGNSLKN" BASE COUNT 327 a 433 c 345 g 362 t ORIGIN Chromosome I. 1 cgggaggcgg aggttgcggt gagctgagat cacgccactg cactccagcc tgggcagcaa 61 gagtgaaact ccatctgaaa aaaaaaaaaa gattcaacat gaacttctga ggggacatca 121 tcattctaac catggcaagg agtcttggaa ctgatgaaat ggaacagtcc cttcttgtcc 181 ctttattaac cagaattttt gtgtggtctt ccaggcacca ccaggaccag ctgatcattc 241 cagcccacag caatggagcc acatgactcc tcccacatgg actctgagtt ccgatacact 301 ctcttcccga ttgtttacag catcatcttt gtgctcgggg tcattgctaa tggctacgtg 361 ctgtgggtct ttgcccgcct gtacccttgc aagaaattca atgagataaa gatcttcatg 421 gtgaacctca ccatggcgga catgctcttc ttgatcaccc tgccactttg gattgtctac 481 taccaaaacc agggcaactg gatactcccc aaattcctgt gcaacgtggc tggctgcctt 541 ttcttcatca acacctactg ctctgtggcc ttcctgggcg tcatcactta taaccgcttc 601 caggcagtaa ctcggcccat caagactgct caggccaaca cccgcaagcg tggcatctct 661 ttgtccttgg tcatctgggt ggccattgtg ggagctgcat cctacttcct catcctggac 721 tccaccaaca cagtgcccga cagtgctggc tcaggcaacg tcactcgctg ctttgagcat 781 tacgagaagg gcagcgtgcc agtcctcatc atccacatct tcatcgtgtt cagcttcttc 841 ctggtcttcc tcatcatcct cttctgcaac ctggtcatca tccgtacctt gctcatgcag 901 ccggtgcagc agcagcgcaa cgctgaagtc acaggccggg cgctgtggat ggtgtgcacg 961 gtcttggcgg tgttcatcat ctgcttcgtg ccccaccacg tggtgcagct gccctggacc 1021 cttgctgagc tgggcttcca ggacagcaaa ttccaccagg ccattaatga tgcacatcag 1081 gtcaccctct gcctccttag caccaactgt gtcttagacc ctgttatcta ctgtttcctc 1141 accaagaagt tccgcaagca cctcaccgaa aagttctaca gcatgcgcag tagccggaaa 1201 tgctcccggg ccaccacgga tacggtcact gaagtggttg tgccattcaa ccagatccct 1261 ggcaattccc tcaaaaatta gtccctgctt ccaggcctga agtcttctcc tccatgaaca 1321 tcatggactg agctggggga agaagggata tctactgtgg tctgggcacc acctctgtgg 1381 gcactggtgg gccattagat ttggaggcta cctcacctgg gcagggatga tggcagacga 1441 ggctgttgga aaatccagaa ctcaaat // LOCUS HUMPTPIV1G 1514 bp DNA PRI 26-DEC-1996 DEFINITION Homo sapiens (clone hh7-2) ptp-IV1b, PTP-IV1 gene, complete cds. ACCESSION L48937 NID g1246235 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1514) AUTHORS Zhao,Z., Lee,C.C., Monckton,D.G., Yazdani,A., Coolbaugh,M.I., Shen,Y. and Caskey,C.T. TITLE Characterization and genomic mapping of genes and pseudogenes of a new human protein tyrosine phosphatase JOURNAL Genomics 35 (1), 172-181 (1996) MEDLINE 96299754 FEATURES Location/Qualifiers source 1..1514 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hh7-2" /tissue_lib="heart" gene 813..1316 /gene="ptp-IV1b, PTP-IV1" CDS 813..1316 /gene="ptp-IV1b, PTP-IV1" /codon_start=1 /db_xref="PID:g1246236" /translation="MNRPAPVEISYEDMRFLITHNPTNATLNKFTEELKKYGVTTLVR VCDATYDKAPVEKEGIHVLDWPFDDGAPPPNQIVDDWLNLLKTKFREEPGCCVAVHCV AGLGRAPVLVALALIECGMKYEDAVQFIRQKRRGAFNSKQLLYLEKYRPKMRLRFRDT NGHCCVQ" BASE COUNT 413 a 317 c 351 g 433 t ORIGIN 1 ggccgctacc gttttttgtg gtgtactccg tgccatcatg tccgtcctga cgctgctgct 61 gctgccgggc cgggtgtcgc gcgccgaggc tgggggggag tcgtcgccgc cgccgccacc 121 gctaccgccg ccgccgccgc cgccgaggtg actgaggaga gaggcgcctc ctcgctcccg 181 ccaccgccgg acttcaatgc ccagtcccca gctcgccagc gtttttcgtt ggaatatacg 241 ttgcacattt atggcgattc tgagtgtgag ggcagacttc tgccaggctc agcacagcat 301 tttcgctgac aagtgagctt ggaggttcta tgtgccataa ttaacattgc cttgaagact 361 cctggacacc gagactggcc tcagaaatag ttggcttttt tttttttaat tgcaagcata 421 tttcttttaa tgactccagt aaaattaagc atcaagtaaa caagtggaaa gtgacctaca 481 cttttaactt gtctcactag tgcctaaatg tagtaaaggc tgcttaagtt ttgtatgtag 541 ttggattttt tggagtccga aggtatccat ctgcagaaat tgaggcccaa attgaatttg 601 gattcaagtg gattctaaat actttgctta tcttgaagag agaagcttca taaggaataa 661 acaagttgaa tagagaaaac actgattgat aataggcatt ttagtggtct ttttaatgtt 721 ttctgctgtg aaacatttca agatttattg attttttttt ttcactttcc ccatcacact 781 cacacgcacg ctcacacttt ttatttgcca taatgaaccg tccagcccct gtggagatct 841 cctatgagga catgcgtttt ctgataactc acaaccctac caatgctact ctcaacaagt 901 tcacagagga acttaagaag tatggagtga cgactttggt tcgagtttgt gatgctacat 961 atgataaagc tccagttgaa aaagaaggaa tccacgttct agattggcca tttgatgatg 1021 gagctccacc ccctaatcag atagtagatg attggttaaa cctgttaaaa accaaatttc 1081 gtgaagagcc aggttgctgt gttgcagtgc attgtgttgc aggattggga agggcacctg 1141 tgctggttgc acttgctttg attgaatgtg gaatgaagta cgaagatgca gttcagttta 1201 taagacaaaa aagaagggga gcgttcaatt ccaaacagct gctttatttg gagaaatacc 1261 gacctaagat gcgattacgc ttcagagata ccaatgggca ttgctgtgtt cagtagaagg 1321 aaatgtaaac gaaggctgac ttgattgtgc catttagagg gaactcttgg tacctggaaa 1381 tgtgaatctg gaatattacc tgtgtcatca aagtagtgat ggattcagta ctcctcaacc 1441 actctcctaa tgattggaac aaaagcaaac aaaaaagaaa tctctctata aaatgaataa 1501 aatgtttaag aaaa // LOCUS HUMRABCCF 2583 bp DNA PRI 24-AUG-1995 DEFINITION Homo sapiens cellular co-factor (RAB) gene, complete cds. ACCESSION L42025 NID g945222 KEYWORDS cellular co-factor. SOURCE Homo sapiens (clone: RAB) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2583) AUTHORS Bogerd,H.P., Fridell,R.A., Madore,S. and Cullen,B.R. TITLE Identification of a novel cellular cofactor for the Rev/Rex class of retroviral regulatory proteins JOURNAL Cell 82 (3), 485-494 (1995) MEDLINE 95360992 REFERENCE 2 (bases 1 to 2583) AUTHORS Cullen,B.R. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) Bryan R. Cullen, Howard Hughes Medical Institute and Department of Genetics, Duke University Medical Center, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..2583 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RAB" /cell_line="CEM-SS" /cell_type="T-cell" mRNA <1..>2583 /gene="RAB" gene 1..2583 /gene="RAB" CDS 244..1932 /gene="RAB" /note="Rev/Rex activation domain-binding protein" /codon_start=1 /function="cellular co-factor" /db_xref="PID:g945223" /translation="MAASAKRKQEEKHLKMLRDMTGLPHNRKCFDCDQRGPTYVNMTV GSFVCTSCSGSLRGLNPPHRVKSISMTTFTQQEIEFLQKHGNEVCKQIWLGLFDDRSS AIPDFRDPQKVKEFLQEKYEKKRWYVPPEQAKVVASVHASISGSSASSTSSTPEVKPL KSLLGDSAPTLHLNKGTPSQSPVVGRSQGQQQEKKQFDLLSDLGSDIFAAPAPQSTAT ANFANFAHFNSHAAQNSANADFANFDAFGQSSGSSNFGGFPTASHSPFQPQTTGGSAA SVNANFAHFDNFPKSSSADFGTFNTSQSHQTASAVSKVSTNKAGLQTADKYAALANLD NIFSAGQGGDQGSGFGTTGKAPVGSVVSVPSQSSASSDKYAALAELDSVFSSAATSSN AYTSTSNASSNVFGTVPVVASAQTQPASSSVPAPFGATPSTNPFVAAAGPSVASSTNP FQTNARGATAATFGTASMSMPTGFGTPAPYSLPTSFSGSFQQPAFPAQAAFPQQTAFS QQPNGAGFAAFGQTKPVVTPFGQVAAAGVSSNPFMTGAPTGQFPTGSSSTNPFL" BASE COUNT 702 a 640 c 572 g 669 t ORIGIN 1 tggcggcggc ggcggcggtt gtcccggctg tgccggttgg tgtggcccgt cagcccgcgt 61 accacagcgc ccgggccgcg tcgagcccag tacagccaag ccgctgcggc cgggtccggc 121 gcgggcggcg cgcgcagacg gagggcggcg gccgcggcca gggcggcccg tgggaccgcg 181 ggcccccggc gcagcgctgc ccggctcccg gccctgccgg cctcctccct tggcgccgcg 241 gccatggcgg ccagcgcgaa gcggaagcag gaggagaagc acctgaagat gctgcgggac 301 atgaccggcc tcccgcacaa ccgaaagtgc ttcgactgcg accagcgcgg ccccacctac 361 gttaacatga cggtcggctc cttcgtgtgt acctcctgct ccggcagcct gcgaggatta 421 aatccaccac acagggtgaa atctatctcc atgacaacat tcacacaaca ggaaattgaa 481 ttcttacaaa aacatggaaa tgaagtctgt aaacagattt ggctaggatt atttgatgat 541 agatcttcag caattccaga cttcagggat ccacaaaaag tgaaagagtt tctacaagaa 601 aagtatgaaa agaaaagatg gtatgtcccg ccagaacaag ccaaagtcgt ggcatcagtt 661 catgcatcta tttcagggtc ctctgccagt agcacaagca gcacacctga ggtcaaacca 721 ctgaaatctc ttttagggga ttctgcacca acactgcact taaataaggg cacacctagt 781 cagtccccag ttgtaggtcg ttctcaaggg cagcagcagg agaagaagca atttgacctt 841 ttaagtgatc tcggctcaga catctttgct gctccagctc ctcagtcaac agctacagcc 901 aattttgcta actttgcaca tttcaacagt catgcagctc agaattctgc aaatgcagat 961 tttgcaaact ttgatgcatt tggacagtct agtggttcga gtaattttgg aggtttcccc 1021 acagcaagtc actctccttt tcagccccaa actacaggtg gaagtgctgc atcagtaaat 1081 gctaattttg ctcattttga taacttcccc aaatcctcca gtgctgattt tggaaccttc 1141 aatacttccc agagtcatca aacagcatca gctgttagta aagtttcaac gaacaaagct 1201 ggtttacaga ctgcagacaa atatgcagca cttgctaatt tagacaatat cttcagtgcc 1261 gggcaaggtg gtgatcaggg aagtggcttt gggaccacag gtaaagctcc tgttggttct 1321 gtggtttcag ttcccagtca gtcaagtgca tcttcagaca agtatgcagc tctggcagaa 1381 ctagacagcg ttttcagttc tgcagccacc tccagtaatg cgtatacttc cacaagtaat 1441 gctagcagca atgtttttgg aacagtgcca gtggttgctt ctgcacagac acagcctgct 1501 tcatcaagtg tgcctgctcc atttggagct acgccttcca caaatccatt tgttgctgct 1561 gctggtcctt ctgtggcatc ttctacaaac ccatttcaga ccaatgccag aggagcaaca 1621 gcggcaacct ttggcactgc atccatgagc atgcccacgg gattcggcac tcctgctccc 1681 tacagtcttc ccaccagctt tagtggcagc tttcagcagc ctgcctttcc agcccaagca 1741 gctttccctc aacagacagc tttttctcaa cagcccaatg gtgcaggttt tgcagcattt 1801 ggacaaacaa agccagtagt aacccctttt ggtcaagttg cagctgctgg agtatctagt 1861 aatcctttta tgactggtgc accaacagga caatttccaa caggaagctc atcaaccaat 1921 cctttcttat agccttatat agacaattta ctggaacgaa cttttatgtg gtcacattac 1981 atctctccac ctcttgcact gttgtcttgt ttcactgatc ttagctttaa acacaagaga 2041 agtctttaaa aagcctgcat tgtgtattaa acaccaggta atatgtgcaa aacagagggc 2101 tccagtaaca ccttctaacc tgtgaattgg cagaaaaggg tagcggtatc atgtatatta 2161 aaattggcta atattaagtt attgcagata ccacattcat tatgctgcag tactgtacat 2221 atttttctta gaaattagct atttgtgcat atcagtattt gtaactttaa cacattgtta 2281 tgtgagaaat gttactgggg aaatagatca gccactttta aggtgctgtc atatatcttt 2341 ggaatgaatg acctaaaatc attttaacca tgctactgga aagtaacaga gtcaaaattg 2401 gaaggtttta ttcattcttg aatttttcct ttctaaagag ctcttctatt tatacatgcc 2461 taaattcttt taaaatgtag agggatacct gtctgcataa taaagctgat catgttttgc 2521 tacagtttgc aggtgaaaaa aaataaatat tataaaataa aaaaaaaaaa aaaaaaaaaa 2581 aaa // LOCUS HUMRB1A 503 bp DNA PRI 05-JUL-1995 DEFINITION Homo sapiens (clone 104) retinoblastoma 1 gene, complete cds. ACCESSION M26460 NID g341501 KEYWORDS nuclear phosphoprotein; retinoblastoma 1; retinoblastoma protein. SOURCE Homo sapiens (tissue library: of T.Tomatsu, M.Hattori and Y.Sakaki) lymph node DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 503) AUTHORS Taya,Y., Watanabe,K. and Nishimura,S. TITLE Homology between a region of the human retinoblastoma gene and L1 family repetitive sequences JOURNAL Biochem. Biophys. Res. Commun. 160, 1061-1066 (1989) MEDLINE 89273558 FEATURES Location/Qualifiers source 1..503 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymph node" /tissue_lib="of T.Tomatsu, M.Hattori and Y.Sakaki" /map="13q14.2" gene 98..376 /gene="RB1" CDS 98..376 /gene="RB1" /codon_start=1 /db_xref="GDB:G00-118-734" /product="retinoblastoma 1" /db_xref="PID:g536845" /translation="MNRNEQSLQEIWDYVKRPHIHLIGVPESDRENGTKLENTLQDII QENFPNLARQANIQIQETQRTPHRYSSRRATPRHIIVRFTEVEMKKKF" BASE COUNT 204 a 93 c 108 g 98 t ORIGIN 1 ccttcaatag ccgattcgat caagtggaag aaagggtatc agtgattgaa gatcatatta 61 atgaaataaa gcaagagaca agattagaga aaaaagaatg aatagaaatg aacaaagcct 121 ccaagaaata tgggactatg tgaaaagacc acatatacat ttgattggtg taccggaaag 181 tgacagggag aatggaacca agttagaaaa cactcttcag gatattatcc aggagaactt 241 ccctaaccta gcaaggcagg ccaacattca aattcaggaa acacagagaa caccacacag 301 atactcctcg agaagagcaa ctccaagaca cataattgtc agattcactg aggttgaaat 361 gaagaaaaaa ttttaagggc atccagagag aaaggtcagg ttacccacaa agggaagtcc 421 atcagactaa tagtggatct ctcggcagaa accctacaag ccagaagaga gtgggggcca 481 atattcttca ttcttaaaca aaa // LOCUS HUMRPL26X 2704 bp DNA PRI 09-JAN-1995 DEFINITION Human ribosomal protein L26 (RPL26) gene exon 2, complete cds. ACCESSION L07287 NID g292434 KEYWORDS ribosomal protein L26. SOURCE Homo sapiens (tissue library: lambda Charon4A) blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2704) AUTHORS Butterfield,L.H., Stephen,K., Snyder,C. and Winston,S. TITLE Characterization of the human homologue to rat ribosomal protein L26 from HL60 cells JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..2704 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="promyelocytes" /cell_type="HL60" /tissue_type="blood" /tissue_lib="lambda Charon4A" misc_feature 1..1350 /gene="RPL26" /standard_name="CpG island" gene 1..2704 /gene="RPL26" repeat_region 185..468 /gene="RPL26" /rpt_family="Alu" misc_feature 468..515 /gene="RPL26" /standard_name="Z DNA" misc_feature 1364..1377 /gene="RPL26" /standard_name="polypyrimidine tract" misc_signal 1368 /gene="RPL26" /standard_name="mRNA CAP site" /evidence=experimental misc_binding 1378..1386 /gene="RPL26" /bound_moiety="delta transcription factor" intron 1403..2243 /gene="RPL26" /number=1 /evidence=experimental exon 2244..2416 /gene="RPL26" /number=2 /evidence=experimental CDS 2249..2686 /gene="RPL26" /codon_start=1 /product="ribosomal protein L26" /db_xref="PID:g292435" /translation="MKFNPFVTSDRSKNRKRHFNAPSHIRRKIMSSPLSKELRQKYNV RSMPIRKDDEVQVVRGHYKGQQIGKVVQVYRKKYVIYIERVQREKANGTTVHVGIHPS KVVITRLKLDKDRKKILERKAKSRQVGKEKSKYKEETIEKMQE" intron 2417..>2704 /gene="RPL26" /number=2 /evidence=experimental BASE COUNT 679 a 696 c 756 g 573 t ORIGIN 1 gaattcatga tgtaaagtgt agagaagagc gtccccacta ctcagaaggc tcactacgtc 61 ttcggtattc ttgtgaaacg cctgtcctct gtcactgtga cacccgtgtc attcattcag 121 cagaagccgt tcagcgccgg tgcctgctgc actgccgctg cctcagaaac catttgcagg 181 ggccgggcgc ggtggctcac gcctgtaatc ccagcacttt gggaggctga ggcgggcgga 241 ttgcctgagg tcgggagttc gagaccagcc tggccaacat ggtgaaccct cgtctctact 301 aaaactacaa aaattagccg ggcgtggtgg cgggcgcctg taatcccagc tactcaggag 361 gctgaggcag gagaacccgg gaggcggagg ttgcagtgag tggagatcgt gccagtgcgc 421 tccagcctga gcaacaagag cgaaactgtc tcaaaaagaa gaaaaagaca cacacacaca 481 cacacacaca cacacacaca cacacacaca cacacgaaga agaagaaaag aaaccatttg 541 cagggagcgc acggcttccg tctggtcctg aactggacgc aaacaggccg agcatttaga 601 aacccgcgca gctccccgca ccgagaggct tcctctcgcc tggctccttc gaagcgtggt 661 ctggaccagc acctcccaca gcactggggg cgccttggaa acgcggcctt ggacctattt 721 ccgcgggggc aggcggcgat cggtggccct ggtcctggca aggcccctca ccacccagaa 781 ctgaatctct ggcgggccct gctgtgcggt cagagacgcc caggtgcagg ggcggtggcg 841 gcgtcgcggt ggagctcgtt aaagtaccgg cacgctggcc gcgccacgcc acgggcggag 901 gggacagtga gtgcagggac ccaggcgacc ggccggccgc gtccccgcgc ctcctggcgg 961 ccagaagcag ggcctgcgct tccccggcta aggtgccggc ccccgccccg gcacctggga 1021 gtgaaggtac cagcgggtct cctgggcggt ctcctggggc cgctcgttcc ttcatcaacc 1081 ccaggggcag ttggaccctg agcagcctta gccaatagcc ccgggactac ccttcccaga 1141 ggattcagcg gcagcacccg ggtctcggaa gggagacact aggacctgga ggaggggagt 1201 tttgcaatcc ctcgcagttc ccctcctagc cactaggtga cactagccat aaatttattt 1261 cccagtttac tcccctcgct caaactccgc gcccctctcg ctccgagaga cataggtctc 1321 gcgagatctt tggtaaactt acagaaccgg aagcagcgtg tagttctctt cccttttgcg 1381 gccatcaccg aagcgggagc gggtaaggat tcggcgggca agcgggtgta atcagcagcc 1441 atccgttctt gggcatggtg gcttccgacc gcggggaccc cgagagtctc ttgtcggccg 1501 agtagcagcc aggaaggaga gactgggatg gttttttatc tgttgctttc ttaaatcaag 1561 ggccgccggg ccggagatgg atggagggac cggggatttg ggaactcgaa aacgagctga 1621 gggaagggag cctgtggaaa tagactggag tctgggtagt gtcgtttcct agagaatggt 1681 ctcgaagtaa cttctcggta aagtcttcac ggaatttcca gaccacactt gcccactggg 1741 tggcttttag gacccgagac gtgtgcaggc ttttccaggt tcgggttggt gggccttatt 1801 cattttggcg gggctatacg tattcactct taagcctgta gaagatgaac gttgattgta 1861 ccttctaaaa cgtgaggtta tagttaagag aattgcgttg tcggtctgct cagcagagat 1921 gtaagtccag tcttgatttc ctcaggcact acttgaggag ttgggagagt gtaactaaaa 1981 ggtcttataa ctaggacaag tgtttttgat gcaaaagtct ttagtccgta aacatataca 2041 gaaaacaccc gaataaaaca aaaaataaaa agtttcgagc ttcggttgat aggcccttcc 2101 gatcaaacag tttttgttga agttgcacaa catgacactg cctatgatca ttctagacta 2161 tcaagataaa atcattgcgg ttatttgttt ggaagctgga tgagattttg gtcatttaag 2221 agtcttttta actttttatt tagccaaaat gaagtttaat ccctttgtga cttccgaccg 2281 aagcaagaat cgcaaaaggc atttcaatgc accttcccac attcgaagga agattatgtc 2341 ttcccctctt tccaaagagc tgagacagaa gtacaacgtg cgatccatgc ccatccgaaa 2401 ggatgatgaa gttcaggttg tacgtggaca ctataaaggt cagcaaattg gcaaagtagt 2461 ccaggtttac aggaagaaat atgttatcta cattgaacgg gtgcagcggg aaaaggctaa 2521 tggcacaact gtccacgtag gcattcaccc cagcaaggtg gttatcacta ggctaaaact 2581 ggacaaagac cgcaaaaaga tcctcgaacg gaaagccaaa tctcgccaag taggaaagga 2641 aaagagcaaa tacaaggaag aaaccattga gaagatgcag gaataaagta atcttatata 2701 caag // LOCUS HUMSAP49A 1275 bp DNA PRI 09-JAN-1995 DEFINITION Human spliceosomal protein (SAP 49) gene, complete cds. ACCESSION L35013 NID g556216 KEYWORDS spliceosomal protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1275) AUTHORS Champion-Arnaud,P. and Reed,R. TITLE The prespliceosome components SAP 49 and SAP 145 interact in a complex implicated in tethering U2 snRNP to the branch site JOURNAL Genes Dev. 8, 1974-1983 (1994) MEDLINE 95047348 FEATURES Location/Qualifiers source 1..1275 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1275 /gene="SAP 49" CDS 1..1275 /gene="SAP 49" /note="RNA recognition motif amino acid 15-86 and 102-174, proline-glycine rich domain amino acid 200-424" /codon_start=1 /product="spliceosomal protein" /db_xref="PID:g556217" /translation="MAAGPISERNQDATVYVGGLDEKVSEPLLWELFLQAGPVVNTHM PKDRVTGQHQGYGFVEFLSEEDADYAIKIMNMIKLYGKPIRVNKASAHNKNLDVGANI FIGNLDPEIDEKLLYDTFSAFGVILQTPKIMRDPDTGNSKGYAFINFASFDASDAAIE AMNGQYLCNRPITVSYAFKKDSKGERHGSAAERLLAAQNPLSQADRPHQLFADAPPPP SAPNPVVSSLGSGLPPPGMPPPGSFPPPVPPPGALPPGIPPAMPPPPMPPGAAGHGPP SAGTPGAGHPGHGHSHPHPFPPGGMPHPGMSQMQLAHHGPHGLGHPHAGPPGSGGQPP PRPPPGMPHPGPPPMGMPPRGPPFGSPMGHPGPMPPHGMRGPPPLMPPHGYTGPPRPP PYGYQRGPLPPPRPTPRPPVPPRGPLRGPLPQ" BASE COUNT 270 a 437 c 302 g 266 t ORIGIN 1 atggctgccg ggccgatctc cgagcggaat caggatgcca ctgtgtacgt ggggggcctg 61 gatgagaagg ttagtgaacc gctgctgtgg gaactgtttc tccaggctgg accagtagtc 121 aacacccaca tgccaaagga tagagtcact ggccagcacc aaggctatgg ctttgtggaa 181 ttcttgagtg aggaagatgc tgactatgcc attaagatca tgaacatgat caaactctat 241 gggaagccaa tacgggtgaa caaagcatca gctcacaaca aaaacctgga tgtaggggcc 301 aacattttca ttgggaacct ggaccctgag attgatgaga agttgcttta tgatactttc 361 agcgcctttg gggtcatctt acaaaccccc aaaattatgc gggaccctga cacaggcaac 421 tccaaaggtt atgcctttat taattttgct tcatttgatg cttcggatgc agcaattgaa 481 gccatgaatg ggcagtacct ctgtaaccgt cctatcaccg tatcttatgc cttcaagaag 541 gactccaagg gtgagcgcca tggctcagca gccgaacgac ttctggcagc tcagaacccg 601 ctctcccagg ctgatcgccc tcatcagctg tttgcagatg cacctcctcc accctctgct 661 cccaatcctg tggtatcatc attggggtct gggcttcctc caccaggcat gcctcctcct 721 ggctccttcc cacccccagt gccacctcct ggagccctcc cacctgggat acccccagcc 781 atgcccccac cacctatgcc tcctggggct gcaggacatg gccccccatc ggcaggaacc 841 ccaggggcag gacatcctgg tcatggacac tcacatcctc acccattccc accgggtggg 901 atgccccatc cagggatgtc tcagatgcag cttgcacacc atggccctca tggcttagga 961 catccccacg ctggaccccc aggctctggg ggccagccac cgccccgacc accacctgga 1021 atgcctcatc ctggacctcc tccaatgggc atgccccccc gagggcctcc attcggatct 1081 cccatgggtc acccaggtcc tatgcctccg catggtatgc gtggacctcc tccactgatg 1141 cccccccatg gatacactgg ccctccacga cccccaccct atggctacca gcgggggcct 1201 ctccctccac ccagacccac tccccggcca ccagttcccc ctcgaggccc acttcgaggc 1261 cctctccctc agtaa // LOCUS HUMSAP62X 1395 bp DNA PRI 09-JAN-1995 DEFINITION Human spliceosomal protein (SAP 62) gene, complete cds. ACCESSION L21990 NID g409218 KEYWORDS nuclear protein; spliceosomal protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1395) AUTHORS Bennett,M. and Reed,R. TITLE Correspondence between a mammalian spliceosome component and an essential yeast splicing factor JOURNAL Science 262 (5130), 105-108 (1993) MEDLINE 94023929 FEATURES Location/Qualifiers source 1..1395 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1395 /gene="SAP 62" CDS 1..1395 /gene="SAP 62" /codon_start=1 /product="spiceosomal protein" /db_xref="PID:g409219" /translation="MDFQHRPGGKTGSGGVASSSESNRDRREPLRQLALETIDINKDP YFMKNHLGSYECKLCLTLHNNEGSYLAHTQGKKHQTNLARRAAKEAKEAPAQPAPEKV KVEVKKFVKIGRPGYKVTKQRDSEMGQQSLLFQIDYPEIAEGIMPRHRFMSAYEQRIE PPDRRWQYLLMAAEPYETIAFKVPSREIDKAEGKFWTHWNRETKQFFLQFHFKMEKPP APPSLPAGPPGVKRPPPPLMNGLPPRPPLPESLPPPPPGGLPLPPMPPTGPAPSGPPG PPQLPPPAPGVHPPAPVVHPPASGVHPPAPGVHPPAPGVHPPAPGVHPPTSGVHPPAP GVHPPAPGVHPPAPGVHPPAPGVHPPAPGVHPPPSAGVHPQAPGVHPAAPAVHPQAPG VHPPAPGMHPQAPGVHPQPPGVHPSAPGVHPQPPGVHPSNPGVHPPTPMPPMLRPPLP SEGPGNIPPPPPTN" misc_feature 48..78 /gene="SAP 62" /standard_name="zinc finger" repeat_region 286..439 /note="proline repeat" BASE COUNT 262 a 570 c 372 g 191 t ORIGIN 1 atggacttcc agcatcgccc cgggggcaag accgggagcg ggggcgtggc ctcctcctcc 61 gagagcaacc gtgaccgcag ggagccgctc cggcagctgg ccctggagac catcgacatc 121 aacaaggacc cgtacttcat gaagaaccac ctgggctcct atgaatgcaa actctgcctg 181 acacttcaca acaatgaggg gagctacctg gcacatacgc aggggaagaa gcaccagacc 241 aacctggccc ggcgagcagc caaggaggcc aaggaggccc ctgcccagcc cgcgcctgag 301 aaggtcaagg tggaggtgaa gaagtttgtg aagatcggcc gcccgggcta caaagtgacc 361 aagcagagag actcggagat gggccagcag agcctcctct tccagattga ctaccctgag 421 atcgccgagg gcatcatgcc acgtcaccgc ttcatgtctg cgtacgagca gaggatcgag 481 cctccggacc ggcgctggca gtacctgctc atggccgccg aaccctacga gaccattgcc 541 ttcaaggtgc cgagcagaga gatcgacaag gcggagggca agttctggac acactggaac 601 cgggagacca agcagttctt cctccagttc cactttaaga tggagaagcc cccggctcca 661 cccagcctcc ctgctggccc ccctggggtg aagcggcctc cacccccgct gatgaacggt 721 ctgccccctc ggccaccgct gcctgagtct ttgccaccgc ccccgccagg aggcctgcct 781 ctgccaccca tgccccccac agggcctgcg ccctcagggc ccccgggacc accccagcta 841 cccccgccag ctccaggggt ccaccccccg gccccagtgg tgcatccccc tgcatctggg 901 gtccatcccc cagctcctgg cgtccacccc ccagctcctg gcgtccatcc cccagcccct 961 ggggtccacc caccaacctc tggggtccac cccccagctc ctggagtcca ccctccagcc 1021 cccggggttc acccaccagc ccccggagtc cacccaccag cccctggggt tcacccacca 1081 gccccagggg tccatcctcc cccatcagcg ggggttcacc cccaggcccc gggggtgcac 1141 ccagcagccc ccgccgttca ccctcaggcc ccaggggtgc acccaccagc cccagggatg 1201 caccctcagg ccccgggggt ccacccccaa cctcccgggg tccatccgtc ggctcctggg 1261 gtccaccctc agcctccggg agttcacccc tcaaatcctg gggtgcaccc cccaactccc 1321 atgcccccaa tgctgaggcc cccacttccc tccgaaggcc cagggaacat acctccccct 1381 cccccaacca actga // LOCUS HUMSMIT 2157 bp DNA PRI 16-MAR-1995 DEFINITION Homo sapiens Na+/myo-inositol cotransporter (SLC5A3) gene, complete cds. ACCESSION L38500 NID g662842 KEYWORDS Na+/myo-inositol cotransporter; osmoregulation. SOURCE Homo sapiens (clone hgSMIT) male placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2157) AUTHORS Berry,G.T., Mallee,J.J., Kwon,H.M., Rim,J.S., Mulla,W.R., Muenke,M. and Spinner,N.B. TITLE The human osmoregulatory Na+/myo-inositol cotransporter gene (SLC5A3): molecular cloning and localization to chromosome 21 JOURNAL Genomics 25 (2), 507-513 (1995) MEDLINE 95309919 FEATURES Location/Qualifiers source 1..2157 /organism="Homo sapiens" /note="(vector lambda FIX II)" /db_xref="taxon:9606" /clone="hgSMIT" /germline /sex="male" /tissue_type="placenta" /map="chromosome 21" gene 1..2157 /gene="SLC5A3" CDS 1..2157 /gene="SLC5A3" /codon_start=1 /db_xref="GDB:G00-373-217" /product="Na+/myo-inositol cotransporter" /db_xref="PID:g662843" /translation="MRAVLDTADIAIVALYFILVMCIGFFAMWKCNRSTVSGYFLAGR SMTWVTIGASLFVSNIGSEHFIGLAGSGAASGFAVGAWEFNALLLLQLLGWVFIPIYI RSGVYTMPEYLSKRFGGHRIQVYFAALSLILYIFTKLSVDLYSGALFIQESLGWNLYV SVILLIGMTALLTVTGGLVAVIYTDTLQALLMIIGALTLMIISIMEIGGFEEVKRRYM LASPDVTSILLTYNLSNTNSCNVSPKKEALKMLRNPTDEDVPWPGFILGQTPASVWYW CADQVIVQRVLAAKNIAHAKGSTLMAGFLKLLPMFIIVVPGMISRILFTDDIACINPE HCMLVCGSRAGCSNIAYPRLVMKLVPVGLRGLMMAVMIAALMSDLDSIFNSASTIFTL DVYKLIRKSASSRELMIVGRIFVAFMVVISIAWVPIIVEMQGGQMYLYIQEVADYLTP PVAALFLLAIFWKRCNEQGAFYGGMAGFVLGAVRLILAFAYRAPECDQPDNRPGFIKD IHYMYVATGLFWVTGLITVIVSLLTPPPTKEQIRTTTFWSKKNLVVKENCSPKEEPYQ MQEKSILRCSENNETINHIIPNGKSEDSIKGLQPEDVNLLVTCREEGNPVASLGHSEA ETPVDAYSNGQAALMGEKERKKETDDGGRYWKFIDWFCGFKSKSLSKRSLRDLMEEEA VCLQMLEETRQVKVILNIGLFAVCSLGIFMFVYFSL" BASE COUNT 537 a 460 c 550 g 610 t ORIGIN 1 atgagagctg tactggacac agcagacatt gccatagtgg ccctgtattt tatcctggtc 61 atgtgcattg gtttttttgc catgtggaaa tgtaatagaa gcaccgtgag tggatacttc 121 ctggcggggc gctctatgac ctgggtaaca attggtgcct ctctgtttgt gagcaatatt 181 gggagtgagc acttcattgg gctggcagga tctggagctg caagtggatt tgcagtgggc 241 gcatgggaat tcaatgcctt actgctttta caacttctgg gatgggtttt catcccaatt 301 tacatccggt caggggtata taccatgcct gaatacttgt ccaagcgatt tggtggccat 361 aggattcagg tctattttgc agccttgtct ctgattctct atattttcac caagctctcg 421 gtggatctgt attcgggtgc cctttttatc caggagtctt tgggttggaa tctttatgtg 481 tctgtcatcc tgctcattgg catgactgct ttgctgactg tcaccggagg ccttgttgca 541 gtgatctaca cagacactct gcaggctctg ctcatgatca ttggggcact tacacttatg 601 attattagca taatggagat tggcgggttt gaggaagtta agagaaggta catgttggcc 661 tcacccgatg tcacttccat cttattgaca tacaaccttt ccaacacaaa ttcttgtaat 721 gtctccccta agaaagaagc cctgaaaatg ctgcggaatc caacagatga agatgttcct 781 tggcctggat tcattcttgg gcagacccca gcttcagtat ggtactggtg tgctgaccaa 841 gtcatcgtgc agagggtcct tgcagccaaa aacattgctc atgccaaagg ctctactctt 901 atggctggct tcttaaagct cctgccaatg tttatcatag ttgtcccagg aatgatttcc 961 aggatactgt ttactgatga tatagcttgc atcaacccag agcactgcat gctggtgtgt 1021 ggaagcagag ctggttgctc caatattgct tacccacgcc tggtgatgaa gctggttcct 1081 gtgggccttc ggggtttaat gatggcagtg atgattgcag ctctgatgag tgacttagac 1141 tctatcttta acagtgccag taccatattc accctcgatg tgtacaaact tatccgcaag 1201 agcgcaagct cccgggagtt aatgattgtg gggaggatat ttgtggcatt tatggtggtg 1261 atcagcatag catgggtgcc aatcatcgtg gagatgcaag gaggccagat gtacctttac 1321 attcaggagg tagcagatta cctgacaccc ccagtggcag ccttgttcct gctggcaatt 1381 ttctggaagc gctgcaatga acaaggggct ttctatggtg gaatggctgg ctttgttctt 1441 ggagcagtcc gtttgatact ggcctttgcc taccgtgccc cagaatgtga ccaacctgat 1501 aataggccgg gcttcatcaa agacatccat tatatgtatg tggccacagg attgttttgg 1561 gtcacgggac tcattactgt aattgtgagc cttctcacac cacctcccac aaaggaacag 1621 attcgaacca ccaccttttg gtctaagaag aacctggtgg tgaaggagaa ctgctcccca 1681 aaagaggaac cataccaaat gcaagaaaag agcattctga gatgcagtga gaataatgag 1741 accatcaacc acatcattcc caacgggaaa tctgaagaca gcattaaggg ccttcagcct 1801 gaagatgtta atctgttggt aacctgcaga gaggagggca acccagtggc atccttaggt 1861 cattcagagg cagaaacacc agttgacgct tactccaatg ggcaagcagc tctcatgggt 1921 gagaaagaga gaaagaaaga aacggatgat ggaggtcggt actggaagtt catagactgg 1981 ttttgtggct ttaaaagtaa gagcctcagc aagaggagtc tcagagacct gatggaagag 2041 gaggctgttt gtttacagat gctagaagag actcggcaag ttaaagtaat actaaatatt 2101 ggactttttg ctgtgtgttc acttggaatt ttcatgtttg tttatttctc cttatga // LOCUS HUMSODIUM 2334 bp DNA PRI 29-SEP-1994 DEFINITION Human kidney amiloride-sensitive sodium channel, complete cds. ACCESSION L29007 NID g493125 KEYWORDS sodium channel. SOURCE Homo sapiens Kidney DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2334) AUTHORS McDonald,F.J., Snyder,P.M., McCray,P.B. Jr. and Welsh,M.J. TITLE Cloning, expression, and tissue distribution of a human amiloride-sensitive Na+ channel JOURNAL Am. J. Physiol. 266 (6 Pt 1), L728-L734 (1994) MEDLINE 94295729 FEATURES Location/Qualifiers source 1..2334 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Kidney" CDS 83..2092 /partial /codon_start=1 /product="Na+ channel" /db_xref="PID:g493605" /translation="MEGNKLEEQDSSPPQSTPGLMKGNKREEQGLGPEPAAPQQPTAE EEALIEFHRSYRELFEFFCNNTTIHGAIRLVCSQHNRMKTAFWAVLWLCTFGMMYWQF GLLFGEYFSYPVSLNINLNSDKLVFPAVTICTLNPYRYPEIKEELEELDRITEQTLFD LYKYSSFTTLVAGSRSRRDLRGTLPHPLQRLRVPPPPHGARRARSVASSLRDNNPQVD WKDWKIGFQLCNQNKSDCFYQTYSSGVDAVREWYRFHYINILSRLPETLPSLEEDTLG NFIFACRFNQVSCNQANYSHFHHPMYGNCYTFNDKNNSNLWMSSMPGINNGLSLMLRA EQNDFIPLLSTVTGARVMVHGQDEPAFMDDGGFNLRPGVETSISMRKETLDRLGGDYG DCTKNGSDVPVENLYPSKYTQQVCIHSCFQESMIKECGCAYIFYPRPQNVEYCDYRKH SSWGYCYYKLQVDFSSDHLGCFTKCRKPCSVTSYQLSAGYSRWPSVTSQEWVFQMLSR QNNYTVNNKRNGVAKVNIFFKELNYKTNSESPSVTMVTLLSNLGSQWSLWFGSSVLSV VEMAELVFDLLVIMFLMLLRRFRSRYWSPGRGGRGAQEVASTLASSPPSHFCPHPMSL SLSQPGPAPSPALTAPPPAYATLGPRPSPGGSAGASSSTCPLGGP" BASE COUNT 486 a 739 c 632 g 477 t ORIGIN 1 tccccagcca ggccgctgca cctgtcaggg gaacaagctg gaggagcagg accctagacc 61 tctgcagccc ataccaggtc tcatggaggg gaacaagctg gaggagcagg actctagccc 121 tccacagtcc actccagggc tcatgaaggg gaacaagcgt gaggagcagg ggctgggccc 181 cgaacctgcg gcgccccagc agcccacggc ggaggaggag gccctgatcg agttccaccg 241 ctcctaccga gagctcttcg agttcttctg caacaacacc accatccacg gcgccatccg 301 cctggtgtgc tcccagcaca accgcatgaa gacggccttc tgggcagtgc tgtggctctg 361 cacctttggc atgatgtact ggcaattcgg cctgcttttc ggagagtact tcagctaccc 421 cgtcagcctc aacatcaacc tcaactcgga caagctcgtc ttccccgcag tgaccatctg 481 caccctcaat ccctacaggt acccggaaat taaagaggag ctggaggagc tggaccgcat 541 cacagagcag acgctctttg acctgtacaa atacagctcc ttcaccactc tcgtggccgg 601 ctcccgcagc cgtcgcgacc tgcgggggac tctgccgcac cccttgcagc gcctgagggt 661 cccgcccccg cctcacgggg cccgtcgagc ccgtagcgtg gcctccagct tgcgggacaa 721 caacccccag gtggactgga aggactggaa gatcggcttc cagctgtgca accagaacaa 781 atcggactgc ttctaccaga catactcatc aggggtggat gcggtgaggg agtggtaccg 841 cttccactac atcaacatcc tgtcgaggct gccagagact ctgccatccc tggaggagga 901 cacgctgggc aacttcatct tcgcctgccg cttcaaccag gtctcctgca accaggcgaa 961 ttactctcac ttccaccacc cgatgtatgg aaactgctat actttcaatg acaagaacaa 1021 ctccaacctc tggatgtctt ccatgcctgg aattaacaac ggtctgtccc tgatgctgcg 1081 cgcagagcag aatgacttca ttcccctcct gtccacagtg actggggccc gggtaatggt 1141 gcacgggcag gatgaacctg cctttatgga tgatggtggc tttaacttgc ggcctggcgt 1201 ggagacctcc atcagcatga ggaaggaaac cctggacaga cttgggggcg attatggcga 1261 ctgcaccaag aatggcagtg atgttcctgt tgagaacctt tacccttcca agtacacaca 1321 gcaggtgtgt attcactcct gcttccagga gagcatgatc aaggagtgtg gctgtgccta 1381 catcttctat ccgcggcccc agaacgtgga gtactgtgac tacagaaagc acagttcctg 1441 ggggtactgc tactataagc tccaggttga cttctcctca gaccacctgg gctgtttcac 1501 caagtgccgg aagccatgca gcgtgaccag ctaccagctc tctgctggtt actcacgatg 1561 gccctcggtg acatcccagg aatgggtctt ccagatgcta tcgcgacaga acaattacac 1621 cgtcaacaac aagagaaatg gagtggccaa agtcaacatc ttcttcaagg agctgaacta 1681 caaaaccaat tctgagtctc cctctgtcac gatggtcacc ctcctgtcca acctgggcag 1741 ccagtggagc ctgtggttcg gctcctcggt gttgtctgtg gtggagatgg ctgagctcgt 1801 ctttgacctg ctggtcatca tgttcctcat gctgctccga aggttccgaa gccgatactg 1861 gtctccaggc cgagggggca ggggtgctca ggaggtagcc tccaccctgg catcctcccc 1921 tccttcccac ttctgccccc accccatgtc tctgtccttg tcccagccag gccctgctcc 1981 ctctccagcc ttgacagccc ctccccctgc ctatgccacc ctgggccccc gcccatctcc 2041 agggggctct gcaggggcca gttcctccac ctgtcctctg ggggggccct gagagggaag 2101 gagaggtttc tcacaccaag gcagatgctc ctctggtggg agggtgctgg ccctggcaag 2161 attgaaggat gtgcagggct tcctctcaga gccgcccaaa ctgccgttga tgtgtggagg 2221 ggaagcaaga tgggtaaggg ctcaggaagt tgctccaaga acagtagctg atgaagctgc 2281 ccagaagtgc cttggctcca gccctgtacc ccttggtact gcctctgaac actc // LOCUS HUMSOMAT 1427 bp DNA PRI 03-AUG-1993 DEFINITION Human somatostatin receptor gene, complete cds. ACCESSION L14856 NID g292499 KEYWORDS G protein; G-protein coupled receptor; plasma membrane; receptor; somatostatin; somatostatin receptor; transmembrane protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1427) AUTHORS Xu,Y., Song,J., Bruno,J.F. and Berelowitz,M. TITLE Molecular cloning and sequencing of a human somatostatin receptor, hSSTR4 JOURNAL Biochem. Biophys. Res. Commun. 193, 648-652 (1993) MEDLINE 93290656 FEATURES Location/Qualifiers source 1..1427 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 99..1265 /standard_name="hSSTR4" /note="intronless gene" /codon_start=1 /evidence=experimental /product="somatostatin receptor" /db_xref="PID:g292500" /translation="MSAPSTLPPGGEEGLGTAWPSAANASSAPAEAEEAVAGPGDARA AGMVAIQCIYALVCLVGLVGNALVIFVILRYAKMKTATNIYLLNLAVADELFMLSVPF VASSAALRHWPFGSVLCRAVLSVDGLNMFTSVFCLTVLSVDRYVAVVHPLRAATYRRP SVAKLINLGVWLASLLVTLPIAIFADTRPARGGQAVACNLQWPHPAWSAVFVVYTFLL GFLLPVLAIGLCYLLIVGKMRAVALRAGWQQRRRSEKKITRLVLMVVVVFVLCWMPFY VVQLLNLVVTSLDATVNHVSLILSYANSCANPILYGFLSDNFRRSFQRVLCLRCCLLE GAGGAEEEPLDYYATALKSKGGAGCMCPPLPCQQEALQPEPGRKRIPLTRTTTF" BASE COUNT 201 a 525 c 425 g 276 t ORIGIN 1 gtctgggcgc cagcccccgc cctgggcccg ccgcccgcgc tctctggcgc agcgctagct 61 ccgccgcgct cagctgccct gcgccggcac ccctggtcat gagcgccccc tcgacgctgc 121 cccccggggg cgaggaaggg ctggggacgg cctggccctc tgcagccaat gccagtagcg 181 ctccggcgga ggcggaggag gcggtggcgg ggcccgggga cgcgcgggcg gcgggcatgg 241 tcgctatcca gtgcatctac gcgctggtgt gcctggtggg gctggtgggc aacgccctgg 301 tcatcttcgt gatccttcgc tacgccaaga tgaagacggc taccaacatc tacctgctca 361 acctggccgt agccgacgag ctcttcatgc tgagcgtgcc cttcgtggcc tcgtcggccg 421 ccctgcgcca ctggcccttc ggctccgtgc tgtgccgcgc ggtgctcagc gtcgacggcc 481 tcaacatgtt caccagcgtc ttctgtctca ccgtgctcag cgtggaccgc tacgtggccg 541 tggtgcaccc tctgcgcgcg gcgacctacc ggcggcccag cgtggccaag ctcatcaacc 601 tgggcgtgtg gctggcatcc ctgttggtca ctctccccat cgccatcttc gcagacacca 661 gaccggctcg cggcggccag gccgtggcct gcaacctgca gtggccacac ccggcctggt 721 cggcagtctt cgtggtctac actttcctgc tgggcttcct gctgcccgtg ctggccattg 781 gcctgtgcta cctgctcatc gtgggcaaga tgcgcgccgt ggccctgcgc gctggctggc 841 agcagcgcag gcgctcggag aagaaaatca ccaggctggt gctgatggtc gtggtcgtct 901 ttgtgctctg ctggatgcct ttctacgtgg tgcagctgct gaacctcgtc gtgaccagcc 961 ttgatgccac cgtcaaccac gtgtccctta tcctcagcta tgccaacagc tgcgccaacc 1021 ctattctcta tggcttcctc tccgacaact tccgccgatc cttccagcgg gttctctgcc 1081 tgcgctgctg cctcctggaa ggtgctggag gtgctgagga ggagcccctg gactactatg 1141 ccactgctct caagagcaaa ggtggggcag ggtgcatgtg ccccccactc ccctgccagc 1201 aggaagccct gcaaccagaa cccggccgca agcgcatccc cctcaccagg accaccacct 1261 tctgaggagc ccttccccta cccaccctgc gtggccacct cccaaggggt gggcaccatt 1321 cctacagccc cgaagactgc atctcctgaa tgctcaccta agctccacca cctgttcctt 1381 ccagcagccc atgtacctgc cggagagtgt cagaactctt ctgctgc // LOCUS HUMSPRPC 2651 bp DNA PRI 23-JUL-1993 DEFINITION Human small proline-rich protein gene, complete cds. ACCESSION M84757 NID g338433 KEYWORDS cAMP responsive element; small proline-rich protein. SOURCE Homo sapiens Adult Placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2651) AUTHORS An,G. and Wu,R. TITLE Isolation and characterization of the human spr1 gene and its regulation of expression by phorbol ester and cyclic AMP JOURNAL J. Biol. Chem. 268, 10977-10982 (1992) MEDLINE 93266543 FEATURES Location/Qualifiers source 1..2651 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Adult" /tissue_type="Placenta" enhancer 365..372 /standard_name="cyclic AMP responsive element" misc_binding 812..818 /function="enhancer" /bound_moiety="AP-1" TATA_signal 925..930 exon 953..1000 /number=1 intron 1001..2077 /number=1 exon 2078..2651 /number=2 CDS 2097..2366 /codon_start=1 /product="small proline-rich protein" /db_xref="PID:g338434" /translation="MSSQQQKQPCIPPPQLQQQQVKQPCQPPPQEPCIPKTKEPCHPK VPEPCHPKVPEPCQPKVPEPCHPKVPEPCPSIVTPAPAQQKTKQK" polyA_signal 2630..2635 polyA_site 2651 BASE COUNT 732 a 601 c 573 g 745 t ORIGIN 1 taagatgagt tggaaaataa tgagggaggg accatcccga ccctggattt gaaggatagt 61 ttggattttc ccaaaaccaa acttgccaac aggttgtgta ctgcaggtag acctggatta 121 gtgacctaag tttcatctta tcttgaggag agtgaacttt gatataaggg ctcagggaat 181 cccagctcct ttgttccttc ccatgttggc tggagagaag ggattcccca tgatagccag 241 ggtcttcagg gcctactgtc acaatgtaca gaataaatgg atggactaaa agggccaggt 301 tgctccaagg ccgaatgtga ccattcttcc cagagttcac tgctccgaac tctccttcac 361 agactgaggt cagcctggcc cctatgttgt ttaccctgag ttcctccttg gagaaaaagg 421 ctttttagaa ccgaaataat cattttagtt tttcctttcc cctgccaggg tcgagtatac 481 tgagtgatgg attaacagaa tattactatt tttcacctag ccaccatgcc ttggcctccc 541 agaataacca gaaataccta ttgtttatct ctacatccac ttcatatatt aaacagccct 601 acaagtgtca cctgtgctga ggattagact ctccagaaga taggacagtt tctggttcca 661 gcagccccag aatgtcttcc ttctctctct ttcagcccac accacccttc ctgtaaacac 721 tacacctgga gcaaagggtg ttcaggggga taaaggccca ggtgacatcc ttgtcagaca 781 ggcaagtgcc acaagtttca tcacaaaagg ttgagtcaac aggtgggtga gggaagaggg 841 gtgaatcaca tctgacaggt aaggaatgta ggcacaggca ggcccagatg gatcctgttt 901 ccttgaggca gggcttgttc catgcataaa aagccagttg gctgggaaca ctaccaccag 961 ttctaaggga ccatacagag tattcctctc ttcacaccag gtgagtctct ttattggaac 1021 tttttagcat ccattctagg tatttaattt ttattagagt ctccttggca ttcttggtat 1081 ttttttccct cattattgaa tttgatcata gccatgcaga ctgttcaact gaattataac 1141 ctctaggaat atcttcattt taaagtgttc atactgtagg cttgaagaga atttagacat 1201 tatcagttct tcattcactc tttctgtgtc aggcaagagg gggttgggtt tattcaagga 1261 tccatagtaa gtttgtaatg ggataatgac tagaacccag gtccacaggc acttaccctg 1321 gcattataat tccttcttat ttataatttt aatacagaga attttattct ttcaacagat 1381 acttttgtga gggcctacta tatttggcaa tttgtttgca aagaaataat ttaagcatct 1441 atgatactac aatttaatca ggaaataaga gctaaaatgc attgtataac agtaacttat 1501 cagtgccagg tgattagggc aagttctgtg acagtttggg aatagagagg tttctggaag 1561 ggtttttcta gggaagccac taaaagtccc atatgaacga tctgtggagt ttggttagtg 1621 cttgaatgag taggaacaag gaaaacattt aagacagagg agaacttgag ccaggccaca 1681 ggtgggattg cttggggttc tagagaggtc agagttatac aatagatgat aagccatgct 1741 atttatcaga aatgagtgag tagatttaat ttgttagggt tttttttttc caaaagacct 1801 cacattccaa tggtcagtat tatgagtctc atattcctgc tagaagcctg ttatgtgctg 1861 tgtaggaaga aagagagcac caggagaggc cctgctccag tggagcaaga ggaaatcatc 1921 tttaggttcc tcttcttcaa aggctcagga aattaaatta agtattcttt gtggtcaaga 1981 aggggaaaga acatagcttt ttaagagatt agaagtggta ttcaccatgg cttcttccct 2041 attattctct gcttaaatca ttcttttctt tttccaggac cagccactgt tgcagcatga 2101 gttcccagca gcagaagcag ccctgcatcc caccccctca gcttcagcag cagcaggtga 2161 aacagccttg ccagcctcca cctcaggaac catgcatccc caaaaccaag gagccctgcc 2221 accccaaggt gcctgagccc tgccacccca aagtgcctga gccctgccag cccaaggttc 2281 cagagccatg ccaccccaag gtgcctgagc cctgcccttc aatagtcact ccagcaccag 2341 cccagcagaa gaccaagcag aagtaatgtg gtccacagcc atgcccttga ggagccggcc 2401 accagatgct gaatccccta tcccattctg tgtatgagtc ccatttgcct tgcaattagc 2461 attctgtctc ccccaaaaaa gaatgtgcta tgaagctttc tttcctacac actctgagtc 2521 tctgaatgaa gctgaaggtc ttagtaccag agctagtttt cagctgctca gaattcatct 2581 gaagagagac ttaagatgaa agcaaatgat tcagctccct tataccccca ttaaattcac 2641 tttcaattcc a // LOCUS HUMSPRR2B 391 bp DNA PRI 13-JAN-1995 DEFINITION Homo sapiens small proline-rich protein 2 (SPRR2B) gene, complete cds. ACCESSION L05188 NID g385226 KEYWORDS keratinocyte differentiation marker; small proline-rich protein 2. SOURCE Homo sapiens (tissue library: EMBL3) skin DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 391) AUTHORS Gibbs,S., Fijneman,R., Wiegant,J., van Kessel,A.G., van De Putte,P. and Backendorf,C. TITLE Molecular characterization and evolution of the SPRR family of keratinocyte differentiation markers encoding small proline-rich proteins JOURNAL Genomics 16 (3), 630-637 (1993) MEDLINE 93315153 FEATURES Location/Qualifiers source 1..391 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary foreskin keratinocyte" /cell_type="keratinocyte" /tissue_type="skin" /tissue_lib="EMBL3" /map="1q21-22" misc_feature 63..64 /note="splice acceptor site" gene 84..302 /gene="SPRR2B" CDS 84..302 /gene="SPRR2B" /note="bp 1-51 correspond to N-terminal domain amino acids, conserved in cornified envelope precursors; bp 157-216 correspond to C-ternimal domain amino acids, conserved in cornified envelope precursors; bp 61-141 correspond to three nonapeptide repeats" /codon_start=1 /product="small proline-rich protein 2" /db_xref="PID:g385227" /translation="MSYQQQQCKQPCQPPPVCPTPKCPEPCPPPKCPEPCPPPKCPQP CPPQQCQQKYPPVTPSPPCQPKYPPKSK" BASE COUNT 96 a 138 c 74 g 83 t ORIGIN 1 aagctttggc ttctctctct ggaggattcc caacccacat tcactgttgt atcatttctt 61 tcagatcctg agactccagc aggatgtctt atcaacagca gcagtgcaag cagccctgcc 121 agccacctcc tgtgtgcccc acgccaaagt gcccagagcc atgtccaccc ccgaagtgcc 181 ctgagccctg cccaccacca aagtgtccac agccctgccc acctcagcag tgccagcaga 241 aatatcctcc tgtgacacct tccccaccct gccagccaaa gtatccaccg aagagcaagt 301 aacagcttca ggattcatca ggagcatgag aggataagga taattggctc acctcgttcc 361 acagctccac ctcatcttct catcaaagct t // LOCUS HUMSRCPT1F 1141 bp DNA PRI 26-FEB-1993 DEFINITION Homo sapiens serotonin receptor (HTR1F) gene, complete cds. ACCESSION L04962 NID g338464 KEYWORDS . SOURCE Homo sapiens (library: Stratagene; lambda DASHII) lymphocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1141) AUTHORS Adham,N., Kao,H.-T., Schechter,L.E., Bard,J.A., Olsen,M., Urquhart,D., Durkin,M., Hartig,P.R., Weinshank,R.L. and Branchek,T.A. TITLE Cloning of another human serotonin receptor (5-HT+(sub-1F): A fifth 5-HT1 receptor subtype coupled to the inhibition of adenylate cyclase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 408-412 (1993) MEDLINE 93133800 FEATURES Location/Qualifiers source 1..1141 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="lymphocyte" /tissue_lib="Stratagene; lambda DASHII" 5'UTR 1..26 CDS 27..1127 /standard_name="5-HT1F" /codon_start=1 /product="serotonin receptor" /db_xref="PID:g338465" /translation="MDFLNSSDQNLTSEELLNRMPSKILVSLTLSGLALMTTTINSLV IAAIIVTRKLHHPANYLICSLAVTDFLVAVLVMPFSIVYIVRESWIMGQVVCDIWLSV DITCCTCSILHLSAIALDRYRAITDAVEYARKRTPKHAGIMITIVWIISVFISMPPLF WRHQGTSRDDECIIKHDHIVSTIYSTFGAFYIPLALILILYYKIYRAAKTLYHKRQAS RIAKEEVNGQVLLESGEKSTKSVSTSYVLEKSLSDPSTDFDKIHSTVRSLRSEFKHEK SWRRQKISGTRERKAATTLGLILGAFVICWLPFFVKELVVNVCDKCKISEEMSNFLAW LGYLNSLINPLIYTIFNEDFKKAFQKLVRCRC" 3'UTR 1128..1141 BASE COUNT 340 a 223 c 231 g 347 t ORIGIN 1 tatattaatc ttttaaaaca aagaaaatgg atttcttaaa ttcatctgat caaaacttga 61 cctcagagga actgttaaac agaatgccat ccaaaattct ggtgtccctc actctgtctg 121 ggctggcact gatgacaaca actatcaact cccttgtgat cgctgcaatt attgtgaccc 181 ggaagctgca ccatccagcc aattatttaa tttgttccct tgcagtcaca gattttcttg 241 tggctgtcct ggtgatgccc ttcagcattg tgtatattgt gagagagagc tggattatgg 301 ggcaagtggt ctgtgacatt tggctgagtg ttgacattac ctgctgcacg tgctccatct 361 tgcatctctc agctatagct ttggatcggt atcgagcaat cacagatgct gttgagtatg 421 ccaggaaaag gactccaaag catgctggca ttatgattac aatagtttgg attatatctg 481 tttttatctc tatgcctcct ctattctgga ggcaccaagg aactagcaga gatgatgaat 541 gcatcatcaa gcacgaccac attgtttcca ccatttactc aacatttgga gctttctaca 601 tcccactggc attgattttg atcctttact acaaaatata tagagcagca aagacattat 661 accacaagag acaagcaagt aggattgcaa aggaggaggt gaatggccaa gtccttttgg 721 agagtggtga gaaaagcact aaatcagttt ccacatccta tgtactagaa aagtctttat 781 ctgacccatc aacagacttt gataaaattc atagcacagt gagaagtctc aggtctgaat 841 tcaagcatga gaaatcttgg agaaggcaaa agatctcagg tacaagagaa cggaaagcag 901 ccactaccct gggattaatc ttgggtgcat ttgtaatatg ttggcttcct ttttttgtaa 961 aagaattagt tgttaatgtc tgtgacaaat gtaaaatttc tgaagaaatg tccaattttt 1021 tggcatggct tgggtatctc aattccctta taaatccact gatttacaca atctttaatg 1081 aagacttcaa gaaagcattc caaaagcttg tgcgatgtcg atgttagttt taaaaatgtt 1141 t // LOCUS HUMSRI1A 1634 bp DNA PRI 29-DEC-1994 DEFINITION Human somatostatin receptor isoform 1 gene, complete cds. ACCESSION M81829 NID g307433 KEYWORDS somatostatin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1634) AUTHORS Yamada,Y., Post,S.R., Wang,K., Tager,H.S., Bell,G.I. and Seino,S. TITLE Cloning and functional characterization of a family of human and mouse somatostatin receptors expressed in brain, gastrointestinal tract, and kidney JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (1), 251-255 (1992) MEDLINE 92108031 COMMENT genomic sequence; gene lacks introns. FEATURES Location/Qualifiers source 1..1634 /organism="Homo sapiens" /db_xref="taxon:9606" gene 100..1275 /gene="SSTR2" CDS 100..1275 /gene="SSTR2" /codon_start=1 /db_xref="GDB:G00-134-186" /product="somatostatin receptor isoform 1" /db_xref="PID:g307434" /translation="MFPNGTASSPSSSPSPSPGSCGEGGGSRGPGAGAADGMEEPGRN ASQNGTLSEGQGSAILISFIYSVVCLVGLCGNSMVIYVILRYAKMKTATNIYILNLAI ADELLMLSVPFLVTSTLLRHWPFGALLCRLVLSVDAVNMFTSIYCLTVLSVDRYVAVV HPIKAARYRRPTVAKVVNLGVWVLSLLVILPIVVFSRTAANSDGTVACNMLMPEPAQR WLVGFVLYTFLMGFLLPVGAICLCYVLIIAKMRMVALKAGWQQRKRSERKITLMVMMV VMVFVICWMPFYVVQLVNVFAEQDDATVSQLSVILGYANSCANPILYGFLSDNFKRSF QRILCLSWMDNAAEEPVDYYATALKSRAYSVEDFQPENLESGGVFRNGTCTSRITTL" BASE COUNT 283 a 513 c 495 g 343 t ORIGIN 1 ctgcaggcaa gcggtcgggt ggggagggag ggcgcaggcg gcgggtgcgc gaggagaaag 61 ccccagccct ggcagcccca ctggcccccc tcagctggga tgttccccaa tggcaccgcc 121 tcctctcctt cctcctctcc tagccccagc ccgggcagct gcggcgaagg cggcggcagc 181 aggggccccg gggccggcgc tgcggacggc atggaggagc cagggcgaaa tgcgtcccag 241 aacgggacct tgagcgaggg ccagggcagc gccatcctga tctctttcat ctactccgtg 301 gtgtgcctgg tggggctgtg tgggaactct atggtcatct acgtgatcct gcgctatgcc 361 aagatgaaga cggccaccaa catctacatc ctaaatctgg ccattgctga tgagctgctc 421 atgctcagcg tgcccttcct agtcacctcc acgttgttgc gccactggcc cttcggtgcg 481 ctgctctgcc gcctcgtgct cagcgtggac gcggtcaaca tgttcaccag catctactgt 541 ctgactgtgc tcagcgtgga ccgctacgtg gccgtggtgc atcccatcaa ggcggcccgc 601 taccgccggc ccaccgtggc caaggtagta aacctgggcg tgtgggtgct atcgctgctc 661 gtcatcctgc ccatcgtggt cttctctcgc accgcggcca acagcgacgg cacggtggct 721 tgcaacatgc tcatgccaga gcccgctcaa cgctggctgg tgggcttcgt gttgtacaca 781 tttctcatgg gcttcctgct gcccgtgggg gctatctgcc tgtgctacgt gctcatcatt 841 gctaagatgc gcatggtggc cctcaaggcc ggctggcagc agcgcaagcg ctcggagcgc 901 aagatcacct taatggtgat gatggtggtg atggtgtttg tcatctgctg gatgcctttc 961 tacgtggtgc agctggttaa cgtgtttgct gagcaggacg acgccacggt gagtcagctg 1021 tcggtcatcc tcggctatgc caacagctgc gccaacccca tcctctatgg ctttctctca 1081 gacaacttca agcgctcttt ccaacgcatc ctatgcctca gctggatgga caacgccgcg 1141 gaggagccgg ttgactatta cgccaccgcg ctcaagagcc gtgcctacag tgtggaagac 1201 ttccaacctg agaacctgga gtccggcggc gtcttccgta atggcacctg cacgtcccgg 1261 atcacgacgc tctgagcccg ggccacgcag gggctctgag cccgggccac gcaggggccc 1321 tgagccaaaa gagggggaga atgagaaggg aaggccgggt gcgaaaggga cggtatccag 1381 ggcgccaggg tgctgtcggg ataacgtggg gctaggacac tgacagcctt tgatggagga 1441 acccaagaaa ggcgcgcgac aatggtagaa gtgagagctt tgcttataaa ctgggaaggc 1501 tttcaggcta cctttttctg ggtctcccac tttctgttcc ttcctccact gcgcttgctc 1561 ctctgaccct ccttctattt tccccaccct gcaacttcta tcctttcttc cgcaccgtcc 1621 cgccagtgca gatc // LOCUS HUMSRI2A 1351 bp DNA PRI 29-DEC-1994 DEFINITION Human somatostatin receptor isoform 2 (SSTR2) gene, complete cds. ACCESSION M81830 NID g307435 KEYWORDS somatostatin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1351) AUTHORS Yamada,Y., Post,S.R., Wang,K., Tager,H.S., Bell,G.I. and Seino,S. TITLE Cloning and functional characterization of a family of human and mouse somatostatin receptors expressed in brain, gastrointestinal tract, and kidney JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (1), 251-255 (1992) MEDLINE 92108031 COMMENT genomic sequence; gene lacks introns. FEATURES Location/Qualifiers source 1..1351 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q24" gene 83..1192 /gene="SSTR2" CDS 83..1192 /gene="SSTR2" /codon_start=1 /db_xref="GDB:G00-134-186" /product="somatostatin receptor isoform 2" /db_xref="PID:g307436" /translation="MDMADEPLNGSHTWLSIPFDLNGSVVSTNTSNQTEPYYDLTSNA VLTFIYFVVCIIGLCGNTLVIYVILRYAKMKTITNIYILNLAIADELFMLGLPFLAMQ VALVHWPFGKAICRVVMTVDGINQFTSIFCLTVMSIDRYLAVVHPIKSAKWRRPRTAK MITMAVWGVSLLVILPIMIYAGLRSNQWGRSSCTINWPGESGAWYTGFIIYTFILGFL VPLTIICLCYLFIIIKVKSSGIRVGSSKRKKSEKKVTRMVSIVVAVFIFCWLPFYIFN VSSVSMAISPTPALKGMFDFVVVLTYANSCANPILYAFLSDNFKKSFQNVLCLVKVSG TDDGERSDSKQDKSRLNETTETQRTLLNGDLQTSI" BASE COUNT 307 a 375 c 333 g 336 t ORIGIN 1 ggatccttgg cctccagggt ccattaaggt gagaataaga tctctgggct ggctggaact 61 agcctaagac tgaaaagcag ccatggacat ggcggatgag ccactcaatg gaagccacac 121 atggctatcc attccatttg acctcaatgg ctctgtggtg tcaaccaaca cctcaaacca 181 gacagagccg tactatgacc tgacaagcaa tgcagtcctc acattcatct attttgtggt 241 ctgcatcatt gggttgtgtg gcaacacact tgtcatttat gtcatcctcc gctatgccaa 301 gatgaagacc atcaccaaca tttacatcct caacctggcc atcgcagatg agctcttcat 361 gctgggtctg cctttcttgg ctatgcaggt ggctctggtc cactggccct ttggcaaggc 421 catttgccgg gtggtcatga ctgtggatgg catcaatcag ttcaccagca tcttctgcct 481 gacagtcatg agcatcgacc gatacctggc tgtggtccac cccatcaagt cggccaagtg 541 gaggagaccc cggacggcca agatgatcac catggctgtg tggggagtct ctctgctggt 601 catcttgccc atcatgatat atgctgggct ccggagcaac cagtggggga gaagcagctg 661 caccatcaac tggccaggtg aatctggggc ttggtacaca gggttcatca tctacacttt 721 cattctgggg ttcctggtac ccctcaccat catctgtctt tgctacctgt tcattatcat 781 caaggtgaag tcctctggaa tccgagtggg ctcctctaag aggaagaagt ctgagaagaa 841 ggtcacccga atggtgtcca tcgtggtggc tgtcttcatc ttctgctggc ttcccttcta 901 catattcaac gtttcttccg tctccatggc catcagcccc accccagccc ttaaaggcat 961 gtttgacttt gtggtggtcc tcacctatgc taacagctgt gccaacccta tcctatatgc 1021 cttcttgtct gacaacttca agaagagctt ccagaatgtc ctctgcttgg tcaaggtgag 1081 cggcacagat gatggggagc ggagtgacag taagcaggac aaatcccggc tgaatgagac 1141 cacggagacc cagaggaccc tcctcaatgg agacctccaa accagtatct gaactgcttg 1201 gggggtggga aagaaccaag ccatgctctg tctactggca atgggctccc tacccacact 1261 ggcttcctgc ctcccacccc tcacacctgg cttctagaat agaggattgc tcagcatgag 1321 tccaattaga gaacggtgtt tgagtcagct t // LOCUS HUMSST28A 1285 bp DNA PRI 16-AUG-1994 DEFINITION Human somatostatin receptor (SST) gene, complete cds. ACCESSION L14865 NID g431094 KEYWORDS somatostatin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1285) AUTHORS Panetta,R., Greenwood,M.T., Warszynska,A., Demchyshyn,L.L., Day,R., Niznik,H.B., Srikant,C.B. and Patel,Y.C. TITLE Molecular cloning, functional characterization, and chromosomal localization of a human somatostatin receptor (somatostatin receptor type 5) with preferential affinity for somatostatin-28 JOURNAL Mol. Pharmacol. 45 (3), 417-427 (1994) MEDLINE 94195267 FEATURES Location/Qualifiers source 1..1285 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /map="3q28" gene 130..1221 /gene="SST" CDS 130..1221 /gene="SST" /codon_start=1 /product="somatostatin receptor" /db_xref="PID:g431095" /translation="MEPLFPASTPSWNASSPGAASGGGDNRTLVGPAPSAGARAVLVP VLYLLVCAAGLGGNTLVIYVVLRFAKMKTVTNIYILNLAVADVLYMLGLPFLATQNAA SFWPFGPVLCRLVMTLDGVNQFTSVFCLTVMSVDRYLAVVHPLSSARWRRPRVAKLAS AAAWVLSLCMSLPLLVFADVQEGGTCNASWPEPVGLWGAVFIIYTAVLGFFAPLLVIC LCYLLIVVKVRAAGVRVGCVRRRSERKVTRMVLVVVLVFAGCWLPFFTVNIVNLAVAL PQEPASAGLYFFVVILSYANSCANPVLYGFLSDNFRQSFQKVLCLRKGSGAKDADATE PRPDRIRQQQEATRPRTAAANGLMQTSKL" BASE COUNT 167 a 436 c 432 g 250 t ORIGIN 1 ttaccggtga tcggctctgg caccgccctg ggccagagaa ggaatgcctg cagtgtctgg 61 ttcaggactc accaccctgg cgtcctccct tcttctcttg cagagcctga cgcaccccag 121 gctgccgcca tggagcccct gttcccagcc tccacgccca gctggaacgc ctcctccccg 181 ggggctgcct ctggaggcgg tgacaacagg acgctggtgg ggccggcgcc ctcggcaggg 241 gcccgggcgg tgctggtgcc cgtgctgtac ctgctggtgt gtgcggccgg gctgggcggg 301 aacacgctgg tcatctacgt ggtgctgcgg ttcgccaaga tgaagaccgt caccaacatc 361 tacattctca acctggcagt ggccgacgtc ctgtacatgc tggggctgcc tttcctggcc 421 acgcagaacg ccgcgtcctt ctggcccttc ggccccgtcc tgtgccgcct ggtcatgacg 481 ctggacggcg tcaaccagtt caccagtgtc ttctgcctga cagtcatgag cgtggaccgc 541 tacctggcag tggtgcaccc gctgagctcg gcccgctggc gccgcccgcg tgtggccaag 601 ctggcgagcg ccgccgcctg ggtcctgtct ctgtgcatgt cgctgccgct cttggtgttc 661 gcggacgtgc aggagggcgg tacctgcaac gccagctggc cggagcccgt ggggctgtgg 721 ggcgccgtct tcatcatcta cacggccgtg ctgggcttct tcgcgccgct gctggtcatc 781 tgcctgtgct acctgctcat cgtggtgaag gtgagggcgg cgggcgtgcg cgtgggctgc 841 gtgcggcggc gctcggagcg gaaggtgacg cgcatggtgt tggtggtggt gctggtgttt 901 gcgggatgtt ggctgccctt cttcaccgtc aacatcgtca acctggcggt tgcgctgccc 961 caggagcccg cctccgccgg cctctacttc ttcgtggtca tcctctccta cgccaacagc 1021 tgtgccaacc ccgtcctcta cggcttcctc tcggacaact tccgccagag cttccagaag 1081 gttctgtgcc tccgcaaggg ctctggtgcc aaggacgctg acgccacgga gccgcgtcca 1141 gacaggatcc ggcagcagca ggaggccacg cgcccgcgca ccgccgcagc caacgggctt 1201 atgcagacca gcaagctgtg agagtgcagg cggggggtgg gcggccccgt gtcaccccca 1261 ggagtcggag gttgcactgc ggtga // LOCUS HUMSSTR3X 1413 bp DNA PRI 13-JAN-1995 DEFINITION Human somatostatin receptor subtype 3 (SSTR3) gene, complete cds. ACCESSION M96738 NID g338498 KEYWORDS somatostatin receptor. SOURCE Homo sapiens (tissue library: Stratagene #946203) male placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1413) AUTHORS Yamada,Y., Reisine,T., Law,S.F., Ihara,Y., Kubota,A., Kagimoto,S., Seino,M., Seino,Y., Bell,G.I. and Seino,S. TITLE Somatostatin receptors, an expanding gene family: cloning and functional characterization of human SSTR3, a protein coupled to adenylyl cyclase JOURNAL Mol. Endocrinol. 6 (12), 2136-2142 (1992) MEDLINE 93149123 FEATURES Location/Qualifiers source 1..1413 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="placenta" /tissue_lib="Stratagene #946203" gene 98..1354 /gene="SSTR3" CDS 98..1354 /gene="SSTR3" /codon_start=1 /product="somatostatin receptor subtype 3" /db_xref="PID:g338499" /translation="MDMLHPSSVSTTSEPENASSAWPPDATLGNVSAGPSPAGLAVSG VLIPLVYLVVCVVGLLGNSLVIYVVLRHTASPSVTNVYILNLALADELFMLGLPFLAA QNALSYWPFGSLMCRLVMAVDGINQFTSIFCLTVMSVDRYLAVVHPTRSARWRTAPVA RTVSAAVWVASAVVVLPVVVFSGVPRGMSTCHMQWPEPAAAWRAGFIIYTAALGFFGP LLVICLCYLLIVVKVRSAGRRVWAPSCQRRRRSERRVTRMVVAVVALFVLCWMPFYVL NIVNVVCPLPEEPAFFGLYFLVVALPYANSCANPILYGFLSYRFKQGFRRVLLRPSRR VRSQEPTVGPPEKTEEEDEEEEDGEESREGGKGKEMNGRVSQITQPGTSGQERPPSRV ASKEQQLLPQEASTGEKSSTMRISYL" BASE COUNT 218 a 464 c 467 g 264 t ORIGIN 1 atgggagggg gcagcacaga gaaagccatt ctctgctgtg accgagctgt ttttccttcc 61 cccaggcaaa tgactgctga ccaccctccc ctcagccatg gacatgcttc atccatcatc 121 ggtgtccacg acctcagaac ctgagaatgc ctcctcggcc tggcccccag atgccaccct 181 gggcaacgtg tcggcgggcc caagcccggc agggctggcc gtcagtggcg ttctgatccc 241 cctggtctac ctggtggtgt gcgtggtggg cctgctgggt aactcgctgg tcatctatgt 301 ggtcctgcgg cacacggcca gcccttcagt caccaacgtc tacatcctca acctggcgct 361 ggccgacgag ctcttcatgc tggggctgcc cttcctggcc gcccagaacg ccctgtccta 421 ctggcccttc ggctccctca tgtgccgcct ggtcatggcg gtggatggca tcaaccagtt 481 caccagcata ttctgcctga ctgtcatgag cgtggaccgc tacctggccg tggtacatcc 541 cacccgctcg gcccgctggc gcacagctcc ggtggcccgc acggtcagcg cggctgtgtg 601 ggtggcctca gccgtggtgg tgctgcccgt ggtggtcttc tcgggagtgc cccgcggcat 661 gagcacctgc cacatgcagt ggcccgagcc ggcggcggcc tggcgagccg gcttcatcat 721 ctacacggcc gcactgggct tcttcgggcc gctgctggtc atctgcctct gctacctgct 781 catcgtggtg aaggtgcgct cagctgggcg ccgggtgtgg gcaccctcgt gccagcggcg 841 ccggcgctcc gaacgcaggg tcacgcgcat ggtggtggcc gtggtggcgc tcttcgtgct 901 ctgctggatg cccttctacg tgctcaacat cgtcaacgtg gtgtgcccac tgcccgagga 961 gcctgccttc tttgggctct acttcctggt ggtggcgctg ccctatgcca acagctgtgc 1021 caaccccatc ctttatggct tcctctccta ccgcttcaag cagggcttcc gcagggtcct 1081 gctgcggccc tcccgccgtg tgcgcagcca ggagcccact gtggggcccc cggagaagac 1141 tgaggaggag gatgaggagg aggaggatgg ggaggagagc agggaggggg gcaaggggaa 1201 ggagatgaac ggccgggtca gccagatcac gcagcctggc accagcgggc aggagcggcc 1261 gcccagcaga gtggccagca aggagcagca gctcctaccc caagaggctt ccactgggga 1321 gaagtccagc acgatgcgca tcagctacct gtaggggcct ggggaaagcc aggatggccc 1381 gaggaagagg cagaagccgt gggtgtgcct agg // LOCUS HUMTAL 362 bp DNA PRI 13-JAN-1995 DEFINITION H.sapiens Tal2 (TAL2) gene, complete cds. ACCESSION M81078 NID g292707 KEYWORDS T-cell acute lymphoblastic leukemia; helix-loop-helix DNA binding protein. SOURCE Homo sapiens (tissue library: RPMI-8402) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 362) AUTHORS Xia,Y., Brown,L., Yang,C.Y., Tsan,J.T., Siciliano,M.J., Espinosa R,I.I.I., Le Beau,M.M. and Baer,R.J. TITLE TAL2, a helix-loop-helix gene activated by the (7;9)(q34;q32) translocation in human T-cell leukemia JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (24), 11416-11420 (1991) MEDLINE 92107961 FEATURES Location/Qualifiers source 1..362 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_lib="RPMI-8402" /map="Unassigned" gene 36..362 /gene="TAL2" CDS 36..362 /gene="TAL2" /codon_start=1 /db_xref="GDB:G00-128-434" /product="Tal2" /db_xref="PID:g292708" /translation="MTRKIFTNTRERWRQQNVNSAFAKLRKLIPTHPPDKKLSKNETL RLAMRYINFLVKVLGEQSLQQTGVAAQGNILGLFPQGPHLPGLEDRTLLENYQVPSPG PSHHIP" BASE COUNT 98 a 105 c 88 g 71 t ORIGIN chromosome 9 band q31-32. 1 agggcccttt ctctttccat ctcaggaact caaacatgac caggaagatc ttcacaaata 61 ccagggagcg gtggaggcag cagaatgtca acagcgcctt tgccaagctg aggaagctca 121 tccccactca ccctccagac aaaaagctga gcaaaaatga aacgcttcgc ctggcaatga 181 ggtatatcaa cttcttggtc aaggtcttgg gggagcaaag cctgcaacaa acgggagtgg 241 ctgctcaggg gaacattctg gggctcttcc ctcaaggacc ccacctgcca ggcctggagg 301 acagaactct gcttgagaac taccaggttc cttcacctgg tccaagccac cacattcctt 361 ag // LOCUS HUMTEF1 4443 bp DNA PRI 23-MAY-1996 DEFINITION Homo sapiens transcriptional enhancer factor (TEF1) DNA, complete CDS. ACCESSION M63896 NID g339440 KEYWORDS trans-acting transcriptional activator; transcription enhancer. SOURCE Homo sapiens (tissue library: ZAP-II random primed cDNA) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4443) AUTHORS Xiao,J.H., Davidson,I., Matthes,H., Garnier,J.M. and Chambon,P. TITLE Cloning, expression, and transcriptional properties of the human enhancer factor TEF-1 JOURNAL Cell 65 (4), 551-568 (1991) MEDLINE 91235292 FEATURES Location/Qualifiers source 1..4443 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="HeLa" /tissue_lib="ZAP-II random primed cDNA" 5'UTR 541..585 /gene="TEF-1" gene 541..3528 /gene="TEF-1" CDS 586..1821 /gene="TEF-1" /codon_start=1 /evidence=experimental /product="transcription enhancer factor" /db_xref="PID:g339441" /translation="MERMSDSADKPIDNDAEGVWSPDIEQSFQEALAIYPPCGRRKII LSDEGKMYGRNELIARYIKLRTGKTRTRKQVSSHIQVLARRKSRDFHSKLKDQTAKDK ALQHMAAMSSAQIVSATAIHNKLGLPGIPRPTFPGAPGFWPGMIQTGQPGSSQDVKPF VQQAYPIQPAVTAPIPGFEPASAPAPSVPAWQGRSIGTTKLRLVEFSAFLEQQRDPDS YNKHLFVHIGHANHSYSDPLLESVDIRQIYDKFPEKKGGLKELFGKGPQNAFFLVKFW ADLNCNIQDDAGAFYGVTSQYESSENMTVTCSTKVCSFGKQVVEKVETEYARFENGRF VYRINRSPMCEYMINFIHKLKHLPEKYMMNSVLENFTILLVVTNRDTQETLLCMACVF EVSNSEHGAQHHIYRLVKD" polyA_signal 2262..2268 /gene="TEF-1" /note="potential; putative" polyA_signal 2514..2519 /gene="TEF-1" /note="potential; putative" polyA_signal 2581..2586 /gene="TEF-1" /note="potential; putative" polyA_signal 2759..2764 /gene="TEF-1" /note="potential; putative" polyA_signal 2884..2890 /gene="TEF-1" /note="potential; putative" polyA_signal 3523..3528 /gene="TEF-1" /note="potential; putative" BASE COUNT 1226 a 1040 c 969 g 1208 t ORIGIN 1 cgccgcccgc cgcgggcgcc caccaagcac tttgcagact cgcttccacc ctgcgggcca 61 ttccgcgcgg cggggcccgg gcccggggcg gccgcgtcca ggcacaggcc atgcagtgac 121 gcccccccac ccctccacct ttgcccggac ggcgggcagc agcccagcgc gccagccggc 181 cccggggcag gagcggtgct aggcaggggt ggggtggccg ggcccaggga ccgggagccg 241 gggagggagc cgggcaccga gcagagggcg ggggaagcgg cgccgaagtt tgcctcggac 301 tcgccgggcg ctgcggtggc tccctgggcc gaggactgtt gctgccgctg ccgccgccgc 361 ttcattgcac attcaagtgg aaaattttca ggagtcagca gaaacattgt gtccaaaaaa 421 gactgagtcg cagttaccac caaacccagg aggagactct ccctggaaaa cttcccttcc 481 ctttcggttt attttcttga aaaggctcca ggcttcggct tggaaaatcc caccgccaaa 541 attgagccca gcagctggag cggcagtgag agccctgccg aaaacatgga aaggatgagt 601 gactctgcag ataagccaat tgacaatgat gcagaagggg tctggagccc cgacatcgag 661 caaagctttc aggaggccct ggctatctat ccaccatgtg ggaggaggaa aatcatctta 721 tcagacgaag gcaaaatgta tggtaggaat gaattgatag ccagatacat caaactcagg 781 acaggcaaga cgaggaccag aaaacaggtg tctagtcaca ttcaggttct tgccagaagg 841 aaatctcgtg attttcattc caagctaaag gatcagactg caaaggataa ggccctgcag 901 cacatggcgg ccatgtcctc agcccagatc gtctcggcca ctgccattca taacaagctg 961 gggctgcctg ggattccacg cccgaccttc ccaggggcgc cggggttctg gccgggaatg 1021 attcaaacag ggcagccagg atcctcacaa gacgtcaagc cttttgtgca gcaggcctac 1081 cccatccagc cagcggtcac agcccccatt ccagggtttg agcctgcatc ggccccagct 1141 ccctcagtcc ctgcctggca aggtcgctcc attggcacaa ccaagcttcg cctggtggaa 1201 ttttcagctt ttctcgagca gcagcgagac ccagactcgt acaacaaaca cctcttcgtg 1261 cacattgggc atgccaacca ttcttacagt gacccattgc ttgaatcagt ggacattcgt 1321 cagatttatg acaaatttcc tgaaaagaaa ggtggcttaa aggaactgtt tggaaagggc 1381 cctcaaaatg ccttcttcct cgtaaaattc tgggctgatt taaactgcaa tattcaagat 1441 gatgctgggg ctttttatgg tgtaaccagt cagtacgaga gttctgaaaa tatgacagtc 1501 acctgttcca ccaaagtttg ctcctttggg aagcaagtag tagaaaaagt agagacggag 1561 tatgcaaggt ttgagaatgg ccgatttgta taccgaataa accgctcccc aatgtgtgaa 1621 tatatgatca acttcatcca caagctcaaa cacttaccag agaaatatat gatgaacagt 1681 gttttggaaa acttcacaat tttattggtg gtaacaaaca gggatacaca agaaactcta 1741 ctctgcatgg cctgtgtgtt tgaagtttca aatagtgaac acggagcaca acatcatatt 1801 tacaggcttg taaaggactg aacatggtta tttatatata tagatatctg tatatacaca 1861 cacacatatg tgcacacaca cactctctct ccattatcga acgactgact gtaaacctca 1921 ccacacaggg tggtgccctg gccccgaggt caccccgact tttctaaatc ttgtttgagt 1981 gaagtcattt tttcatgtgt tcatactatc attgtagctg tgaagttctg gtacagttgt 2041 aaaaagagaa attgagttgt ttctctatgt tcttcagatg tgcagcccac aattcctcgg 2101 gaaaggtgaa cctgaacaac ccaagtctct ctctgcagag ccctgtttct aattgtggta 2161 gaaaatattg agacagagca tttgccatgg gacatttaca gcctttatac aaatgtattt 2221 agttctcttt tttccaacat aaaattcttg ttttaagata caagtaaaat taatctttaa 2281 atataaatgt aaattagtac acaaaactaa gaatctttag acttatcttt gtaactaatt 2341 agggtggaag ttatgaaaga atgtaattca ctaaattatt ttttaaatga aacctttttt 2401 tttctttttg aaaccaaatg ttaaactata gccttaagaa atgcttggta gaagtgtcct 2461 aatgagacaa atttgtactt ttatcctcaa ggttaacact aatctcctaa tccattaaac 2521 tcttgaacag gtattacaaa ggaagaaaac ttcacccctt atccttaaca tatatagtat 2581 atttaaaaaa tataaaattg tattgtacta atgtgatgat ggattattta atgaaaaaga 2641 aaaaatggct ctttttgcaa taagtagata catactgaaa aaatctaaac ttacaatgtt 2701 tatagtcttg tgtgtgcagt tatattttat atggacgacc aaatttttta ttaagatgag 2761 taaatatttg aaccactgaa ttttaataac aaaattttaa aattggcatg aatacggaat 2821 actgcactgt gagatgcaaa gtatacagaa tctgtggctg ggagaaaatt tcatcaaata 2881 gacaagtaaa aggctcatca gttttagcat ctctgctccc cagaaaattg taagcatcct 2941 caccagcctg tggatacatt ctttatttct agtgacccaa tatgcatatt aacctgctat 3001 aactagggct atatgtgtag gtatgtgtat acatatacac aaatgcacat atagagttaa 3061 cacatttagt gaacacttgt ttagtgtcac tcagtttgct aggtgctgat atgtacgtat 3121 atctcaatgt gtctgtagac ttagatacat cctcttgaag cacatccatt tctttagcgt 3181 ctctcagtaa gttacagtac ttgtttgact taggtttaag aggcccagct acctatctct 3241 gaccttttca aataggctca tttgggagat tcttttgcca ggagagattc aactttccaa 3301 tctaagtatt ccagagcatt gcccaggcag agttggtttg atgtggccag atgttttgag 3361 ttatttccct taagtgtttc actggggaga gaacagggag tgctcctcca gcttcccaaa 3421 gaaatatgtt tttgtaagtg gtaggaacat gtgcacacaa tagaacatga aataagtttt 3481 ttaacttgta aaacatgtca agatttttcc accaagctag aaaataaaaa acttagttct 3541 accacatcca attaacttac acaccccctt ccctgtctca acacctgctt tgaccctgct 3601 tttctattat tacatcagtc agcatcttgt ggtccctaac atgaggatgt ggctggctcg 3661 tgggaaacag caaaacacta agcctgacct ctcccaaatt gggaagacca gaggagaaag 3721 tgcaaaactg tccccatttg gaatgcccat tccttctaga aaccagttgg acagtgctcc 3781 tctgcccttc ataaacagac tactgttggg tccctgattc caggctggcc tgtgaaggat 3841 tgccccaggt gtcccctttc acggttgtca catttacagt gacttctgtt gaacacccct 3901 cttagggatg tttcttttgc tcttatttcc tgcatctttc caattgggaa gccccatcct 3961 ctcccaggac caggagttta tgaccaggcg agcacaaatg gctaaaagca agctgtccta 4021 gaacttcagt gggagagctg tctggttcat attctaccca ggaatggtac ttttcagtgc 4081 agccaggagg gctcttggga tttcctttcc aaagcacaaa aatactggga cccaagaaga 4141 acagctagag gacaactctg ttggcacaga gacggggaca gcccagtctg ctgacctcac 4201 agggtcagtg ggcccccctg gtgcttcacc acctgcatcc tcttgctcag aatgcctttg 4261 cagttgagtt ttctgggttt ctatgattga ccttgaggtt tactccttgc tcttacaaca 4321 tttctaagga tttttaaaag tttacttctt gtcttgttct tctaaagctt tctccaggac 4381 agatattttc cctgtcttaa ccactggtcc agtcatccca gtgggcttct ctttgtctct 4441 ccc // LOCUS HUMTHMA 4212 bp DNA PRI 14-JAN-1995 DEFINITION Human thrombomodulin gene, complete cds. ACCESSION J02973 NID g339658 KEYWORDS thrombomodulin. SOURCE Human, cDNA to mRNA, clones lambda-gt11-TM[1-3], and DNA, (library of E.Frisch),. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4212) AUTHORS Jackman,R.W., Beeler,D.L., Fritze,L., Soff,G. and Rosenberg,R.D. TITLE Human thrombomodulin gene is intron depleted: nucleic acid sequences of the cDNA and gene predict protein structure and suggest sites of regulatory control JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (18), 6425-6429 (1987) MEDLINE 87317665 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by R.W.Jackman, 11/23/87. FEATURES Location/Qualifiers source 1..4212 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20p12-cen" mRNA <1..4050 /note="TM mRNA" sig_peptide 542..589 /gene="THBD" /note="thrombomodulin signal peptide" gene 542..2269 /gene="THBD" CDS 542..2269 /gene="THBD" /note="thrombomodulin precursor" /codon_start=1 /db_xref="GDB:G00-119-613" /db_xref="PID:g339659" /translation="MLGVLVLGALALAGLGFPAPAEPQPGGSQCVEHDCFALYPGPAT FLNASQICDGLRGHLMTVRSSVAADVISLLLNGDGGVGRRRLWIGLQLPPGCGDPKRL GPLRGFQWVTGDNNTSYSRWARLDLNGAPLCGPLCVAVSAAEATVPSEPIWEEQQCEV KADGFLCEFHFPATCRPLAVEPGAAAAAVSITYGTPFAARGADFQALPVGSSAAVAPL GLQLMCTAPPGAVQGHWAREAPGAWDCSVENGGCEHACNAIPGAPRCQCPAGAALQAD GRSCTASATQSCNDLCEHFCVPNPDQPGSYSCMCETGYRLAADQHRCEDVDDCILEPS PCPQRCVNTQGGFECHCYPNYDLVDGECVEPVDPCFRANCEYQCQPLNQTSYLCVCAE GFAPIPHEPHRCQMFCNQTACPADCDPNTQASCECPEGYILDDGFICTDIDECENGGF CSGVCHNLPGTFECICGPDSALARHIGTDCDSGKVDGGDSGSGEPPPSPTPGSTLTPP AVGLVHSGLLIGISIASLCLVVALLALLCHLRKKQGAARAKMEYKCAAPSKEVVLQHV RTERTPQRL" mat_peptide 590..2266 /gene="THBD" /note="thrombomodulin" BASE COUNT 868 a 1233 c 1141 g 970 t ORIGIN 3310 bp upstream of HindIII site. 1 cttgcaatcc aggctttcct tggaagtggc tgtaacatgt atgaaaagaa agaaaggagg 61 accaagagat gaaagagggc tgcacgcgtg ggggcccgag tggtgggcgg ggacagtcgt 121 cttgttacag gggtgctggc cttccctggc gcctgcccct gtcggccccg cccgagaacc 181 tccctgcgcc agggcagggt ttactcatcc cggcgaggtg atcccatgcg cgagggcggg 241 cgcaagggcg gccagagaac ccagcaatcc gagtatgcgg catcagccct tcccaccagg 301 cacttccttc cttttcccga acgtccaggg agggagggcc gggcacttat aaactcgagc 361 cctggccgat ccgcatgtca gaggctgcct cgcaggggct gcgcgcacgg caagaagtgt 421 ctgggctggg acggacagga gaggctgtcg ccatcggcgt cctgtgcccc tctgctccgg 481 cacggccctg tcgcagtgcc cgcgctttcc ccggcgcctg cacgcggcgc gcctgggtaa 541 catgcttggg gtcctggtcc ttggcgcgct ggccctggcc ggcctggggt tccccgcacc 601 cgcagagccg cagccgggtg gcagccagtg cgtcgagcac gactgcttcg cgctctaccc 661 gggccccgcg accttcctca atgccagtca gatctgcgac ggactgcggg gccacctaat 721 gacagtgcgc tcctcggtgg ctgccgatgt catttccttg ctactgaacg gcgacggcgg 781 cgttggccgc cggcgcctct ggatcggcct gcagctgcca cccggctgcg gcgaccccaa 841 gcgcctcggg cccctgcgcg gcttccagtg ggttacggga gacaacaaca ccagctatag 901 caggtgggca cggctcgacc tcaatggggc tcccctctgc ggcccgttgt gcgtcgctgt 961 ctccgctgct gaggccactg tgcccagcga gccgatctgg gaggagcagc agtgcgaagt 1021 gaaggccgat ggcttcctct gcgagttcca cttcccagcc acctgcaggc cactggctgt 1081 ggagcccggc gccgcggctg ccgccgtctc gatcacctac ggcaccccgt tcgcggcccg 1141 cggagcggac ttccaggcgc tgccggtggg cagctccgcc gcggtggctc ccctcggctt 1201 acagctaatg tgcaccgcgc cgcccggagc ggtccagggg cactgggcca gggaggcgcc 1261 gggcgcttgg gactgcagcg tggagaacgg cggctgcgag cacgcgtgca atgcgatccc 1321 tggggctccc cgctgccagt gcccagccgg cgccgccctg caggcagacg ggcgctcctg 1381 caccgcatcc gcgacgcagt cctgcaacga cctctgcgag cacttctgcg ttcccaaccc 1441 cgaccagccg ggctcctact cgtgcatgtg cgagaccggc taccggctgg cggccgacca 1501 acaccggtgc gaggacgtgg atgactgcat actggagccc agtccgtgtc cgcagcgctg 1561 tgtcaacaca cagggtggct tcgagtgcca ctgctaccct aactacgacc tggtggacgg 1621 cgagtgtgtg gagcccgtgg acccgtgctt cagagccaac tgcgagtacc agtgccagcc 1681 cctgaaccaa actagctacc tctgcgtctg cgccgagggc ttcgcgccca ttccccacga 1741 gccgcacagg tgccagatgt tttgcaacca gactgcctgt ccagccgact gcgaccccaa 1801 cacccaggct agctgtgagt gccctgaagg ctacatcctg gacgacggtt tcatctgcac 1861 ggacatcgac gagtgcgaaa acggcggctt ctgctccggg gtgtgccaca acctccccgg 1921 taccttcgag tgcatctgcg ggcccgactc ggcccttgcc cgccacattg gcaccgactg 1981 tgactccggc aaggtggacg gtggcgacag cggctctggc gagcccccgc ccagcccgac 2041 gcccggctcc accttgactc ctccggccgt ggggctcgtg cattcgggct tgctcatagg 2101 catctccatc gcgagcctgt gcctggtggt ggcgcttttg gcgctcctct gccacctgcg 2161 caagaagcag ggcgccgcca gggccaagat ggagtacaag tgcgcggccc cttccaagga 2221 ggtagtgctg cagcacgtgc ggaccgagcg gacgccgcag agactctgag cggcctccgt 2281 ccaggagcct ggctccgtcc aggagctgtg cctcctcacc cccagctttg ctaccaaagc 2341 accttagctg gcattacagc tggagaagac cctccccgca ccccccaagc tgttttcttc 2401 tattccatgg ctaactggcg agggggtgat tagagggagg agaatgagcc tcggcctctt 2461 ccgtgacgtc actggaccac tgggcaatga tggcaatttt gtaacgaaga cacagactgc 2521 gatttgtccc aggtcctcac taccgggcgc aggagggtga gcgttattgg tcggcagcct 2581 tctgggcaga ccttgacctc gtgggctagg gatgactaaa atatttattt tttttaagta 2641 tttaggtttt tgtttgtttc ctttgttctt acctgtatgt ctccagtatc cactttgcac 2701 agctctccgg tctctctctc tctacaaact cccacttgtc atgtgacagg taaactatct 2761 tggtgaattt ttttttccta gccctctcac atttatgaag caagccccac ttattcccca 2821 ttcttcctag ttttctcctc ccaggaactg ggccaactca cctgagtcac cctacctgtg 2881 cctgacccta cttcttttgc tcatctagct gtctgctcag acagaacccc tacatgaaac 2941 agaaacaaaa acactaaaaa taaaaatggc catttgcttt ttcaccagat ttgctaattt 3001 atcctgaaat ttcagattcc cagagcaaaa taattttaaa caaagggttg agatgtaaaa 3061 ggtattaaat tgatgttgct ggactgtcat agaaattaca cccaaagagg tatttatctt 3121 tacttttaaa cagtgagcct gaattttgtt gctgttttga tttgtactga aaaatggtaa 3181 ttgttgctaa tcttcttatg caatttcctt ttttgttatt attacttatt tttgacagtg 3241 ttgaaaatgt tcagaaggtt gctctagatt gagagaagag acaaacacct cccaggagac 3301 agttcaagaa agcttcaaac tgcatgattc atgccaatta gcaattgact gtcactgttc 3361 cttgtcactg gtagaccaaa ataaaaccag ctctactggt cttgtggaat tgggagcttg 3421 ggaatggatc ctggaggatg cccaattagg gcctagcctt aatcaggtcc tcagagaatt 3481 tctaccattt cagagaggcc ttttggaatg tggcccctga acaagaattg gaagctgccc 3541 tgcccatggg agctggttag aaatgcagaa tcctaggctc caccccatcc agttcatgag 3601 aatctatatt taacaagatc tgcagggggt gtgtctgctc agtaatttga ggacaaccat 3661 tccagactgc ttccaatttt ctggaataca tgaaatatag atcagttata agtagcaggc 3721 caagtcaggc ccttattttc aagaaactga ggaattttct ttgtgtagct ttgctctttg 3781 gtagaaaagg ctaggtacac agctctagac actgccacac agggtctgca aggtctttgg 3841 ttcagctaag ctaggaatga aatcctgctt cagtgtatgg aaataaatgt atcatagaaa 3901 tgtaactttt gtaagacaaa ggttttcctc ttctattttg taaactcaaa atatttgtac 3961 atagttattt atttattgga gataatctag aacacaggca aaatccttgc ttatgacatc 4021 acttgtacaa aataaacaaa taacaatgtg ctctcggtgt gtgtctgttc acttttcctc 4081 cctcagtgcc ctgcatttat gtcattaaat gcgggctcac aaaccatgca aatgctatga 4141 gatgcatgga gggctgccct gtaccccagc acttctattg tctggtgatg gcaccatctc 4201 tgatctttca aa // LOCUS HUMTRIP9G 1940 bp DNA PRI 15-MAR-1995 DEFINITION Homo sapiens thyroid receptor interactor (TRIP9) gene, complete cds. ACCESSION L40407 NID g703117 KEYWORDS TRIP9 gene; thyroid receptor interactor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1940) AUTHORS Lee,J.W., Choi,H.S., Gyuris,J., Brent,R. and Moore,D.D. TITLE Two classes of proteins dependent on either the presence or absence of thyroid hormone for interaction with the thyroid hormone receptor JOURNAL Mol. Endocrinol. 9 (2), 243-254 (1995) MEDLINE 95295737 COMMENT Trip9 was isolated as interacting with the thyroid hormone receptor in the yeast 2-hybrid system. Interaction is dependent on the presence of hormone. Submitted sequence is full length or nearly full length at the 5' end and includes the apparently complete protein coding region. The fusion junction in the original yeast 2-hybrid isolate is at position 569. Trip9 includes 6 copies of the motif commonly referred to as the ankyrin repeat. Two mRNAs of approximate 1.8 and 2.8 kb are detected by Northern blotting. FEATURES Location/Qualifiers source 1..1940 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 53..1069 /gene="TRIP9" CDS 53..1069 /gene="TRIP9" /codon_start=1 /product="thyroid receptor interactor" /db_xref="PID:g703118" /translation="MAGVACLGKAADADEWCDTGLGSLGPDAAAPGGPGLGAELGPGL SWAPLVFGYVTEDGDTALHLAVIHQHEPFLDFLLGFSAGTEYMDLQNDLGQTALHLAA ILGETSTVEKLYAAGAGLCVAERRGHTALHLACRVGAHACARALLQPRPRRPREAPDT YLAQGPDRTPDTNHTPVALYPDSDLEKEEEESEEDWKLQLEAENYEGHTPLHVAVIHK DVEMVRLLRDAGADLDKPEPTCGRSPLHLAVEAQAADVLELLLRAGANPAARMYGGRT PLGSAMLRPNPILARLLRAHGAPEPEGEDEKSGPCSSSSDSDGGDEGVSQEERQGSPA GGSG" BASE COUNT 420 a 526 c 678 g 316 t ORIGIN 1 aaagccagct acaggcgggc gactgcgggg ggcccctgag gcggcggggg ccatggctgg 61 ggtcgcgtgc ttgggaaaag ctgccgacgc agatgaatgg tgcgacacgg gcctgggctc 121 cctgggtccg gacgcagcgg cccccggagg acctgggttg ggcgcggagt tgggcccggg 181 gctgtcgtgg gctcccctcg tcttcggcta cgtcactgag gatggggaca cggcactgca 241 cttggctgtg attcatcagc atgaaccctt cctggatttt cttctaggct tctcggccgg 301 cactgagtac atggacctgc agaatgacct aggccagaca gccctgcacc tggcagccat 361 cctgggggag acatccacgg tggagaagct gtacgcagca ggcgccgggc tgtgtgtggc 421 ggagcgtagg ggccacacgg cgctgcacct ggcctgccgt gtgggggcac acgcctgtgc 481 ccgtgccctg cttcagcccc gcccccggcg ccccagggaa gcccccgaca cctacctcgc 541 tcagggccct gaccgtactc ccgacaccaa ccatacccct gtcgccttgt accccgattc 601 cgacttggag aaggaagaag aggagagtga ggaggactgg aagctgcagc tggaggctga 661 aaactacgag ggccacaccc cactccacgt ggccgttatc cacaaagatg tggagatggt 721 ccggctgctc cgagatgctg gagctgacct tgacaaaccg gagcccacgt gcggccggag 781 cccccttcat ttggcagtgg aggcccaggc agccgatgtg ctggagcttc tcctgagggc 841 aggcgcgaac cctgctgccc gcatgtacgg tggccgcacc ccactcggca gtgccatgct 901 ccggcccaac cccatcctcg cccgcctcct ccgtgcacac ggagcccctg agcccgaggg 961 cgaggatgag aaatccggcc cctgcagcag cagtagcgac agcgacggcg gagacgaggg 1021 cgtgagtcag gaggagagac agggcagccc agctgggggg tcaggataga ccggcagcaa 1081 gaagcccaag aagataatta ggcaccgacc ttgggctgct gttagagaac tcaggcggca 1141 cgccagtgac acggggcact agtcaggaga gacctggaca ggggtggtgg gaagagcttg 1201 ggcagaagtg gctgaaaaac taaggcagtg gcaaaggtag aactcaggca ggggtggaga 1261 aaagcgttgg tcgcagtgat tggtgaacac agcgggggtg ggtggtagcg ctgggggtga 1321 ttttaggcag caagaattgg agaactcaca ctgcgaaaag aaaaccttgg gtggcagtga 1381 tttgaacacc ggcagtgctg gggcaggacc cgagccagcg gtggggagag atatagtcag 1441 agaacccagc aatacagatc cgtccttggg caaggcgcgg tgctggatga tgggtgcgga 1501 ggaatttggg taaaggcaga gggaaggggt ggaggagggc cagctcagtt gccgaaactc 1561 tggagtggcg gctggctaga aattggtctg tagaaatgac cttgaaaatg gagttctggc 1621 caggtgcggt ggctcacgcc tgtaatccca gcactttggg aggccgaggc aggcagatca 1681 cgaggtcagg agttcgagac cagcctggcc aacatggcaa gaccctgtct ctactaaaaa 1741 tacaaaaatt agctgggcgt ggtggcgcat gcctataatc ccagctactt gggaggctga 1801 ggtaggagaa ttgcttgaac ctgggaggtg gaggttgcag tgaacctaga tcacgccact 1861 gcattccagc ctgggcaaca gagcgcatga tcagtcaacc gctcgaggga tcttccatag 1921 gatggtcaag acgcggacgt // LOCUS HUMUDPCNA 4705 bp DNA PRI 19-SEP-1995 DEFINITION Human alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase (MGAT) gene, complete cds. ACCESSION M61829 NID g340075 KEYWORDS alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltrae. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4705) AUTHORS Hull,E., Sarkar,M., Spruijt,M.P., Hoppener,J.W., Dunn,R. and Schachter,H. TITLE Organization and localization to chromosome 5 of the human UDP-N-acetylglucosamine:alpha-3-D-mannoside beta-1,2-N-acetylglucosaminyltransferase I gene JOURNAL Biochem. Biophys. Res. Commun. 176 (2), 608-615 (1991) MEDLINE 91222222 COMMENT From EMBL entry HSUDPCNA; dated 23-JUL-1991. FEATURES Location/Qualifiers source 1..4705 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5" mRNA 2056..4592 /partial /gene="MGAT" /note="G00-128-225" /product="alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase" exon 2056..4592 /gene="MGAT" /EC_number="2.4.1.101" /note="G00-128-225" /number=2 /product="alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase" gene 2056..4592 /gene="MGAT" CDS 2182..3519 /gene="MGAT" /EC_number="2.4.1.101" /codon_start=1 /db_xref="GDB:G00-128-225" /product="alpha-1,3-mannosyl-glycoprotein beta-1, 2-N-acetylglucosaminyltransferase" /db_xref="PID:g340076" /translation="MLKKQSAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALD GDPASLTREVIRLAQDAEVELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPA PAVIPILVIACDRSTVRRCLDKLLHYRPSAELFPIIVSQDCGHEETAQAIASYGSAVT HIRQPDLSSIAVPPDHRKFQGYYKIARHYRWALGQVFRQFRFPAAVVVEDDLEVAPDF FEYFRATYPLLKADPSLWCVSAWNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELW AELEPKWPKAFWDDWMRRPEQRQGRACIRPEISRTMTFGRKGVSHGQFFDQHLKFIKL NQQFVHFTQLDLSYLQREAYDRDFLARVYGAPQLQVEKVRTNDRKELGEVRVQYTGRD SFKAFAKALGVMDDLKSGVPRAGYRGIVTFQFRGRRVHLAPPPTWEGYDPSWN" BASE COUNT 958 a 1217 c 1234 g 1296 t ORIGIN 1 tgatttctgt aatatagatt agtttctatt ttcggcagtt ttatgtcaat ggaatcacag 61 tgtatgtgct cttttttgtt tttcccaaaa agacccatat tgtgtaatta tttggagatt 121 gtgttgtgta tgtcagtggt ttattcagtt ttattgctga atattattcc attctatgag 181 taaaccacaa tttgtttatc tgtatttccc tgttgatggg cctttgggtt atttccagtt 241 ttgagtgatt atgaataaag ttaccatcaa cactcctgtg taggtttttg tagggacatg 301 gctgaaagta gaaagaaatg gggcatgcag gagagactgg gcttggttat aacagcctgt 361 tatccctggg gaactagctc actcaccaga ggacgttagg aaggatctgc ccctgccccc 421 cttgtgatcc aaacacctac cactagaccc cacctccggc agcaccacca cacaggggat 481 caaatcccaa catgtgtttc agagaacaaa ccatattcaa gctatagcat agctacgtaa 541 ttattataac actattaata agaccatctt tgccttggga attttaagta gcacctttat 601 tacatattag atttttttgt gtaatgagtc tgtttctgga gcttaaattc tatttcactg 661 gatggtctgt ttatttatgt accagtacca cgcagttgta attatggagg ttttgtagta 721 tgccatatat atatatatat aataaaaaaa aatttttttg agacagagtc tcactctctt 781 gcccaggctg gagtgcaatg gcacaatcca atcacagctc actacagcct cagcctccca 841 gactcaggtg atcccatctc agcctcctga gtagctggga ccacaggtgc gtgccaccat 901 gcccagcttt tttttttttt ttttttaatt gttacttgta gagacagggt ctccgtgtgc 961 tgcccaggct attcttgaac tcctgggctc aagtgatcct cctgtattgg cctcccaaag 1021 tgctgggatt gcaggtagga gctactgcat ctggccagta tgcagtattt tcatatctgg 1081 tagggctgag tccctcctca cagcttattt ttattgtttt cctagttttt catgtgaact 1141 ttactgtcag ttcgtctagc tccataaaaa ggtattttta ttgggatttt tgatgatgct 1201 gaaatgtcct gaccaagaac aagggatatt cagatgttct tttgtgtctt tcaggaatga 1261 tagttttgtt cacataagtt ttgaatgttt aagtttattt aagtttattt ctaaatattt 1321 tctcatttct ctggcttttg taagtagggt tttctcatcc atgttttctt ctcatgagtt 1381 atttgtggat atgaaggcta tccattagta tatgttgatt tttatattac acttccttgc 1441 tcagttcatt attgattctt tttgagtttt ccaggcatat tctcacaagt aaagataata 1501 gaaatagttt gcttcctttc cacttctgct ttgaattttt ttttcttggt tcatttgcat 1561 tggctgcttc ctccagcaaa atgttaaata accctggaga tgatgggcaa cttcgttttg 1621 ctcctgacat tcgtggggtg cctctggtgc ttccctgttg gtaaggggtt aactgtagcc 1681 ctgaggtggg acatttgatt ttaaaaatca gtcatcttgg ggcgcttagg ttagaggaat 1741 ggtaggcaga tgctgtcact ccttgcccct cccctcctcc ttcccacctg gaggggaaat 1801 gaaatctgac aggtagaaag aggggagttg gggttctttt tctctctccc tccaccagca 1861 tcactctctg cctctccctc aaaaatacgt tcctgggtca ggatatatgt tgactcccta 1921 gagagctctg gagtcaacct cctggccttc ctccaccctc actcttggcc ttttcctgcc 1981 cccatttcct ctacctgtgg ggcatggagc cacgagcctt tgtgtgacgg tttgctttct 2041 ctctcctgtc tttaggtgca tggctgcctc ctaatcccat agtccagagg aggcatccct 2101 aggactgcgg gcaagggagc cgggcaagcc cagggcagcc ttgaaccgtc ccctggcctg 2161 ccctccccgg tgggggccag gatgctgaag aagcagtctg cagggcttgt gctgtggggc 2221 gctatcctct ttgtggcctg gaatgccctg ctgctcctct tcttctggac gcgcccagca 2281 cctggcaggc caccctcagt cagcgctctc gatggcgacc ccgccagcct cacccgggaa 2341 gtgattcgcc tggcccaaga cgccgaggtg gagctggagc ggcagcgtgg gctgctgcag 2401 cagatcgggg atgccctgtc gagccagcgg gggagggtgc ccaccgcggc ccctcccgcc 2461 cagccgcgtg tgcctgtgac ccccgcgccg gcggtgattc ccatcctggt catcgcctgt 2521 gaccgcagca ctgttcggcg ctgcctggac aagctgctgc attatcggcc ctcggctgag 2581 ctcttcccca tcatcgttag ccaggactgc gggcacgagg agacggccca ggccatcgcc 2641 tcctacggca gcgcggtcac gcacatccgg cagcccgacc tgagcagcat tgcggtgccg 2701 ccggaccacc gcaagttcca gggctactac aagatcgcgc gccactaccg ctgggcgctg 2761 ggccaggtct tccggcagtt tcgcttcccc gcggccgtgg tggtggagga tgacctggag 2821 gtggccccgg acttcttcga gtactttcgg gccacctatc cgctgctgaa ggccgacccc 2881 tccctgtggt gcgtctcggc ctggaatgac aacggcaagg agcagatggt ggacgccagc 2941 aggcctgagc tgctctaccg caccgacttt ttccctggcc tgggctggct gctgttggcc 3001 gagctctggg ctgagctgga gcccaagtgg ccaaaggcct tctgggacga ctggatgcgg 3061 cggccggagc agcggcaggg gcgggcctgc atacgccctg agatctcaag aacgatgacc 3121 tttggccgca agggtgtgag ccacgggcag ttctttgacc agcacctcaa gtttatcaag 3181 ctgaaccagc agtttgtgca cttcacccag ctggacctgt cttacctgca gcgggaggcc 3241 tatgaccgag atttcctcgc ccgcgtctac ggtgctcccc agctgcaggt ggagaaagtg 3301 aggaccaatg accggaagga gctgggggag gtgcgggtgc agtatacggg cagggacagc 3361 ttcaaggctt tcgccaaggc tctgggtgtc atggatgacc ttaagtcggg ggttccgaga 3421 gctggctacc ggggtattgt caccttccag ttccggggcc gccgtgtcca cctggcgccc 3481 ccaccgacgt gggagggcta tgatcctagc tggaattagc acctgcctgt ccttcctggg 3541 cccctccttg ccacatcatg agctgaggtg ggaccacagt ccccaggctg catcggcctg 3601 cctgtgtttc cctcttaggt gcatttatct ttttgatttt tccgagtggc atttaagtgc 3661 acaaatgata acaagaggat tattctcccg ttctcaaggg agtcagatca ggggaactat 3721 tctagggtat gttgcggggt attaagcagg aaaccactgt gtggtggggg gcactgggct 3781 tgttggggcc agaaatgtcc acgtcctgag ctttctcctg gagcatgtgc agagagtttg 3841 gcaacgttcg ctctcttgac cagacccctt ctccctgacc tggctcttcc agccagggca 3901 cgagccctcc ttctatacct gctccccttc ccccagtggg gactgagtta tgggagaagg 3961 ggacatattt gtggccaaaa tgatactaac caaaggggct tccttgtcag ggcctggtgg 4021 agttggtggg tcatcggggc tcactgcctc ctgcccttct ctcctgtctg acccccactt 4081 agcccttctc tccttgcagc ctagcagttt atagttctga gatggaaagt tgaagggggc 4141 aagcaagacc tctcctcagc ccatgcccag ctgtcaggag agaggtgcag ggaggaaggc 4201 cttgtgctgg gacaacctct ctcttgcctt acctcagaga gggactatgc cctgacccct 4261 cctttctgaa aatcagtgcc ctccctgttg ctctaggagg ctcctgctgg cttggtagaa 4321 gacagaattc gatctgcctg tccctttttc ccctggggtt tgacacacag gctcctctca 4381 gcatgaggtg gagcagtgac caggtggagc agtgaccagg acgcctctgg cccagtgctg 4441 cccagcctcc ccgcccgctc ccaggcgccc catgtcctca caggccagga cgccatggca 4501 ggatggagag gacttggtgg atttttgttt cttgcctgac ctcagtttca tgaaagaaag 4561 tggaagctac agaattattt tctaaaataa aggctgaatt gtctgaaaaa tatttatgtg 4621 tgtgtgtcct ggaaaagaag gtggcaggca gggaaagaaa ggaaaaggga gaataaagag 4681 ttaagaagag gtctagacgg gtggg // LOCUS HUMVIM 1749 bp DNA PRI 14-JAN-1995 DEFINITION Human vimentin gene, complete cds. ACCESSION M14144 NID g340218 KEYWORDS intermediate filament; vimentin. SOURCE Human liver DNA (library of T.Maniatis), clone lambda-4F1 c1.37, and cDNA to mRNA (library of Okayama-Berg), clone pL3-A7A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1749) AUTHORS Ferrari,S., Battini,R., Kaczmarek,L., Rittling,S., Calabretta,B., de Riel,J.K., Philiponis,V., Wei,J.F. and Baserga,R. TITLE Coding sequence and growth regulation of the human vimentin gene JOURNAL Mol. Cell. Biol. 6 (11), 3614-3620 (1986) MEDLINE 87089701 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Battini, 09-MAR-1987. Bases 1-835 are from a genomic clone and bases 836-1749 are from cDNA. FEATURES Location/Qualifiers source 1..1749 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10p13" mRNA <292..1749 /note="vim mRNA" gene 292..1692 /gene="VIM" CDS 292..1692 /gene="VIM" /note="vimentin" /codon_start=1 /db_xref="GDB:G00-119-630" /db_xref="PID:g340219" /translation="MSTRSVSSSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGDAL RPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLLQDSVDFSLADAINTEFKNTRTN EKVELQELNDRFANYIDKVRFLEQQNKILLAELEQLKGQGKSRLGDLYEEEMRELRRQ VDQLTNDKARVEVERDNLAEDIMRLREKLQEEMLQREEAENTLQSFRQDVDNASLARL DLERKVESLQEEIAFLKKLHEEEIQELQAQIQEQHVQIDVDVSKPDLTAALRDVRQQY ESVAAKNLQEAEEWYKSKFADLSEAANRNNDALRQAKQESTEYRRQVQSLTCEVDALK GTNESLERQMREMEENFAVEAANYQDTIGRLQDEIQNMKEEMARHLREYQDLLNVKMA LDIEIATYRKLLEGEESRISLPLPNFSSLNLRETNLDSLPLVDTHSKRTFLIKTVETR DGQVINETSQHHDDLE" BASE COUNT 415 a 548 c 476 g 310 t ORIGIN Chromosome 10p13. 1 gagggcgccc ccaccccacc cgcccaccct ccccgcttct cgctaggtcc cgattggctg 61 gcgcgctccg cggctgggat ggcagtggga ggggaccctc tttcctaacg gggttataaa 121 aacagcgccc tcggcggggt ccagtcctct gccactctcg ctccgaggtc cccgcgccag 181 agacgcagcc gcgctcccac cacccacacc caccgcgccc tcgttcgcct cttctccggg 241 agccagtccg cgccaccgcc gccgcccagc ccatcgccac cctccgcagc catgtccacc 301 aggtccgtgt cctcgtcctc ctaccgcagg atgttcggcg gcccgggcac cgcgagccgg 361 ccgagctcca gccggagcta cgtgactacg tccacccgca cctacagcct gggcgacgcg 421 ctgcgcccca gcaccagccg cagcctctac gcctcgtccc cgggcggcgt gtatgccacg 481 cgctcctctg ccgtgcgcct gcggagcagc gtgcccgggg tgcggctcct gcaggactcg 541 gtggacttct cgctggccga cgccatcaac accgagttca agaacacccg caccaacgag 601 aaggtggagc tgcaggagct gaatgaccgc ttcgccaact acatcgacaa ggtgcgcttc 661 ctggagcagc agaataagat cctgctggcc gagctcgagc agctcaaggg ccaaggcaag 721 tcgcgcctag gggacctcta cgaggaggag atgcgggagc tgcgccggca ggtggaccag 781 ctaaccaacg acaaagcccg cgtcgaggtg gagcgcgaca acctggccga ggacatcatg 841 cgcctccggg aaaaattgca ggaggagatg cttcagagag aggaagccga aaacaccctg 901 caatctttca gacaggatgt tgacaatgcg tctctggcac gtcttgacct tgaacgcaaa 961 gtggaatctt tgcaagaaga gattgccttt ttgaagaaac tccacgaaga ggaaatccag 1021 gagctgcagg ctcagattca ggaacagcat gtccaaatcg atgtggatgt ttccaagcct 1081 gacctcacgg ctgccctgcg tgacgtacgt cagcaatatg aaagtgtggc tgccaagaac 1141 ctgcaggagg cagaagaatg gtacaaatcc aagtttgctg acctctctga ggctgccaac 1201 cggaacaatg acgccctgcg ccaggcaaag caggagtcca ctgagtaccg gagacaggtg 1261 cagtccctca cctgtgaagt ggatgccctt aaaggaacca atgagtccct ggaacgccag 1321 atgcgtgaaa tggaagagaa ctttgccgtt gaagctgcta actaccaaga cactattggc 1381 cgcctgcagg atgagattca gaatatgaag gaggaaatgg ctcgtcacct tcgtgaatac 1441 caagacctgc tcaatgttaa gatggccctt gacattgaga ttgccaccta caggaagctg 1501 ctggaaggcg aggagagcag gatttctctg cctcttccaa acttttcctc cctgaacctg 1561 agggaaacta atctggattc actccctctg gttgataccc actcaaaaag gacattcctg 1621 attaagacgg ttgaaactag agatggacag gttatcaacg aaacttctca gcatcacgat 1681 gaccttgaat aaacaattgc acactcagtg cagcactcat ataccagcag ataaaagaat 1741 ccatatctt // LOCUS HUMVIPR2A 1317 bp DNA PRI 24-MAY-1995 DEFINITION Homo sapiens vasoactive intestinal polypeptide receptor 2 (VIPR2) mRNA, complete cds. ACCESSION L40764 NID g712836 KEYWORDS G-protein coupled receptor; PACAP receptor; VIP receptor; transmembrane protein; vasoactive intestinal polypeptide receptor; vasoactive intestinal polypeptide receptor 2. SOURCE Homo sapiens (clone: phVIP2-13) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Svoboda,M., Tastenoy,M., Van Rampelbergh,J., Goossens,J.F., De Neef,P., Waelbroeck,M. and Robberecht,P. TITLE Molecular cloning and functional characterization of a human VIP receptor from SUP-T1 lymphoblasts JOURNAL Biochem. Biophys. Res. Commun. 205 (3), 1617-1624 (1994) MEDLINE 95110300 REFERENCE 2 (bases 1 to 1317) AUTHORS Adamou,J.E., Aiyar,N., Van Horn,S. and Elshourbagy,N.A. TITLE Cloning and functional characterization of the human vasoactive intestinal peptide (VIP)-2 receptor JOURNAL Biochem. Biophys. Res. Commun. 209 (2), 385-392 (1995) MEDLINE 95251631 FEATURES Location/Qualifiers source 1..1317 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phVIP2-13" /tissue_type="placenta" sig_peptide 1..60 /gene="VIPR2" /note="putative" CDS 1..1317 /gene="VIPR2" /standard_name="VIP-2 receptor" /note="transmembrane domains at basepairs 384-444, 477-534, 612-681, 720-780, 840-909, 987-1044, and 1074-1140; putative" /codon_start=1 /product="vasoactive intestinal polypeptide receptor 2" /db_xref="PID:g712837" /translation="MRTLLPPALLTCWLLAPVNSIHPECRFHLEIQEEETKCTELLRS QTEKHKACSGVWDNITCWRPANVGETVTVPCPKVFSNFYSKAGNISKNCTSDGWSETF PDFVDACGYSDPEDESKITFYILVKAIYTLGYSVSLMSLATGSIILCLFRKLHCTRNY IHLNLFLSFILRAISVLVKDDVLYSSSGTLHCPDQPSSWVGCKLSLVFLQYCIMANFF WLLVEGLYLHTLLVAMLPPRRCFLAYLLIGWGLPTVCIGAWTAARLYLEDTGCWDTND HSVPWWVIRIPILISIIVNFVLFISIIRILLQKLTSPDVGGNDQSQYKRLAKSTLLLI PLFGVHYMVFAVFPISISSKYQILFELCLGSFQGLVVAVLYCFLNSEVQCELKRKWRS RCPTPSASRDYRVCGSSFSHNGSEGALQFHRASRAQSFLQTETSVI" gene 1..1317 /gene="VIPR2" mat_peptide 61..1314 /gene="VIPR2" /standard_name="VIP-2 receptor" /note="putative" /product="vasoactive intestinal polypeptide receptor 2" BASE COUNT 278 a 393 c 337 g 309 t ORIGIN 1 atgcggacgc tgctgcctcc cgcgctgctg acctgctggc tgctcgcccc cgtgaacagc 61 attcacccag aatgccgatt tcatctggaa atacaggagg aagaaacaaa atgtacagag 121 cttctgaggt ctcaaacaga aaaacacaaa gcctgcagtg gcgtctggga caacatcacg 181 tgctggcggc ctgccaatgt gggagagacc gtcacggtgc cctgcccaaa agtcttcagc 241 aatttttaca gcaaagcagg aaacataagc aaaaactgta cgagtgacgg atggtcagag 301 acgttcccag atttcgtcga tgcctgtggc tacagcgacc cggaggatga gagcaagatc 361 acgttttata ttctggtgaa ggccatttat accctgggct acagtgtctc tctgatgtct 421 cttgcaacag gaagcataat tctgtgcctc ttcaggaagc tgcactgcac caggaattac 481 atccacctga acctgttcct gtccttcatc ctgagagcca tctcagtgct ggtcaaggac 541 gacgttctct actccagctc tggcacgttg cactgccctg accagccatc ctcctgggtg 601 ggctgcaagc tgagcctggt cttcctgcag tactgcatca tggccaactt cttctggctg 661 ctggtggagg ggctctacct ccacaccctc ctggtggcca tgctcccccc tagaaggtgc 721 ttcctggcct acctcctgat cggatggggc ctccccaccg tctgcatcgg tgcatggact 781 gcggccaggc tctacttaga agacaccggt tgctgggata caaacgacca cagtgtgccc 841 tggtgggtca tacgaatacc gattttaatt tccatcatcg tcaattttgt ccttttcatt 901 agtattatac gaattttgct gcagaagtta acatccccag atgtcggcgg caacgaccag 961 tctcagtaca agaggctggc caagtccacg ctcctgctta tcccgctgtt cggcgtccac 1021 tacatggtgt ttgccgtgtt tcccatcagc atctcctcca aataccagat actgtttgag 1081 ctgtgcctcg ggtcgttcca gggcctggtg gtggccgtcc tctactgttt cctgaacagt 1141 gaggtgcagt gcgagctgaa gcgaaaatgg cgaagccggt gcccgacccc gtccgcgagc 1201 cgggattaca gggtctgcgg ttcctccttc tcccacaacg gctcggaggg cgccctgcag 1261 ttccaccgcg cgtcccgagc ccagtccttc ctgcaaacgg agacctcggt catctag // LOCUS S45489 2239 bp DNA PRI 05-JAN-1993 DEFINITION bradykinin B2 receptor=G protein-coupled receptor [human, Genomic, 2239 nt]. ACCESSION S45489 NID g256536 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2239) AUTHORS Eggerickx,D., Raspe,E., Bertrand,D., Vassart,G. and Parmentier,M. TITLE Molecular cloning, functional expression and pharmacological characterization of a human bradykinin B2 receptor gene JOURNAL Biochem. Biophys. Res. Commun. 187 (3), 1306-1313 (1992) MEDLINE 93038601 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 114201] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..2239 /organism="Homo sapiens" /db_xref="taxon:9606" gene 148..1242 /gene="bradykinin B2 receptor" CDS 148..1242 /gene="bradykinin B2 receptor" /note="G protein-coupled receptor; This sequence comes from Fig. 1" /codon_start=1 /product="bradykinin B2 receptor" /db_xref="PID:g256537" /translation="MLNVTLQGPTLNGTFAQSKCPQVEWLGWLNTIQPPFLWVLFVLA TLENIFVLSVFCLHKSSCTVAEIYLGNLAAADLILACGLPFWAITISNNFDWLFGETL CRVVNAIISMNLYSSICFLMLVSIDRYLALVKTMSMGRMRGVRWAKLYSLVIWGCTLL LSSPMLVFRTMKEYSDEGHNVTACVISYPSLIWEVFTNMLLNVVGFLLPLSVITFCTM QIMQVLRNNEMQKFKEIQTERRATVLVLVVLLLFIICWLPFQISTFLDTLHRLGILSS CQDERIIDVITQIASFMAYSNSCLNPLVYVIVGKRFRKKSWEVYQGVCQKGGCRSEPI QMENSMGTLRTSISVERQIHKLQDWAGSRQ" BASE COUNT 520 a 610 c 604 g 504 t 1 others ORIGIN 1 ctgcagaaaa cagcctgagc tccacctcgg cttctccttg ccctggctgg ttgtccttaa 61 cccctgtctc cttctggacc agtttttgtc cttcccttgt gaccctgagg ggtaacagcc 121 tcttttccac tttctttcag cgccgacatg ctcaatgtca ccttgcaagg gcccactctt 181 aacgggacct ttgcccagag caaatgcccc caagtggagt ggctgggctg gctcaacacc 241 atccagcccc ccttcctctg ggtgctgttc gtgctggcca ccctagagaa catctttgtc 301 ctcagcgtct tctgcctgca caagagcagc tgcacggtgg cagagatcta cctggggaac 361 ctggccgcag cagacctgat cctggcctgc gggctgccct tctgggccat caccatctcc 421 aacaacttcg actggctctt tggggagacg ctctgccgcg tggtgaatgc cattatctcc 481 atgaacctgt acagcagcat ctgtttcctg atgctggtga gcatcgaccg ctacctggcc 541 ctggtgaaaa ccatgtccat gggccggatg cgcggcgtgc gctgggccaa gctctacagc 601 ttggtgatct gggggtgtac gctgctcctg agctcaccca tgctggtgtt ccggaccatg 661 aaggagtaca gcgatgaggg ccacaacgtc accgcttgtg tcatcagcta cccatccctc 721 atctgggaag tgttcaccaa catgctcctg aatgtcgtgg gcttcctgct gcccctgagt 781 gtcatcacct tctgcacgat gcagatcatg caggtgctgc ggaacaacga gatgcagaag 841 ttcaaggaga tccagacgga gaggagggcc acggtgctag tcctggttgt gctgctgcta 901 ttcatcatct gctggctgcc cttccagatc agcaccttcc tggatacgct gcatcgcctc 961 ggcatcctct ccagctgcca ggacgagcgc atcatcgatg taatcacaca gatcgcctcc 1021 ttcatggcct acagcaacag ctgcctcaac ccactggtgt acgtgatcgt gggcaagcgc 1081 ttccgaaaga agtcttggga ggtgtaccag ggagtgtgcc agaaaggggg ctgcaggtca 1141 gaacccattc agatggagaa ctccatgggc acactgcgga cctccatctc cgtggaacgc 1201 cagattcaca aactgcagga ctgggcaggg agcagacagt gagcaaacgc cagcagggct 1261 gctgtgaatt tgtgtaagga ttgagggaca gttgcttttc agcatgggcc caggaatgcc 1321 aaggagacat ctatgcacga ccttgggaaa tgagttgatg tctccggtaa aacaccggag 1381 actaattcct gncctgccca attttgcagg gagcatggct gtgaggatgg ggtgaactca 1441 cgcacagcca aggactccaa aatcacaaca gcattactgt tcttatttgc tgccacacct 1501 gagccagcct gctccttccc aggagtggag gaggcctggg ggcagggaga ggagtgactg 1561 agcttccctc ccgtgtgttc tccgtccctg ccccagcaag acaacttaga tctccaggag 1621 aactgccatc cagctttggt gcaatggctg agtgcacaag tgagttgttg ccctgggttt 1681 ctttaatcta ttcagctaga actttgaagg acaatttctt gcattaataa aggttaagcc 1741 ctgaggggtc cctgataaca acctggagac caggatttta tggctcccct cactgatgga 1801 caagggaggt ctgtgccaaa gaagaatcca ataagcacat attgagcact tgctgtatat 1861 gcagtattga gcactgtagg caagagggaa gaaagagaag gagccatctc catcttgaag 1921 gaactcaaag actcaagtgg gaacgactgg cactgccacc accagaaagc tgttcgacga 1981 gacggtcgag cagggtgctg tgggtgatat ggacagcaga agggggagac caaggttcca 2041 gctcaaccaa taactattgc acaaccacct gtccctgcct cagttccctc ttctgtaaca 2101 tgaagtcgtt gtgagggtta aaggcagtaa caggtataaa gtacttagaa aagcaaaggg 2161 tgctacgtac atgtgaggca tcattacgca gacgtaactg ggatatgttt actataagga 2221 aaagacactg aggtctaga // LOCUS S48475S2 878 bp DNA PRI 10-FEB-1993 DEFINITION CSF2RA=GM-CSF receptor alpha subunit {3' region, 5' region} [human, Genomic, 878 nt, segment 2 of 2]. ACCESSION S48539 NID g258858 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 878) AUTHORS Rappold,G., Willson,T.A., Henke,A. and Gough,N.M. TITLE Arrangement and localization of the human GM-CSF receptor alpha chain gene CSF2RA within the X-Y pseudoautosomal region JOURNAL Genomics 14 (2), 455-461 (1992) MEDLINE 93052350 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 117979] from the original journal article. This sequence comes from Fig. 3B. Map location: X-Y pseudoautosomal region. FEATURES Location/Qualifiers source 1..878 /organism="Homo sapiens" /db_xref="taxon:9606" gene 46..123 /partial /note="GM-CSF receptor alpha subunit" /gene="CSF2RA" CDS 46..123 /partial /gene="CSF2RA" /note="hemopoietic growth factor receptor; This sequence comes from Fig. 3B" /codon_start=1 /product="GM-CSF receptor alpha subunit" /db_xref="PID:g258859" /translation="MIWEEFTPEEGKGYREEVLTVKEIT" BASE COUNT 244 a 205 c 231 g 198 t ORIGIN 1 ctgagctcgt gaagatctga cagcctgaac cctccttttt ctcagatgat ctgggaggaa 61 ttcaccccag aggaagggaa aggctaccgc gaagaggtct tgaccgtgaa ggaaattacc 121 tgagacccag agggtgtagg aatggcatgg acatctccgc ctccgcgaca cgggggaact 181 gttttcttga tgatgctgtg aacctttata tcattttcta tgtttttatt taaaaacatg 241 acatttgggg ccaggcgcgg tggctcacgc ctgtaatccc agcactttgg gaggccaagg 301 caggcggatc acctgaggtc aggagttcaa gaccagcctg cccaacatgg tgaaacccca 361 tctggactaa aaatgcagaa atttacccag gcacggcggc ggagcccatc atcccagcta 421 cttgggaggc tgaggcagga gaattgcttg aaccctttga gcggaggttg tagtgagcca 481 gatcgcacca ttgcacacca acctgcgtga cagagcaaga ttgcatctca aaacaaacaa 541 taataataaa taataaaaac ctgatatttg gctgggcgcg tggctcatcg tctaatccta 601 acactttgga ggattgctgg agacaggagt ttaagaccag tctgggcaac atagcaagac 661 cctgtctcta caaaaaaggc aaaaattagc tgggcgtggt ggcttgtgcc tgtagtgcca 721 gctatctggg aggctgaggc gggaggatca cttgagttca agctgacagt aagctatgat 781 tgcacgttgc actccagcct gggtaacaaa ctaagacccc atctctctgt ctcaaaaaaa 841 gtgatacttc gcaaagatgg ttagactcca agaagctt // LOCUS S70458 201 bp DNA PRI 22-SEP-1994 DEFINITION CMAR=cellular adhesion regulatory molecule [human, lymphoblastoid B cells, melanoma cell line CDM8, Genomic Mutant, 201 nt]. ACCESSION S70458 NID g546796 KEYWORDS . SOURCE human lymphoblastoid B cells melanoma cell line CDM8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 201) AUTHORS Durbin,H., Novelli,M. and Bodmer,W. TITLE Detection of a 4-bp insertion (CACA) functional polymorphism at nucleotide 241 of the cellular adhesion regulatory molecule CMAR (formerly CAR) JOURNAL Genomics 19 (1), 181-182 (1994) MEDLINE 94245163 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 148751] from the original journal article. This sequence comes from Fig. 1b. Map location: 16q.24. FEATURES Location/Qualifiers source 1..201 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..171 /note="cellular adhesion regulatory molecule, CMAR; formerly CAR" /gene="CMAR" CDS 1..171 /gene="CMAR" /note="This sequence comes from Fig. 1b." /codon_start=1 /product="cellular adhesion regulatory molecule" /db_xref="PID:g546797" /translation="MGPHSQAQALGSGGLTTAPCCHVDWCKLRTSCWSSHACSVGDAL VFTALRIVEILY" BASE COUNT 50 a 44 c 55 g 52 t ORIGIN 1 atggggccac acagtcaggc ccaggcactg ggctccggag gactcaccac tgccccctgc 61 tgccatgtgg actggtgcaa gttgaggact tcttgctggt ctagtcacgc atgcagtgtt 121 ggggatgcct tggtttttac tgctctgaga attgttgaga tactttacta ataaactgtg 181 tagttggaaa aaaaaaaaaa a // LOCUS S72459 2475 bp DNA PRI 10-JUL-1992 DEFINITION CREB327=cyclic AMP-responsive enhancer binding protein {alternatively spliced} [human, Genomic, 2475 nt]. ACCESSION S72459 NID g240428 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2475) AUTHORS Waeber,G., Meyer,T.E., Hoeffler,J.P. and Habener,J.F. TITLE Diversification of cyclic AMP-responsive enhancer binding proteins-generated by alternative exon splicing JOURNAL Trans. Assoc. Am. Physicians 103, 28-37 (1990) MEDLINE 92087371 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 72459] from the original journal article. This sequence comes from Figure 3. FEATURES Location/Qualifiers source 1..2475 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2475 /gene="CREB327" CDS 126..1109 /gene="CREB327" /note="cyclic AMP-responsive enhancer binding protein; This sequence comes from Figure 3" /codon_start=1 /product="CREB327" /db_xref="PID:g240429" /translation="MTMESGAENQQSGDAAVTEAENQQMTVQAQPQIATLAQVSMPAA HATSSAPTVTLVQLPNGQTVQVHGVIQAAQPSVIQSPQVQTVQISTIAESEDSQESVD SVTDSQKRREILSRRPSYRKILNDLSSDAPGVPRIEEEKSEEETSAPAITTVTVPTPI YQTSSGQYIAITQGGAIQLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQ ILVPSNQVVVQAASGDVQTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRL MKNREAARECRRKKKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD" BASE COUNT 805 a 480 c 491 g 699 t ORIGIN 1 gaattcgggc gcggcggagg tgtagtttga cgcggtgtgt tacgtggggg agagaataaa 61 actccagcga gatccgggcc gtgaacgaaa gcagtgacgg aggagcttgt accaccggta 121 actaaatgac catggaatct ggagccgaga accagcagag tggagatgca gctgtaacag 181 aagctgaaaa ccaacaaatg acagttcaag cccagccaca gattgccaca ttagcccagg 241 tatctatgcc agcagctcat gcaacatcat ctgctcccac cgtaactcta gtacagctgc 301 ccaatgggca gacagttcaa gtccatggag tcattcaggc ggcccagcca tcagttattc 361 agtctccaca agtccaaaca gttcagattt caactattgc agaaagtgaa gattcacagg 421 agtcagtgga tagtgtaact gattcccaaa agcgaaggga aattctttca aggaggcctt 481 cctacaggaa aattttgaat gacttatctt ctgatgcacc aggagtgcca aggattgaag 541 aagagaagtc tgaagaggag acttcagcac ctgccatcac cactgtaacg gtgccaactc 601 caatttacca aactagcagt ggacagtata ttgccattac ccagggagga gcaatacagc 661 tggctaacaa tggtaccgat ggggtacagg gcctgcaaac attaaccatg accaatgcag 721 cagccactca gccgggtact accattctac agtatgcaca gaccactgat ggacagcaga 781 tcttagtgcc cagcaaccaa gttgttgttc aagctgcctc tggagacgta caaacatacc 841 agattcgcac agcacccact agcactattg cccctggagt tgttatggca tcctccccag 901 cacttcctac acagcctgct gaagaagcag cacgaaagag agaggtccgt ctaatgaaga 961 acagggaagc agctcgagag tgtcgtagaa agaagaaaga atatgtgaaa tgtttagaaa 1021 acagagtggc agtgcttgaa aatcaaaaca agacattgat tgaggagcta aaagcactta 1081 aggaccttta ctgccacaaa tcagattaat ttgggattta aattttcacc tgttaaggtg 1141 gaaaatggac tggcttggcc acaacctgaa agacaaaata aacattttat tttctaaaca 1201 tttctttttt tctatgcgca aaactgcctg aaagcaacta cagaatttga ttcatttgtg 1261 cttttgcatt aaactgtgaa tgttccaaca cctgcctcca cttctcccct caagaaattt 1321 tcaacgccag gaatcatgaa gagacttctg cttttcaacc cccaccctcc tcaagaagta 1381 ataatttgtt tacttgtaaa ttgatgggag aaatgaggaa aagaaaatct ttttaaaaat 1441 gatttcaagg tttgtgctga gctccttgat tgccttaggg acagaattac cccagcctct 1501 tgagctgaag taatgtgtgg gccgcatgca taaagtaagt aaggtgcaat gaagaagtgt 1561 tgattgccaa attgacatgt tgtcacattc tcattgtgaa ttatgtaaag ttgttaagag 1621 acataccctc taaaaaagaa ctttagcatg gtattgaagg aattagaaat gaatttggag 1681 tgctttttat gtatgttgtc ttcttcaata ctgaaaattt gtccttggtt cttaaaagca 1741 ttctgtacta atacagctct tccatagggc agttgttgct tcttaattca gttctgtatg 1801 tgttcaacat ttttgaatac attaaaagaa gtaaccaact gaacgacaaa gcatggtatt 1861 tgaattttaa attaaagcaa agtaaataaa agtacaaagc atattttagt tagtactaaa 1921 ttcttagtaa aatgctgatc agtaaaccaa tcccttgagt tatataacaa gatttttaaa 1981 taaatgttat tgtcctcacc ttcaaaaata tttatattgt cactcattta cgtaaaaaga 2041 tatttctaat ttactgttgc ccattgcact tacataccac caccaagaaa gccttcaaga 2101 tgtcaaataa agcaaagtga tatatatttg tttatgaaat gttacatgta gaaaaatact 2161 gattttaaat attttccata ttaacaattt aacagagaat ctctagtgaa ttttttaaat 2221 gaaagaagtt gtaaggatat aaaaagtaca gtgttagatg tgcacaagga aagttatttt 2281 cagacatatt tgaatgactg ctgtactgca atatttggat tgtcattctt acaaaacatt 2341 tttttgttct cttgtaaaaa gagtagttat tagttctgct ttagctttcc aatatgctgt 2401 atagcctttg tcattttata attttaattc ctgattaaaa cagtctgtat ttgtgtatat 2461 catccccccg aattc // LOCUS S73199 1724 bp DNA PRI 28-FEB-1995 DEFINITION follicle-stimulating hormone receptor {5' region} [human, Genomic, 1724 nt]. ACCESSION S73199 NID g685036 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1724) AUTHORS Gromoll,J., Dankbar,B. and Gudermann,T. TITLE Characterization of the 5' flanking region of the human follicle-stimulating hormone receptor gene JOURNAL Mol. Cell. Endocrinol. 102 (1-2), 93-102 (1994) MEDLINE 95011044 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 155309] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1724 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1487..1642 /partial /gene="follicle-stimulating hormone receptor, FSHR" CDS 1487..1642 /partial /gene="follicle-stimulating hormone receptor, FSHR" /note="This sequence comes from Fig. 1; FSHR" /codon_start=1 /product="follicle-stimulating hormone receptor" /db_xref="PID:g685037" /translation="MALLLVSLLAFLSLGSGCHHRICHCSNRVFLCQESKVTEIPSDL PRNAIEL" BASE COUNT 514 a 319 c 381 g 510 t ORIGIN 1 ctccaaaagc tagaagggga cttatggggc agaggaattg ggtaaaaggt ttaattggta 61 gtgagtttct tgtcactgga gttattcaag tacagatgag aaaatccgtt taaattaata 121 tgaatactta aacttaagca ttagttggtt gattgaatta acttcttcta catctgaaag 181 actataattc cttggatcag ttgttgaaca aactaattta accattcatg ttagccaatt 241 tgttaaaaat atgaagtcca tatagatact gcctttgtct tgccaattcc agaattagaa 301 taaatccatg ttcccacaaa aacagggatt acacttgccc caatcacttg cttctgtttt 361 cttaataatt taatcattct ttacatggtt ttccttggta aggacagggg agagggaagg 421 aggggtggga taagtaggta tttttaacat tttcatgaag ctcaatttta tgcaactaat 481 tatctaaaga tagtattgaa actaatttta gcaggtaata agttgtgggc aggatcttag 541 agaatattct gccttttaat cttcatagat tctacaattg aaagtagaaa gccacttttc 601 taaaagtgtt gccaaacgtt cctcagatcc ctcaccatcg aggaggggaa tgccaacaaa 661 agtgctgggc tgtcacctca gggaaccttt taaaagtaag ttaaaagcgt agattaaata 721 aattcattta tgccaattgc caaaaggcaa gggcaatctt cagagccctg gaaatgcgag 781 aaggcaggca ggatatgcac gagggggtgg atgtgctggc atttactaaa tccccactgg 841 aaacatagca catttcttag atcactcagt tcagtcaatt atcgctacag ataaccctat 901 cagaagtacc tcccacctac ggaaatctgc ctaaggtttg ctaacccacc tgcctgtttc 961 tgaatagtgc catttttggc ataattacag caaaaaaaga ggatcattga atgcaaccca 1021 gaagggctgg ttttctctca ggtgggctgg ttgaagaaat tgatgctgaa gactgaaggt 1081 cccagccctt cacttattag tacccctctt agtgatgtgt catattgtgg gttgtgtctt 1141 ttttggagaa agtcaatcat gtcactctgt tgagaagaga atggtgaaca gcaaggagac 1201 ttttgcaatg aaataatgca aactattcca gacatgccta atggttctat ttgctgtgtg 1261 ccttaggtca gggtgtaaga aacccaatct tgaaggaaaa cagagtagct tatcttgcct 1321 ggaagtaaca aaaaaaaaaa aaaaaagcat cccttggtgg gtcacatgac cctaccagtt 1381 ctcaagtcag atctcttctc ataagggcac tgtgtggagc ttctgagatc tgtggaggtt 1441 tttctctgca aatgcaggaa gaaatcaggt ggatggatgc ataattatgg ccctgctcct 1501 ggtctctttg ctggcattcc tgagcttggg ctcaggatgt catcatcgga tctgtcactg 1561 ctctaacagg gtttttctct gccaagagag caaggtgaca gagattcctt ctgacctccc 1621 gaggaatgcc attgaactgt gagtatcaga gggaggggga acaactgcat ggctggcatt 1681 tgtgcattgc gtactattat tatttttaca ttaggctgat atct // LOCUS S73619 156 bp DNA PRI 01-MAR-1995 DEFINITION COL10A1=type X alpha 1 collagen {3' region} [human, Schmid metaphyseal chondrodysplasia patient, peripheral blood leukocytes, Genomic Mutant, 156 nt]. ACCESSION S73619 NID g688346 KEYWORDS . SOURCE human peripheral blood leukocytes Schmid metaphyseal chondrodysplasia patient. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 156) AUTHORS Dharmavaram,R.M., Elberson,M.A., Peng,M., Kirson,L.A., Kelley,T.E. and Jimenez,S.A. TITLE Identification of a mutation in type X collagen in a family with Schmid metaphyseal chondrodysplasia JOURNAL Hum. Mol. Genet. 3 (3), 507-509 (1994) MEDLINE 94282047 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 156226] from the original journal article. This sequence comes from Fig. 2A. FEATURES Location/Qualifiers source 1..156 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..156 /partial /note="type X alpha 1 collagen" /gene="COL10A1" CDS 1..156 /partial /gene="COL10A1" /note="This sequence comes from Fig. 2A." /codon_start=1 /product="type X alpha 1 collagen" /db_xref="PID:g688347" /translation="MMNTPKATWIRLQGVPSSISQKMTRCGSSFPMPSQMAYTPLSMS TPLSQDS" BASE COUNT 40 a 44 c 33 g 39 t ORIGIN 1 atgatgaata caccaaaggc tacctggatc aggcttcagg gagtgccatc atcgatctca 61 cagaaaatga ccaggtgtgg ctccagcttc ccaatgccga gtcaaatggc ctatactcct 121 ctgagtatgt ccactcctct ttctcaggat tcctag // LOCUS S75308 1183 bp DNA PRI 26-MAY-1995 DEFINITION Msx-2 homolog [human, dental pulp-derived cells, Genomic, 1183 nt]. ACCESSION S75308 NID g834013 KEYWORDS . SOURCE human dental pulp-derived cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1183) AUTHORS Iimura,T. TITLE [Molecular cloning and expression of homeobox-containing genes during hard tissue development] JOURNAL Kokubyo Gakkai Zasshi 61 (4), 590-604 (1994) MEDLINE 95204995 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 160463] from the original journal article. This sequence comes from Fig. 2B. FEATURES Location/Qualifiers source 1..1183 /organism="Homo sapiens" /db_xref="taxon:9606" gene 62..865 /note="mouse Msx-2 homolog" /gene="Msx-2 homolog" CDS 62..865 /gene="Msx-2 homolog" /codon_start=1 /translation="MASPSKGNDLFSPDEEGPAVVAGPGPGPGGAEGAAEERRVKVSS LPFSVEALMSDKKPPKEASPLPAESASAGATLRPLLLSGHGAREAHSPGPLVKPFETA SVKSENSEDGAAWMQEPGRYSPPPRHTSPTTCTLRKHKTNRKPRTPFTTSQLLALERK FRQKQYLSIAERAEFSSSLNLTETQVKIWFQNRRAKAKRLQEAELEKLKMAAKPMLPS SFSLPFPISSPLQAASIYGASYPFHRPVLPIPPVGLYATPVGYGMYHLS" BASE COUNT 277 a 365 c 316 g 225 t ORIGIN 1 gtcgccgctg ccgggttgcc agcggagtcg cgcgtcggga gctacgtagg gcagagaagt 61 catggcttct ccgtccaaag gcaatgactt gttttcgccc gacgaggagg gcccagcagt 121 ggtggccgga ccaggcccgg ggcctggggg cgccgagggg gccgcggagg agcgccgcgt 181 caaggtctcc agcctgccct tcagcgtgga ggcgctcatg tccgacaaga agccgcccaa 241 ggaggcgtcc ccgctgccgg ccgaaagcgc ctcggccggg gccaccctgc ggccactgct 301 gctgtcgggg cacggcgctc gggaagcgca cagccccggg ccgctggtga agcccttcga 361 gaccgcctcg gtcaagtcgg aaaattcaga agatggagcg gcgtggatgc aggaacccgg 421 ccgatattcg ccgccgccaa gacatacgag ccctaccacc tgcaccctga ggaaacacaa 481 gaccaatcgg aagccgcgca cgccctttac cacatcccag ctcctcgccc tggagcgcaa 541 gttccgtcag aaacagtacc tctccattgc agagcgtgca gagttctcca gctctctgaa 601 cctcacagag acccaggtca aaatctggtt ccagaaccga agggccaagg cgaaaagact 661 gcaggaggca gaactggaaa agctgaaaat ggctgcaaaa cctatgctgc cctccagctt 721 cagtctccct ttccccatca gctcgcccct gcaggcagcg tccatatatg gagcatccta 781 cccgttccat agacctgtgc ttcccatccc gcctgtggga ctctatgcca cgccagtggg 841 atatggcatg taccacctgt cctaaggaag accagatcaa tagactccat gatggatgct 901 tgtttcaaag ggtttcctct ccctctccac gaaggcagta ccagccagta ctcctgctct 961 gctaaccctg cgtgcaccac cctaagcggc taggctgaca gggccacacg acatagctga 1021 aatttcgttc tgtaggcgga ggcaccaagc cctgttttct tggtgtaatc ttccagatgc 1081 ccccttttcc tttcacaaag attggctctg atggttttta tgtataaata tatatatata 1141 ataaaatata atacattttt atacaaaaaa aaaaaaaaaa aaa // LOCUS S76825 183 bp DNA PRI 26-JUL-1995 DEFINITION insulin receptor {exon 1} [human, insulin resistant patient, Genomic Mutant, 183 nt]. ACCESSION S76825 NID g914085 KEYWORDS . SOURCE human insulin resistant patient. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 183) AUTHORS Cama,A., Sierra,M.L., Kadowaki,T., Kadowaki,H., Quon,M.J., Rudiger,H.W., Dreyer,M. and Taylor,S.I. TITLE Two mutant alleles of the insulin receptor gene in a family with a genetic form of insulin resistance: a 10 base pair deletion in exon 1 and a mutation substituting serine for asparagine-462 JOURNAL Hum. Genet. 95 (2), 174-182 (1995) MEDLINE 95163934 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 163664] from the original journal article. This sequence comes from Fig. 3B. FEATURES Location/Qualifiers source 1..183 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..183 /partial /gene="insulin receptor" CDS 1..183 /partial /gene="insulin receptor" /note="This sequence comes from Fig. 3B." /codon_start=1 /product="insulin receptor" /db_xref="PID:g914086" /translation="MGTGGRRPRRCWWRWPRCYWAPRATCTPERCVPAWISGTTSLGC MSWRIALSSKDTCRYS" BASE COUNT 30 a 54 c 65 g 34 t ORIGIN 1 atgggcaccg ggggccggcg gccgcgccgc tgctggtggc ggtggccgcg ctgctactgg 61 gcgccgcggg ccacctgtac cccggagagg tgtgtcccgg catggatatc cggaacaacc 121 tcactaggtt gcatgagctg gagaattgct ctgtcatcga aggacacttg cagatactct 181 tga // LOCUS S76830 3068 bp DNA PRI 26-JUL-1995 DEFINITION glycoprotein D=Duffy group antigen [human, blood, Genomic DNA, 3068 nt]. ACCESSION S76830 NID g914303 KEYWORDS . SOURCE human blood. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3068) AUTHORS Iwamoto,S., Omi,T., Kajii,E. and Ikemoto,S. TITLE Genomic organization of the glycoprotein D gene: Duffy blood group Fya/Fyb alloantigen system is associated with a polymorphism at the 44-amino acid residue JOURNAL Blood 85 (3), 622-626 (1995) MEDLINE 95134891 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 163440] from the original journal article. This sequence comes from Fig. 2B. FEATURES Location/Qualifiers source 1..3068 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1075..2091 /gene="glycoprotein D, gpFy" CDS 1075..2091 /gene="glycoprotein D, gpFy" /note="Duffy group antigen; This sequence comes from Fig. 2B; gpFy" /codon_start=1 /product="glycoprotein D" /db_xref="PID:g914304" /translation="MASSGYVLQAELSPSTENSSQLDFEDVWNSSYGVNDSFPDGDYG ANLEAAAPCHSCNLLDDSALPFFILTSVLGILASSTVLFMLFRPLFRWQLCPGWPVLA QLAVGSALFSIVVPVLAPGLGSTRSSALCSLGYCVWYGSAFAQALLLGCHASLGHRLG AGQVPGLTLGLTVGIWGVAALLTLPVTLASGASGGLCTLIYSTELKALQATHTVACLA IFVLLPLGLFGAKGLKKALGMGPGPWMNILWAWFIFWWPHGVVLGLDFLVRSKLLLLS TCLAQQALDLLLNLAEALAILHCVATPLLLALFCHQATRTLLPSLPLPEGWSSHLDTL GSKS" BASE COUNT 585 a 908 c 724 g 851 t ORIGIN 1 gagctcacgc ccacgtgcac acacccctca gttgggacag agttgaccac caccaccttt 61 ctcccaaaca catggctttt gaactgcctt tccttggatc cagttcaagg ggatggagga 121 gcagtgagag tcagccgccc ttccactcca atttcccagc acctccctta tctctgcctc 181 acaagtcacc cagcccccct ctcttccttc cttgtgcttg aagaatctct ccttgctgga 241 aagccccctg ttttctcaat ctccctttcc acttcggtaa aatctctact tgctggaaag 301 ccccctgttt tctcaatctc cctttccact tcggtaaaat gcccactttc tggtccccac 361 ctttttcctg agtgtagtcc caaccagcca aatccaacct caaaacagga agacccaagg 421 ccagtgaccc ccataggcct gaggcttgtg caggcagtgg gcgtggggta aggcttcctg 481 atgccccctg tccctgccca gaacctgatg gccctcatta gtccttggct cttatcttgg 541 aagcacaggc gctgacagcc gtcccagccc ttctgtctgc gggcctgaac caaacggtgc 601 catggggaac tgtctgcaca gggtgagtat ggggccaggc cccagagtcc cttatcccta 661 tgcccctcat ttcccctgct gtttgcccct cagtctttat atctcttcct tttcctcctc 721 atcttttctc ccttcctgct tttttcctct tccttcaaag tctttttcct tctctccttc 781 ctatgctagc ctcctagctc cctcttgtgt ccctcccttt gcctttgagt cagttccatc 841 ctggtctctt ggtgcctttc cttctgacct tgcactgctc ctccagcccc agctgccctg 901 gcttccccag gactgttcct gctccggctc ttcaggctcc ctgctttgtc cttttccact 961 gtccgcactg catctgactc ctgcagagac cttgttctcc cacccgacct tcctctctgt 1021 cctcccctcc cacctgcccc tcaattccca ggagactctt ccggtgtaac tctgatggcc 1081 tcctctgggt atgtcctcca ggcggagctc tccccctcaa ctgagaactc aagtcagctg 1141 gacttcgaag atgtatggaa ttcttcctat ggtgtgaatg attccttccc agatggagac 1201 tatggtgcca acctggaagc agctgccccc tgccactcct gtaacctgct ggatgactct 1261 gcactgccct tcttcatcct caccagtgtc ctgggtatcc tagctagcag cactgtcctc 1321 ttcatgcttt tcagacctct cttccgctgg cagctctgcc ctggctggcc tgtcctggca 1381 cagctggctg tgggcagtgc cctcttcagc attgtggtgc ccgtcttggc cccagggcta 1441 ggtagcactc gcagctctgc cctgtgtagc ctgggctact gtgtctggta tggctcagcc 1501 tttgcccagg ctttgctgct agggtgccat gcctccctgg gccacagact gggtgcaggc 1561 caggtcccag gcctcaccct ggggctcact gtgggaattt ggggagtggc tgccctactg 1621 acactgcctg tcaccctggc cagtggtgct tctggtggac tctgcaccct gatatacagc 1681 acggagctga aggctttgca ggccacacac actgtagcct gtcttgccat ctttgtcttg 1741 ttgccattgg gtttgtttgg agccaagggg ctgaagaagg cattgggtat ggggccaggc 1801 ccctggatga atatcctgtg ggcctggttt attttctggt ggcctcatgg ggtggttcta 1861 ggactggatt tcctggtgag gtccaagctg ttgctgttgt caacatgtct ggcccagcag 1921 gctctggacc tgctgctgaa cctggcagaa gccctggcaa ttttgcactg tgtggctacg 1981 cccctgctcc tcgccctatt ctgccaccag gccacccgca ccctcttgcc ctctctgccc 2041 ctccctgaag gatggtcttc tcatctggac acccttggaa gcaaatccta gttctcttcc 2101 cacctgtcaa cctgaattaa agtctacact gcctttgtga agcgggtggt ttcttatttt 2161 gtctggggag aagaaggaga atggagagag agacattttt atgtcagact ttcttgccag 2221 tgtctgcttc tatagctggc ttgggaagaa ggtgaatgat gaataaatac cctcagggta 2281 cacagatgtt ctcttgaggt gtggggtcac ggccatctca agggagaaga gaagaggaac 2341 tagagcatga ggggagtcat taaaccaaaa aaaacagaag ggatggctta gctggaaaaa 2401 aagctgttct gggaagcaaa tggaatagga actcaaactg agagataaac agtgaagagt 2461 gatgacaaag cccagagcaa taccacctcc ccctgtccaa cctgcccagc ctctgtcttc 2521 tgtctcctct ctggctttgt ttagtgatta ggacagtggt ggggaaggtg aaagaagcat 2581 cccaggggat gttactcagt tcagggaaca tatcaaggta atttaaaaag ccacttcctg 2641 ggagtcatct ctcccaggtt cctcagcatg acctgaatgt gtgtgtgtgc gtgtgtgtgt 2701 gtgtgtgtac acatctgttt ctcgatctgt tagaatctac ctttatgtta gatgtatgca 2761 tgtaaaaaca tatgtccacc catgagcttg catctctgtc agcacctgaa ctgcgacacc 2821 tgtgcgtgtg cactgacttt tctcaggacc caaaccccca ctcaattctg cactcatccc 2881 tgttcacagg atatagaatc gggatttatg actcactcct tacccaaatg agttttcttt 2941 accctggttt ttaagcctag tcttttctgt gtaggatgtg tggagggaag aaaagatcaa 3001 gaagttgtga ggggtggaga aacttgaagg gggaggccct gatttgattc atcttctgct 3061 tggaattc // LOCUS S78653 2416 bp DNA PRI 10-JUL-1992 DEFINITION mrg=mas-related [human, Genomic, 2416 nt]. ACCESSION S78653 NID g244209 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2416) AUTHORS Monnot,C., Weber,V., Stinnakre,J., Bihoreau,C., Teutsch,B., Corvol,P. and Clauser,E. TITLE Cloning and functional characterization of a novel mas-related gene, modulating intracellular angiotensin II actions JOURNAL Mol. Endocrinol. 5 (10), 1477-1487 (1991) MEDLINE 92130997 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 78653] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..2416 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2416 /note="mas-related" /gene="mrg" CDS 997..2133 /gene="mrg" /note="mas product homolog modulating intracellular angiotensin II actions; This sequence comes from Fig. 1" /codon_start=1 /db_xref="PID:g244210" /translation="MVWGKICWFSQRAGWTVFAESQISLSCSLCLHSGDQEAQNPNLV SQLCGVFLQNETNETIHMQMSMAVGQQALPLNIIAPKAVLVSLCGVLLNGTVFWLLCC GATNPYMVYILHLVAADVIYLCCSAVGFLQVTLLTYHGVVFFIPDFLAILSPFSFEVC LCLLVAISTERCVCVLFPIWYRCHRPKYTSNVVCTLIWGLPFCINIVKSLFLTYWKHV KACVIFLKLSGLFHAILSLVMCVSSLTLLIRFLCCSQQQKATRVYAVVQISAPMFLLW ALPLSVAPLITDFKMFVTTSYLISLFLIINSSANPIIYFFVGSLRKKRLKESLRVILQ RALADKPEVGRNKKAAGIDPMEQPHSTQHVENLLPREHRVDVET" BASE COUNT 600 a 605 c 583 g 628 t ORIGIN 1 ttttgtattt gttgcaccct aagtctgttc atttccttct cctcagctga catttggagc 61 atagcagtcg atgatgccca cacagacact gcctgagact cacgcccctg gagaaacgca 121 gatttcctta ttttccaggt caagtcctgc cagccataga aaggacttct ttggtgccaa 181 ctgctgtgaa atgcctgcct tggaaatctc agtgctccct tgtacctgtc tgagcccagg 241 gaaatgccat actgtggcac tgctgcatcc tgtatggcta cccaaggatg cccaggactg 301 gtttgaaaga gatgagacat ggccaggtgc gtggctcacg cttgtaatcc agcactttgg 361 gaggtcaagg cagtggatca caaggtcaga gttgagacca gccaggccaa tatggtgaaa 421 accccatctc tactaaaaat acaaaaaatt agccgggcaa tggtggtggg tgcctgtagt 481 tccagctagt caggaggccg aggcaggaga atcgcttgaa cctggaaggt ggaggttcca 541 gtgagctgag atcgcgccac tgcactccag cctgggtgac agagtgagac tccaactcaa 601 aaaaaaaaaa aaaaaagaga tgagacacta gtgtctcatg agtagaacct ggaccagaca 661 caaatctcca ttcccaatgt ttagtgcctc attagtgccc aacaacaaga tattgggtct 721 atgtgggtag gcctggggca tcctgtacaa caggagatgt gttaggggag ggagaacaga 781 tcacaaattc atggagagct atttgcagag cagatactcc catccactct gatatgtagt 841 taatgttcag ctgttcctaa aaagcacacc caacaatggg tgttctattc cagcctagga 901 aaatgtagag gcaaggggtc tgaggccaga ggacaccact agatggacca ctgctcctga 961 ctgtgatgtt gtggcccact caggtcccag caccccatgg tctgggggaa aatttgctgg 1021 ttcagccaga gggctggatg gacagtgttt gctgagtcac agatatctct ctcatgtagc 1081 ctttgtctcc acagtggtga ccaggaggca cagaacccaa acctggtatc tcagctctgt 1141 ggcgtctttc ttcaaaatga gacgaatgaa accatacata tgcagatgag catggcagtg 1201 ggacagcagg ccctgccctt gaatatcatt gcccccaagg ctgtgctggt ctccctctgt 1261 ggggtcttat tgaatggcac tgtcttctgg ctgctttgct gtggggccac gaatccctac 1321 atggtataca tcctccacct ggtcgctgct gacgtgatct atctttgctg ctcggcagtg 1381 gggttcttac aggtgactct gctaacttat catggagtcg tgttttttat ccctgatttc 1441 ctggccatat tgtctccctt ctcctttgag gtgtgtctct gtctcctggt ggccatcagc 1501 acagagcggt gtgtgtgtgt cctcttcccc atctggtaca gatgccaccg cccaaaatac 1561 acatctaatg ttgtctgcac cctcatctgg ggcctgcctt tttgcatcaa catagtaaaa 1621 tcacttttcc taacttactg gaaacatgta aaggcatgtg tcatatttct aaagctttct 1681 gggctcttcc atgctatcct ttcacttgtg atgtgtgtgt cgagtctgac tctactcatt 1741 agattcctgt gctgctccca gcagcaaaag gccaccaggg tctatgcggt ggtgcagatc 1801 tcggccccca tgttcctact ctgggcccta cccctgagcg tggcacccct cataacagat 1861 ttcaaaatgt ttgtcaccac ctcctattta atttccttgt tcctcattat aaacagcagc 1921 gccaacccta tcatttattt ctttgtgggg agcctcagaa agaaaaggct gaaggaatct 1981 ctcagagtga ttctccaacg ggcgttagca gataagccag aggtggggag gaacaaaaag 2041 gcagctggca tcgacccaat ggagcaacca cactctactc agcatgtgga gaaccttctt 2101 cccagggagc acagggtcga tgtggaaaca taatttccca catctgagct ggggaattgt 2161 acacatagta acccagcctg ttctgcatca taaggctgct gcatcaaatc aatgctttat 2221 tctaatcaag ttcagctttc atggactttc aaaacaaccc cttgctgttt gtggttggaa 2281 gagacattaa cttccttcct aggcagtaag cccagtttga atgtgctcca gttccaacga 2341 tgaggggaat gggacccagt gagactttcc tggtacctgt ggaatccaaa taaagaccat 2401 acaaaggcat gaattc // LOCUS S81191 179 bp DNA PRI 29-MAY-1996 DEFINITION Rh50=Rh blood group null antigen complex glycoprotein subunit [human, Rh-deficiency patient T.B., Genomic Mutant, 179 nt]. ACCESSION S81191 NID g1336717 KEYWORDS . SOURCE human Rh-deficiency patient T.B. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 179) AUTHORS Cherif-Zahar,B., Raynal,V., Gane,P., Mattei,M.G., Bailly,P., Gibbs,B., Colin,Y. and Cartron,J.P. TITLE Candidate gene acting as a suppressor of the RH locus in most cases of Rh-deficiency JOURNAL Nature Genet. 12 (2), 168-173 (1996) MEDLINE 96154189 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 175999] from the original journal article. This sequence comes from Fig. 2C. Map location: 6p11-6p21.1. COMMENT A1086 deletion causes frameshift and premature termination. FEATURES Location/Qualifiers source 1..179 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..81 /partial /note="Rh blood group null antigen complex glycoprotein subunit" /gene="Rh50" CDS 1..81 /partial /gene="Rh50" /note="Rh blood group null antigen complex glycoprotein subunit; This sequence comes from Fig. 2C. Map location 6p11-21.1" /codon_start=1 /db_xref="PID:g1336718" /translation="MGASNTSMAMQALHWVPLSEQQLLEV" BASE COUNT 42 a 41 c 48 g 48 t ORIGIN 1 atgggcgcct ccaacacgtc tatggccatg caggcgctgc actgggttcc tctatcggaa 61 cagcagttgt tggaggtctg atgacaggtt taattctaaa gttgcctctc tggggacagc 121 catctgacca gaactgctat gatgattctg tttattggaa ggtccctaag acgagataa // LOCUS S81950 1312 bp DNA PRI 11-FEB-1997 DEFINITION P2 purinoceptor subtype Y1 [human, Genomic, 1312 nt]. ACCESSION S81950 NID g1839438 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1312) AUTHORS Janssens,R., Communi,D., Pirotton,S., Samson,M., Parmentier,M. and Boeynaems,J.M. TITLE Cloning and tissue distribution of the human P2Y1 receptor JOURNAL Biochem. Biophys. Res. Commun. 221 (3), 588-593 (1996) MEDLINE 96205320 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 177586] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1312 /organism="Homo sapiens" /db_xref="taxon:9606" gene 125..1246 /gene="P2 purinoceptor subtype Y1, P2Y1" CDS 125..1246 /gene="P2 purinoceptor subtype Y1, P2Y1" /note="This sequence comes from Fig. 1; P2Y1" /codon_start=1 /product="P2 purinoceptor subtype Y1" /db_xref="PID:g1839439" /translation="MTEVLWPAVPNGTDAAFLAGPGSSWGNSTVASTAAVSSSFKCAL TKTGFQFYYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVL TLPALIFYYFNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGVVYPLKSL GRLKKKNAICISVLVWLIVVVAISPILFYSGTGVRKNKTITCYDTTSDEYLRSYFIYS MCTTVAMFCVPLVLILGCYGLIVRALIYKDLDNSPLRRKSIYLVIIVLTVFAVSYIPF HVMKTMNLRARLDFQTPAMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRR LSRATRKASRRSEANLQSKSEDMTLNILPEFKQNGDTSL" BASE COUNT 282 a 361 c 324 g 345 t ORIGIN 1 ggatccagtt cgcctgctcc cttccgctcg ctggcttttc cgatgcttgc tgcgcccctg 61 gccgccgctg ccctctcgcc gcctcctacc cctcggagcc gccgcctaag tcgaggagga 121 gagaatgacc gaggtgctgt ggccggctgt ccccaacggg acggacgctg ccttcctggc 181 cggtccgggt tcgtcctggg ggaacagcac ggtcgcctcc actgccgccg tctcctcgtc 241 gttcaaatgc gccttgacca agacgggctt ccagttttac tacctgccgg ctgtctacat 301 cttggtattc atcatcggct tcctgggcaa cagcgtggcc atctggatgt tcgtcttcca 361 catgaagccc tggagcggca tctccgtgta catgttcaat ttggctctgg ccgacttctt 421 gtacgtgctg actctgccag ccctgatctt ctactacttc aataaaacag actggatctt 481 cggggatgcc atgtgtaaac tgcagaggtt catctttcat gtgaacctct atggcagcat 541 cttgtttctg acatgcatca gtgcccaccg gtacagcggt gtggtgtacc ccctcaagtc 601 cctgggccgg ctcaaaaaga agaatgcgat ctgtatcagc gtgctggtgt ggctcattgt 661 ggtggtggcg atctccccca tcctcttcta ctcaggtacc ggggtccgca aaaacaaaac 721 catcacctgt tacgacacca cctcagacga gtacctgcga agttatttca tctacagcat 781 gtgcacgacc gtggccatgt tctgtgtccc cttggtgctg attctgggct gttacggatt 841 aattgtgaga gctttgattt acaaagatct ggacaactct cctctgagga gaaaatcgat 901 ttacctggta atcattgtac tgactgtttt tgctgtgtct tacatccctt tccatgtgat 961 gaaaacgatg aacttgaggg cccggcttga ttttcagacc ccagcaatgt gtgctttcaa 1021 tgacagggtt tatgccacgt atcaggtgac aagaggtcta gcaagtctca acagttgtgt 1081 ggaccccatt ctctatttct tggcgggaga tactttcaga aggagactct cccgagccac 1141 aaggaaagct tctagaagaa gtgaggcaaa tttgcaatcc aagagtgaag acatgaccct 1201 caatatttta cctgagttca agcagaatgg agatacaagc ctgtgaaggc acaagaatct 1261 ccaaacacct ctctgttgta atatggtagg atgcttaaca gaatcaagta ct //